Semi-parametric inference for the means of heavy-tailed distributions

dc.contributor.authorTaddy, Matt
dc.contributor.authorHEDIBERT FREITAS LOPES
dc.contributor.authorGoldberg, David
dc.contributor.authorGardner, Matt
dc.coverage.cidadeSão Paulopt_BR
dc.coverage.paisBrasilpt_BR
dc.creatorTaddy, Matt
dc.creatorGoldberg, David
dc.creatorGardner, Matt
dc.date.accessioned2023-07-20T16:36:23Z
dc.date.available2023-07-20T16:36:23Z
dc.date.issued2016
dc.description.abstractHeavy tailed distributions present a tough setting for inference. They are also common in industrial applications, particularly with Internet transaction datasets, and machine learners often analyze such data without considering the biases and risks associated with the misuse of standard tools. This article outlines a procedure for inference about the (possibly conditional) mean of a heavy tailed distribution that combines nonparametric inference for the bulk of the support with parametric inference – motivated from extreme value theory – for the heavy tail. We are able to derive analytic posterior conditional means and variances for the expected value of a heavy tailed distributivo. We also introduce a simple and novel independence Metropolis Hastings algorithm that samples from the distribution for tail parameters via small adjustments to a parametric bootstrap, and through this algorithm are able to provide comparisons between our framework and frequentist semiparametric inference. We also provide a modeling extension that shrinks tails across distributions to an overall background tail. We illustrate on two examples: treatment effect estimation on a set of 72 A/B experiments, and the fitting of regression trees for prediction of user spending. Both use data from tens of millions of users of eBay.com.
dc.description.otherHeavy tailed distributions present a tough setting for inference. They are also common in indus trial applications, particularly with Internet trans action datasets, and machine learners often an alyze such data without considering the biases and risks associated with the misuse of standard tools. This article outlines a procedure for infer ence about the (possibly conditional) mean of a heavy tailed distribution that combines nonpara metric inference for the bulk of the support with parametric inference – motivated from extreme value theory – for the heavy tail. We are able to derive analytic posterior conditional means and variances for the expected value of a heavy tailed distribution. We also introduce a simple and novel independence Metropolis Hastings al gorithm that samples from the distribution for tail parameters via small adjustments to a paramet ric bootstrap, and through this algorithm are able to provide comparisons between our framework and frequentist semiparametric inference. We also provide a modeling extension that shrinks tails across distributions to an overall background tail. We illustrate on two examples: treatment ef fect estimation on a set of 72 A/B experiments, and the fitting of regression trees for prediction of user spending. Both use data from tens of mil lions of users of eBay.com.pt_BR
dc.format.extent11 p.pt_BR
dc.format.mediumDigitalpt_BR
dc.identifier.issueBEWP 232/2016
dc.identifier.urihttps://repositorio.insper.edu.br/handle/11224/5896
dc.language.isoInglêspt_BR
dc.publisherInsperpt_BR
dc.relation.ispartofseriesInsper Working Paperpt_BR
dc.rights.licenseO INSPER E ESTE REPOSITÓRIO NÃO DETÊM OS DIREITOS DE USO E REPRODUÇÃO DOS CONTEÚDOS AQUI REGISTRADOS. É RESPONSABILIDADE DO USUÁRIO VERIFICAR OS USOS PERMITIDOS NA FONTE ORIGINAL, RESPEITANDO-SE OS DIREITOS DE AUTOR OU EDITORpt_BR
dc.titleSemi-parametric inference for the means of heavy-tailed distributionspt_BR
dc.typeworking paper
dspace.entity.typePublication
local.subject.cnpqCiências Exatas e da Terrapt_BR
local.typeWorking Paperpt_BR
relation.isAuthorOfPublication41f844cb-0e5a-4ef1-bb19-5ab1cec8e2ca
relation.isAuthorOfPublication.latestForDiscovery41f844cb-0e5a-4ef1-bb19-5ab1cec8e2ca

Arquivos

Pacote original

Agora exibindo 1 - 1 de 1
N/D
Nome:
BEWP_232_2016_Semi_parametric_inference_for_the_means_of_heavy_tailed_distributions_TC.pdf
Tamanho:
549.47 KB
Formato:
Adobe Portable Document Format