Search CORE

35 research outputs found

Rejoinder of: Statistical analysis of an archeological find

Author: Feuerverger Andrey
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2008
Field of study

Rejoinder of ``Statistical analysis of an archeological find'' [arXiv:0804.0079]Comment: Published in at http://dx.doi.org/10.1214/08-AOAS99REJ the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

CiteSeerX

Crossref

Discussion of: Brownian distance covariance

Author: Feuerverger Andrey
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 05/10/2010
Field of study

Discussion on "Brownian distance covariance" by G\'{a}bor J. Sz\'{e}kely, Maria L. Rizzo [arXiv:1010.0297]Comment: Published in at http://dx.doi.org/10.1214/09-AOAS312D the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

Crossref

Statistical Significance of the Netflix Challenge

Author: Feuerverger Andrey
He Yu
Khatri Shashi
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 24/07/2012
Field of study

Inspired by the legacy of the Netflix contest, we provide an overview of what has been learned---from our own efforts, and those of others---concerning the problems of collaborative filtering and recommender systems. The data set consists of about 100 million movie ratings (from 1 to 5 stars) involving some 480 thousand users and some 18 thousand movies; the associated ratings matrix is about 99% sparse. The goal is to predict ratings that users will give to movies; systems which can do this accurately have significant commercial applications, particularly on the world wide web. We discuss, in some detail, approaches to "baseline" modeling, singular value decomposition (SVD), as well as kNN (nearest neighbor) and neural network models; temporal effects, cross-validation issues, ensemble methods and other considerations are discussed as well. We compare existing models in a search for new models, and also discuss the mission-critical issues of penalization and parameter shrinkage which arise when the dimensions of a parameter space reaches into the millions. Although much work on such problems has been carried out by the computer science and machine learning communities, our goal here is to address a statistical audience, and to provide a primarily statistical treatment of the lessons that have been learned from this remarkable set of data.Comment: Published in at http://dx.doi.org/10.1214/11-STS368 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

Crossref

Using statistical smoothing to date medieval manuscripts

Author: Feuerverger Andrey
Gervers Michael
Hall Peter
Tilahun Gelila
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 16/05/2008
Field of study

We discuss the use of multivariate kernel smoothing methods to date manuscripts dating from the 11th to the 15th centuries, in the English county of Essex. The dataset consists of some 3300 dated and 5000 undated manuscripts, and the former are used as a training sample for imputing dates for the latter. It is assumed that two manuscripts that are ``close'', in a sense that may be defined by a vector of measures of distance for documents, will have close dates. Using this approach, statistical ideas are used to assess ``similarity'', by smoothing among distance measures, and thus to estimate dates for the 5000 undated manuscripts by reference to the dated ones.Comment: Published in at http://dx.doi.org/10.1214/193940307000000248 the IMS Collections (http://www.imstat.org/publications/imscollections.htm) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

Crossref

Distance Measures and Smoothing Methodology for Imputing Features of Documents

Author: Andrey Feuerverger
Berry M. W.
Berry M. W.
Gelila Tilahun
Michael Gervers
Peter Hall
Quang P. X.
Reinert M.
Simon H.
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

On Some ECF Procedures for Testing Independence

Author: Andrey Feuerverger
Publication venue: Springer Netherlands
Publication date: 01/01/1987
Field of study

Crossref

A bound for estimation in nonlinear time series models by independence testing methods

Author: Feuerverger Andrey
Publication venue
Publication date
Field of study

The bound (19) is derived for the asymptotic efficiency of estimates of the parameter [beta] in the nonlinear time series model (1) when the estimation is based on testing for independence of the estimated residuals from the series past. For intrinsically linear time series models efficiency is attainable regardless of the distribution of the error terms.time series analysis nonlinear time series models estimation asymptotic efficiency testing independence

Research Papers in Economics