A statistical model for predicting individual house prices and constructing a
house price index is proposed utilizing information regarding sale price, time
of sale and location (ZIP code). This model is composed of a fixed time effect
and a random ZIP (postal) code effect combined with an autoregressive
component. The former two components are applied to all home sales, while the
latter is applied only to homes sold repeatedly. The time effect can be
converted into a house price index. To evaluate the proposed model and the
resulting index, single-family home sales for twenty US metropolitan areas from
July 1985 through September 2004 are analyzed. The model is shown to have
better predictive abilities than the benchmark S&P/Case--Shiller model, which
is a repeat sales model, and a conventional mixed effects model. Finally, Los
Angeles, CA, is used to illustrate a historical housing market downturn.Comment: Published in at http://dx.doi.org/10.1214/10-AOAS380 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.