Land Use Regression (LUR) models have been used increasingly for modeling small-scale spatial variation in air pollution concentrations and estimating individual exposure for participants of cohort studies. Within the ESCAPE project, concentrations of PM(2.5), PM(2.5) absorbance, PM(10), and PM(coarse) were measured in 20 European study areas at 20 sites per area. GIS-derived predictor variables (e.g., traffic intensity, population, and land-use) were evaluated to model spatial variation of annual average concentrations for each study area. The median model explained variance (R(2)) was 71% for PM(2.5) (range across study areas 35-94%). Model R(2) was higher for PM(2.5) absorbance (median 89%, range 56-97%) and lower for PM(coarse) (median 68%, range 32- 81%). Models included between two and five predictor variables, with various traffic indicators as the most common predictors. Lower R(2) was related to small concentration variability or limited availability of predictor variables, especially traffic intensity. Cross validation R(2) results were on average 8-11% lower than model R(2). Careful selection of monitoring sites, examination of influential observations and skewed variable distributions were essential for developing stable LUR models. The final LUR models are used to estimate air pollution concentrations at the home addresses of participants in the health studies involved in ESCAPE.
h i g h l i g h t s < LUR models were developed in 36 study areas in Europe using a standardized approach. < NO 2 models explained a large fraction of concentration variability (median R 2 82%). < Local traffic intensity data were important predictors for LUR model development.
The study of time series models for count data has become a topic of special interest during the last years. However, while research on univariate time series for counts now flourishes, the literature on multivariate time series models for count data is notably more limited. In the present paper, a bivariate integer-valued autoregressive process of order 1 (BINAR(1)) is introduced. Emphasis is placed on models with bivariate Poisson and bivariate negative binomial innovations. We discuss properties of the BINAR(1) model and propose the method of conditional maximum likelihood for the estimation of its unknown parameters. Issues of diagnostics and forecasting are considered and predictions are produced by means of the conditional forecast distribution. Estimation uncertainty is accommodated by taking advantage of the asymptotic normality of maximum likelihood estimators and constructing appropriate confidence intervals for the fe-step-ahead conditional probability mass function. The proposed model is applied to a bivariate data series concerning daytime and nighttime road accidents in the Netherlands.
Land use regression models (LUR) frequently use leave-one-out-cross-validation (LOOCV) to assess model fit, but recent studies suggested that this may overestimate predictive ability in independent data sets. Our aim was to evaluate LUR models for nitrogen dioxide (NO2) and particulate matter (PM) components exploiting the high correlation between concentrations of PM metrics and NO2. LUR models have been developed for NO2, PM2.5 absorbance, and copper (Cu) in PM10 based on 20 sites in each of the 20 study areas of the ESCAPE project. Models were evaluated with LOOCV and "hold-out evaluation (HEV)" using the correlation of predicted NO2 or PM concentrations with measured NO2 concentrations at the 20 additional NO2 sites in each area. For NO2, PM2.5 absorbance and PM10 Cu, the median LOOCV R(2)s were 0.83, 0.81, and 0.76 whereas the median HEV R(2) were 0.52, 0.44, and 0.40. There was a positive association between the LOOCV R(2) and HEV R(2) for PM2.5 absorbance and PM10 Cu. Our results confirm that the predictive ability of LUR models based on relatively small training sets is overestimated by the LOOCV R(2)s. Nevertheless, in most areas LUR models still explained a substantial fraction of the variation of concentrations measured at independent sites.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.