In this note we discuss (Gaussian) intrinsic conditional autoregressive (CAR) models for disconnected graphs, with the aim of providing practical guidelines for how these models should be defined, scaled and implemented. We show how these suggestions can be implemented in two examples on disease mapping.
The analysis of large data sets of standardized mortality ratios (SMRs), obtained by collecting observed and expected disease counts in a map of contiguous regions, is a first step in descriptive epidemiology to detect potential environmental risk factors. A common situation arises when counts are collected in small areas, that is, where the expected count is very low, and disease risks underlying the map are spatially correlated. Traditional p-value-based methods, which control the false discovery rate (FDR) by means of Poisson p-values, might achieve small sensitivity in identifying risk in small areas. This problem is the focus of the present work, where a Bayesian approach which performs a test to evaluate the null hypothesis of no risk over each SMR and controls the posterior FDR is proposed. A Bayesian hierarchical model including spatial random effects to allow for extra-Poisson variability is implemented providing estimates of the posterior probabilities that the null hypothesis of absence of risk is true. By means of such posterior probabilities, an estimate of the posterior FDR conditional on the data can be computed. A conservative estimation is needed to achieve the control which is checked by simulation. The availability of this estimate allows the practitioner to determine nonarbitrary FDR-based selection rules to identify high-risk areas according to a preset FDR level. Sensitivity and specificity of FDR-based rules are studied via simulation and a comparison with p-value-based rules is also shown. A real data set is analyzed using rules based on several FDR levels.
Bayesian P-splines assume an intrinsic Gaussian Markov random field prior on the spline coefficients, conditional on a precision hyper-parameter τ . Prior elicitation of τ is difficult.To overcome this issue we aim to building priors on an interpretable property of the model, indicating the complexity of the smooth function to be estimated. Following this idea, we propose Penalized Complexity (PC) priors for the number of effective degrees of freedom.We present the general ideas behind the construction of these new PC priors, describe their properties and show how to implement them in P-splines for Gaussian data.
Investigating the relationship between vegetation cover and substrate typologies is important for habitat conservation. To study these relationships, common practice in modern ecological surveys is to collect information regarding vegetation cover and substrate typology over fine regular lattices, as derived from digital ground photos. Information on substrate typologies is often available as compositional measures, e.g., the area proportion occupied by a certain substrate. Two primary issues are of interest for ecologists: first, how much substrate typologies differ in terms of relative suitability for vegetation cover and, second, whether suitability varies over time. This paper develops a novel procedure for managing compositional covariates within a Bayesian hierarchical framework to effectively address the aforementioned issues. A spatio-temporal model is adopted to estimate the temporal pattern characterizing substrate relative suitability for vegetation cover and, at the same time, to account for spatio-temporal correlation. Relative suitability is modeled by time-varying regression coefficients, and spatial, temporal and spatio-temporal random effects are modeled using Gaussian Markov Random Field models.
Lack of independence in the residuals from linear regression motivates the use of random effect models in many applied fields. We start from the one-way anova model and extend it to a general class of one-factor Bayesian mixed models, discussing several correlation structures for the within group residuals. All the considered group models are parametrized in terms of a single correlation (hyper-)parameter, controlling the shrinkage towards the case of independent residuals (iid). We derive a penalized complexity (PC) prior for the correlation parameter of a generic group model. This prior has desirable properties from a practical point of view: i) it ensures appropriate shrinkage to the iid case; ii) it depends on a scaling parameter whose choice only requires a prior guess on the proportion of total variance explained by the grouping factor; iii) it is defined on a distance scale common to all group models, thus the scaling parameter can be chosen in the same manner regardless the adopted group model. We show the benefit of using these PC priors in a case study in community ecology where different group models are compared.
Summary
Adjusting for an unmeasured confounder is generally an intractable problem, but in the spatial setting it may be possible under certain conditions. We derive necessary conditions on the coherence between the exposure and the unmeasured confounder that ensure the effect of exposure is estimable. We specify our model and assumptions in the spectral domain to allow for different degrees of confounding at different spatial resolutions. One assumption that ensures identifiability is that confounding present at global scales dissipates at local scales. We show that this assumption in the spectral domain is equivalent to adjusting for global-scale confounding in the spatial domain by adding a spatially smoothed version of the exposure to the mean of the response variable. Within this general framework, we propose a sequence of confounder adjustment methods that range from parametric adjustments based on the Matérn coherence function to more robust semiparametric methods that use smoothing splines. These ideas are applied to areal and geostatistical data for both simulated and real datasets.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.