This article discusses and extends statistical models to jointly analyse the spatial variation of rates of several diseases with common risk factors. We start with a review of methods for separate analyses of diseases, then move to ecological regression approaches, where the rates from one of the diseases enter as surrogate covariates for exposure. Finally, we propose a general framework for jointly modelling the variation of two or more diseases, some of which share latent spatial fields, but with possibly different risk gradients. In our application, we consider mortality data on oral, oesophagus, larynx and lung cancers for males in Germany, which all share smoking as a common risk factor. Furthermore, the first three cancers are also known to be related to excessive alcohol consumption. An empirical comparison of the different models based on a formal model criterion as well as on the posterior precision of the relative risk estimates strongly suggests that the joint modelling approach is a useful and valuable extension over individual analyses.
Ecological studies aim to analyse the variation of disease risk in relation to exposure variables that are measured at an area unit level. In practice it is rarely possible to use the exposure variables themselves, either because the corresponding data are not available or because the causes of the disease are not fully understood. It is therefore quite common to use crude proxies of the real exposure to the disease in question. These proxies are rarely able to explain the disease variation and hence additional area level random effects are introduced to account for the residual variation. In this paper we investigate the possibility to model the effect of ecological covariates non-parametrically, with and without additional random effects for the residual spatial variation. We illustrate the issues arising through analyses of simulated and real data on larynx cancer mortality in Germany, during the years of 1986 to 1990, where we use the corresponding lung cancer rates as a proxy for smoking consumption.
The occurrence of influenza epidemics during winters, in the northern hemisphere countries, is known to be associated with observed excess mortality for all causes. A large variety of methods have been developed in order to estimate, from weekly or monthly mortality time series, the number of influenza-associated deaths in each season. The present work focus on the group of methods characterised by fitting statistical models to interrupted mortality time series. The study objective is to find a common ground between these methods in order to describe and compare them. They are unified in a single class, being categorised according to three main parameters: the model used to fit the interrupted time series and obtain a baseline, the a priori chosen type of periods used to estimate the influenza epidemic periods and the procedure used to fit the model to the time series (iterative or non-iterative). This generalisation led quite naturally to the construction of a set of user friendly R-routines, package flubase, implementing all these models. These routines were applied to data on about 20 years of weekly Portuguese number of deaths by pneumonia and influenza showing that, in this case, the parameter that had the highest impact on influenza-associated deaths estimates was the a priori chosen type of period used.
Logistic Regression Models have been widely used in many areas of research, namely in health sciences, to study risk factors associated to diseases. Many population based surveys, such as Demographic and Health Survey (DHS), are constructed assuming complex sampling, i.e., probabilistic, stratified and multistage sampling, with unequal weights in the observations; this complex design must be taken into account in order to have reliable results. However, this very relevant issue usually is not well analyzed in the literature. The aim of the study is to specify the logistic regression model with complex sample design, and to demonstrate how to estimate it using the R software survey package. More specifically, we used Mozambique Demographic Health and Survey data 2011 (MDHS 2011) to illustrate how to correct for the effect of sample design in the particular case of estimating the risk factors associated to the probability of using mosquito bed nets. Our results show that in the presence of complex sampling, appropriate methods must be used both in descriptive and inferential statistics.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.