Summary The analysis of case–control studies with several disease subtypes is increasingly common, e.g. in cancer epidemiology. For matched designs, a natural strategy is based on a stratified conditional logistic regression model. Then, to account for the potential homogeneity among disease subtypes, we adapt the ideas of data shared lasso, which has been recently proposed for the estimation of stratified regression models. For unmatched designs, we compare two standard methods based on $L_1$-norm penalized multinomial logistic regression. We describe formal connections between these two approaches, from which practical guidance can be derived. We show that one of these approaches, which is based on a symmetric formulation of the multinomial logistic regression model, actually reduces to a data shared lasso version of the other. Consequently, the relative performance of the two approaches critically depends on the level of homogeneity that exists among disease subtypes: more precisely, when homogeneity is moderate to high, the non-symmetric formulation with controls as the reference is not recommended. Empirical results obtained from synthetic data are presented, which confirm the benefit of properly accounting for potential homogeneity under both matched and unmatched designs, in terms of estimation and prediction accuracy, variable selection and identification of heterogeneities. We also present preliminary results from the analysis of a case–control study nested within the EPIC (European Prospective Investigation into Cancer and nutrition) cohort, where the objective is to identify metabolites associated with the occurrence of subtypes of breast cancer.
Graphical models are used in many applications such as medical diagnostics and computer security. Increasingly often, the estimation of such models has to be performed on several predefined strata of the whole population. For instance, in epidemiology and clinical research, strata are often defined according to age, gender, treatment, or disease type. In this article, we propose new approaches dedicated to the estimation of binary graphical models on such strata. These approaches are implemented by combining well-known methods that have been developed in the context of a single binary graphical model, with penalties encouraging structured sparsity, which have recently been shown to be appropriate when dealing with stratified data. Empirical comparisons on synthetic data highlight that our approaches generally outperform its competitors. We present an application of the approach to study associations among the injuries suffered by victims of road accidents according to road user type.
Background Vehicle accidents are still a heavy social burden despite improvements due the latest technologies and policies. To pursue the trend of decrease, having a more detailed view and understanding of the injury patterns would contribute to inform both the rescue team to optimize victim’s management and policymakers in order for them to tackle at best this issue. Methods Two complementary analyses of injury associations were performed, one using a biomechanical classification and the other an anatomic one, computed on data stratified by car accident type (lateral or frontal). Our objective is to understand whether these two categories of crash lead to similar or heterogeneous injury association patterns, and analyze these findings from an impact mechanics point of view. Indeed, having an improved understanding of the injury mechanisms would facilitate their diagnosis and prevention. Results While each type of accident possesses its own injury profile, most injury associations are found for both types. Injuries such as clavicle and rib fractures were identified as involved in a high number of associations. Several associations between fractures and blood vessel injuries were found. Conclusions The results suggests three main conclusions: (i) Injury associations are rather independent from crash characteristics, (ii) Clavicle and rib fractures are typical of poly-traumatized victims, (iii) Certain fractures can be used to early detect victims at higher risk of hemorrhage. Overall, this study provide paramedics and doctors with data to orientate them toward a faster and more appropriate decision. Moreover, this exploratory work revealed the potential that injury association analyses have to inform policy-making and issue recommendations to decrease road accident mortality and morbidity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.