Abstract:Trajectory classification has become frequent in clinical research to understand the heterogeneity of individual trajectories. The standard classification model for trajectories assumes no between‐individual variance within groups. However, this assumption is often not appropriate, which may overestimate the error variance of the model, leading to a biased classification. Hence, two extensions of the standard classification model were developed through a mixed model. The first one considers an equal between‐in… Show more
“…Moreover, it also helps to pinpoint geographical units that could be interesting for future data collection efforts. Secondly, we made use of a newly developed clustering method [21, 22] that is able to assess the similarities between age-specific patterns of cervical cancer incidence, which have not been considered by previous studies.…”
Section: Discussionmentioning
confidence: 99%
“…The statistical method employed in the Clustering step to cluster registry-specific cervical cancer incidence data was a Poisson-regression-based CEM clustering algorithm,[21, 22] described in detail in Appendix S1 . Briefly, clusters of age-specific cervical cancer incidence were obtained by likelihood-based optimization under Poisson regression model.…”
Background
Context-specific cervical cancer epidemiological data are essential to derive local impact projections of cervical cancer preventive measures. However, these are not always available, in particular in low- and middle-income countries (LMICs), where impact projections are essential to plan cervical cancer control programs.
Methods and Findings
We developed a framework, hereafter named Footprinting, to approximate the sexual behavior, human papillomavirus (HPV) prevalence, and/or cervical cancer incidence data needed for impact projections. The framework was applied to a case study in India, the country with the highest expected cervical cancer burden but still limited access to cervical cancer prevention. With our Footprinting framework, we 1) identified clusters of Indian states with similar cervical cancer incidence patterns, 2) classified states without incidence data to the identified clusters based on similarity in sexual behavior data, 3) approximated missing cervical cancer incidence and HPV prevalence data based on available data within each cluster. Two main patterns of cervical cancer incidence, characterized by high and low incidence, were identified for 6 and 8 Indian states, respectively. States in the low-incidence cluster were characterized by less sexual activity with non-regular partners in men and earlier sexual debut in women. Based on these patterns, all 11 Indian states with missing cervical cancer incidence data were classified to the low-incidence cluster. Finally, missing data on cervical cancer incidence and HPV prevalence were approximated based on the mean of the available data within each cluster.
Conclusions
With the Footprinting framework, we enabled approximation of missing cervical cancer epidemiological data and derivation of context-specific impact projection of cervical cancer prevention measures, assisting public health decisions on cervical cancer prevention in India and other LMICs.
“…Moreover, it also helps to pinpoint geographical units that could be interesting for future data collection efforts. Secondly, we made use of a newly developed clustering method [21, 22] that is able to assess the similarities between age-specific patterns of cervical cancer incidence, which have not been considered by previous studies.…”
Section: Discussionmentioning
confidence: 99%
“…The statistical method employed in the Clustering step to cluster registry-specific cervical cancer incidence data was a Poisson-regression-based CEM clustering algorithm,[21, 22] described in detail in Appendix S1 . Briefly, clusters of age-specific cervical cancer incidence were obtained by likelihood-based optimization under Poisson regression model.…”
Background
Context-specific cervical cancer epidemiological data are essential to derive local impact projections of cervical cancer preventive measures. However, these are not always available, in particular in low- and middle-income countries (LMICs), where impact projections are essential to plan cervical cancer control programs.
Methods and Findings
We developed a framework, hereafter named Footprinting, to approximate the sexual behavior, human papillomavirus (HPV) prevalence, and/or cervical cancer incidence data needed for impact projections. The framework was applied to a case study in India, the country with the highest expected cervical cancer burden but still limited access to cervical cancer prevention. With our Footprinting framework, we 1) identified clusters of Indian states with similar cervical cancer incidence patterns, 2) classified states without incidence data to the identified clusters based on similarity in sexual behavior data, 3) approximated missing cervical cancer incidence and HPV prevalence data based on available data within each cluster. Two main patterns of cervical cancer incidence, characterized by high and low incidence, were identified for 6 and 8 Indian states, respectively. States in the low-incidence cluster were characterized by less sexual activity with non-regular partners in men and earlier sexual debut in women. Based on these patterns, all 11 Indian states with missing cervical cancer incidence data were classified to the low-incidence cluster. Finally, missing data on cervical cancer incidence and HPV prevalence were approximated based on the mean of the available data within each cluster.
Conclusions
With the Footprinting framework, we enabled approximation of missing cervical cancer epidemiological data and derivation of context-specific impact projection of cervical cancer prevention measures, assisting public health decisions on cervical cancer prevention in India and other LMICs.
“…This method is classically used for models without random effects, but it was evaluated and validated for mixed effects models 5 . In the M step, the estimates of and can be obtained using classical algorithms for mixed effects models, such as the ones included in the lme function from the nlme R package 10 (R version 3.6.2).…”
Section: The Mixed Effects Classification Modelmentioning
confidence: 99%
“…The classification model applied to the dataset was defined as the model used for data simulation. The CEM algorithm was initialized from the true partition because the performances of the CEM algorithm were already evaluated in a previous paper 5 . Except for the sensitivity analyses on the number of SEM iterations, all simulations were performed with a number of SEM iterations equal to 100.…”
Section: Simulationsmentioning
confidence: 99%
“…This article focuses on the classification model. In order to take into account the between‐individual variability within groups, it is possible to extend the classification model by incorporating random effects around the typical trajectories 5 …”
Classifying patient biomarker trajectories into groups has become frequent in clinical research. Mixed effects classification models can be used to model the heterogeneity of longitudinal data. The estimated parameters of typical trajectories and the partition can be provided by the classification version of the expectation maximization algorithm, named CEM. However, the variance of the parameter estimates obtained underestimates the true variance because classification uncertainties are not taken into account. This article takes into account these uncertainties by using the stochastic EM algorithm (SEM), a stochastic version of the CEM algorithm, after convergence of the CEM algorithm. The simulations showed correct coverage probabilities of the 95% confidence intervals (close to 95% except for scenarios with high bias in typical trajectories). The method was applied on a trial, called low‐cyclo, that compared the effects of low vs standard cyclosporine A doses on creatinine levels after cardiac transplantation. It identified groups of patients for whom low‐dose cyclosporine may be relevant, but with high uncertainty on the dose‐effect estimate.
Local cervical cancer epidemiological data essential to project the context-specific impact of cervical cancer preventive measures are often missing. We developed a framework, hereafter named Footprinting, to approximate missing data on sexual behaviour, human papillomavirus (HPV) prevalence, or cervical cancer incidence, and applied it to an Indian case study. With our framework, we (1) identified clusters of Indian states with similar cervical cancer incidence patterns, (2) classified states without incidence data to the identified clusters based on similarity in sexual behaviour, (3) approximated missing cervical cancer incidence and HPV prevalence data based on available data within each cluster. Two main patterns of cervical cancer incidence, characterized by high and low incidence, were identified. Based on the patterns in the sexual behaviour data, all Indian states with missing data on cervical cancer incidence were classified to the low-incidence cluster. Finally, missing data on cervical cancer incidence and HPV prevalence were approximated based on the mean of the available data within each cluster. With the Footprinting framework, we approximated missing cervical cancer epidemiological data and made context-specific impact projections for cervical cancer preventive measures, to assist public health decisions on cervical cancer prevention in India and other countries.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.