Trajectory clustering using mixed classification models

Klich, Amna; Écochard, René; Subtil, Fabien

doi:10.1002/sim.8975

Cited by 4 publications

(10 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Moreover, it also helps to pinpoint geographical units that could be interesting for future data collection efforts. Secondly, we made use of a newly developed clustering method [21, 22] that is able to assess the similarities between age-specific patterns of cervical cancer incidence, which have not been considered by previous studies.…”

Section: Discussionmentioning

confidence: 99%

“…The statistical method employed in the Clustering step to cluster registry-specific cervical cancer incidence data was a Poisson-regression-based CEM clustering algorithm,[21, 22] described in detail in Appendix S1 . Briefly, clusters of age-specific cervical cancer incidence were obtained by likelihood-based optimization under Poisson regression model.…”

Section: Methodsmentioning

confidence: 99%

See 1 more Smart Citation

“Footprinting” missing epidemiological data for cervical cancer: a case study in India

Man

Georges

Bonjour

et al. 2022

Preprint

View full text Add to dashboard Cite

Background Context-specific cervical cancer epidemiological data are essential to derive local impact projections of cervical cancer preventive measures. However, these are not always available, in particular in low- and middle-income countries (LMICs), where impact projections are essential to plan cervical cancer control programs. Methods and Findings We developed a framework, hereafter named Footprinting, to approximate the sexual behavior, human papillomavirus (HPV) prevalence, and/or cervical cancer incidence data needed for impact projections. The framework was applied to a case study in India, the country with the highest expected cervical cancer burden but still limited access to cervical cancer prevention. With our Footprinting framework, we 1) identified clusters of Indian states with similar cervical cancer incidence patterns, 2) classified states without incidence data to the identified clusters based on similarity in sexual behavior data, 3) approximated missing cervical cancer incidence and HPV prevalence data based on available data within each cluster. Two main patterns of cervical cancer incidence, characterized by high and low incidence, were identified for 6 and 8 Indian states, respectively. States in the low-incidence cluster were characterized by less sexual activity with non-regular partners in men and earlier sexual debut in women. Based on these patterns, all 11 Indian states with missing cervical cancer incidence data were classified to the low-incidence cluster. Finally, missing data on cervical cancer incidence and HPV prevalence were approximated based on the mean of the available data within each cluster. Conclusions With the Footprinting framework, we enabled approximation of missing cervical cancer epidemiological data and derivation of context-specific impact projection of cervical cancer prevention measures, assisting public health decisions on cervical cancer prevention in India and other LMICs.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Methodsmentioning

confidence: 99%

“Footprinting” missing epidemiological data for cervical cancer: a case study in India

Man

Georges

Bonjour

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…This method is classically used for models without random effects, but it was evaluated and validated for mixed effects models 5 . In the M step, the estimates of

{\boldsymbol{\beta}}_{\boldsymbol{k}},{\boldsymbol{D}}_{\boldsymbol{k}},

and

{\sigma}_k^2

can be obtained using classical algorithms for mixed effects models, such as the ones included in the lme function from the nlme R package 10 (R version 3.6.2).…”

Section: The Mixed Effects Classification Modelmentioning

confidence: 99%

“…The classification model applied to the dataset was defined as the model used for data simulation. The CEM algorithm was initialized from the true partition because the performances of the CEM algorithm were already evaluated in a previous paper 5 . Except for the sensitivity analyses on the number of SEM iterations, all simulations were performed with a number of SEM iterations equal to 100.…”

Section: Simulationsmentioning

confidence: 99%

See 1 more Smart Citation

Clustering of trajectories with mixed effects classification model: Inference taking into account classification uncertainties

Dugourd,

Abichou‐Klich,

Ecochard

et al. 2023

Statistics in Medicine

Self Cite

View full text Add to dashboard Cite

Classifying patient biomarker trajectories into groups has become frequent in clinical research. Mixed effects classification models can be used to model the heterogeneity of longitudinal data. The estimated parameters of typical trajectories and the partition can be provided by the classification version of the expectation maximization algorithm, named CEM. However, the variance of the parameter estimates obtained underestimates the true variance because classification uncertainties are not taken into account. This article takes into account these uncertainties by using the stochastic EM algorithm (SEM), a stochastic version of the CEM algorithm, after convergence of the CEM algorithm. The simulations showed correct coverage probabilities of the 95% confidence intervals (close to 95% except for scenarios with high bias in typical trajectories). The method was applied on a trial, called low‐cyclo, that compared the effects of low vs standard cyclosporine A doses on creatinine levels after cardiac transplantation. It identified groups of patients for whom low‐dose cyclosporine may be relevant, but with high uncertainty on the dose‐effect estimate.

show abstract

Approximating missing epidemiological data for cervical cancer through Footprinting: A case study in India

Man

Georges

Bonjour

et al. 2023

eLife

View full text Add to dashboard Cite

Local cervical cancer epidemiological data essential to project the context-specific impact of cervical cancer preventive measures are often missing. We developed a framework, hereafter named Footprinting, to approximate missing data on sexual behaviour, human papillomavirus (HPV) prevalence, or cervical cancer incidence, and applied it to an Indian case study. With our framework, we (1) identified clusters of Indian states with similar cervical cancer incidence patterns, (2) classified states without incidence data to the identified clusters based on similarity in sexual behaviour, (3) approximated missing cervical cancer incidence and HPV prevalence data based on available data within each cluster. Two main patterns of cervical cancer incidence, characterized by high and low incidence, were identified. Based on the patterns in the sexual behaviour data, all Indian states with missing data on cervical cancer incidence were classified to the low-incidence cluster. Finally, missing data on cervical cancer incidence and HPV prevalence were approximated based on the mean of the available data within each cluster. With the Footprinting framework, we approximated missing cervical cancer epidemiological data and made context-specific impact projections for cervical cancer preventive measures, to assist public health decisions on cervical cancer prevention in India and other countries.

show abstract

Trajectory clustering using mixed classification models

Cited by 4 publications

References 24 publications

“Footprinting” missing epidemiological data for cervical cancer: a case study in India

“Footprinting” missing epidemiological data for cervical cancer: a case study in India

Clustering of trajectories with mixed effects classification model: Inference taking into account classification uncertainties

Approximating missing epidemiological data for cervical cancer through Footprinting: A case study in India

Contact Info

Product

Resources

About