2022
DOI: 10.3390/clockssleep4040039
The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns

Abstract: The purpose of this study is to characterize the impact of the timing and duration of missing actigraphy data on interdaily stability (IS) and intradaily variability (IV) calculation. The performance of three missing data imputation methods (linear interpolation, mean time of day (ToD), and median ToD imputation) for estimating IV and IS was also tested. Week-long actigraphy records with no non-wear or missing timeseries data were masked with zeros or ‘Not a Number’ (NaN) across a range of timings and duration…
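The IS and IV metrics named in the abstract have standard definitions (IS: ratio of the variance of the average 24-hour profile to the total variance; IV: ratio of the mean-squared first difference to the total variance). A minimal sketch of computing both from an epoch-level activity series is below; the function name and hourly binning are illustrative, not taken from the paper's code.

```python
import numpy as np

def is_iv(activity, epochs_per_day=24):
    """Compute interdaily stability (IS) and intradaily variability (IV).

    activity: 1-D array of epoch-level activity values covering whole days
    epochs_per_day: number of epochs per 24 h (24 for hourly bins)
    """
    x = np.asarray(activity, dtype=float)
    n = x.size
    p = epochs_per_day
    mean = x.mean()
    # average time-of-day profile across days
    profile = x.reshape(-1, p).mean(axis=0)
    ss_total = np.sum((x - mean) ** 2)
    # IS: variance of the 24-h profile relative to total variance (1 = perfectly stable)
    is_val = n * np.sum((profile - mean) ** 2) / (p * ss_total)
    # IV: variance of epoch-to-epoch changes relative to total variance
    iv_val = n * np.sum(np.diff(x) ** 2) / ((n - 1) * ss_total)
    return is_val, iv_val
```

A perfectly repeating daily pattern yields IS = 1, which makes the metric's sensitivity to masked or imputed epochs easy to probe by corrupting such a synthetic record.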

Cited by 13 publications (9 citation statements) · References 24 publications
“…In this study, there were some missing values in the raw data we used, and most of them were filled in by manually tracing the source materials. For the small number of remaining missing values in quantitative variables such as age, we used mean imputation, since the mean represents the central tendency of the data and helps preserve its distribution. For qualitative variables such as crime type, we used median imputation, a better choice because it reduces the influence of extreme values while preserving the order and levels of the data 38 .…”
Section: Methods
confidence: 99%
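The citing study's strategy (mean imputation for quantitative variables, median imputation for ordinal/qualitative codes) can be sketched in a few lines of pandas; the frame and column names here are purely illustrative, not from the cited work.

```python
import numpy as np
import pandas as pd

# hypothetical example frame; column names are illustrative
df = pd.DataFrame({
    "age": [34.0, np.nan, 51.0, 29.0],            # quantitative
    "severity_rank": [2.0, 3.0, np.nan, 1.0],     # ordinal code
})

# mean imputation for quantitative variables preserves the central tendency
df["age"] = df["age"].fillna(df["age"].mean())
# median imputation for ordinal codes is robust to extreme values
df["severity_rank"] = df["severity_rank"].fillna(df["severity_rank"].median())
```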
“…High-frequency (100 Hz) accelerometer data were processed on Sherlock, a high-performance computing cluster provided by Stanford University, using the steps outlined in Weed et al 2022 [ 18 ]. In brief, data spanning 1 week of collection were down-sampled to 30 s epochs using the biobankAccelerometerAnalysis package in Python v3.6.1 [ 17 ].…”
Section: Methods
confidence: 99%
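The down-sampling step described above (100 Hz tri-axial samples reduced to 30 s epochs) can be approximated without the biobankAccelerometerAnalysis package as a per-epoch mean of a gravity-corrected vector magnitude; this is a simplified stand-in for that pipeline step, and the ENMO-style summary used here is an assumption, not the package's exact algorithm.

```python
import numpy as np

def downsample_to_epochs(xyz, fs=100, epoch_sec=30):
    """Reduce raw tri-axial samples (in g) to one activity value per epoch.

    xyz: (n_samples, 3) array sampled at fs Hz. Returns a Euclidean-norm-
    minus-one (ENMO-style) mean per epoch; trailing partial epochs are dropped.
    """
    vm = np.linalg.norm(xyz, axis=1)          # vector magnitude per sample
    enmo = np.maximum(vm - 1.0, 0.0)          # remove gravity, clip at zero
    samples_per_epoch = fs * epoch_sec
    n_epochs = enmo.size // samples_per_epoch
    trimmed = enmo[: n_epochs * samples_per_epoch]
    return trimmed.reshape(n_epochs, samples_per_epoch).mean(axis=1)
```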
“…Non-wear time was defined as stationary episodes lasting at least 60 minutes in which all three axes had a standard deviation below 13.0 mg. If present, non-wear segments were automatically imputed using the median of similar time-of-day vector-magnitude and intensity-distribution data points, at 30-second granularity, taken from different days of the measurement 18 . Following these preprocessing steps, we derived the following six metrics.…”
Section: Methods
confidence: 99%
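The non-wear criterion quoted above (stationary runs of at least 60 minutes with standard deviation below 13 mg) can be sketched as a sliding-window check over 30-second epochs. This simplified version flags a single vector-magnitude series, whereas the pipeline applies the threshold to each axis separately; the function name and windowing details are illustrative assumptions.

```python
import numpy as np

def flag_nonwear(vm_30s, sd_threshold_mg=13.0, min_minutes=60):
    """Flag epochs belonging to low-variability runs of >= min_minutes.

    vm_30s: 1-D vector-magnitude series in milli-g, one value per 30 s epoch.
    Returns a boolean mask the same length as vm_30s.
    """
    epoch_sec = 30
    win = (min_minutes * 60) // epoch_sec      # epochs per candidate window
    flags = np.zeros(vm_30s.size, dtype=bool)
    # slide a window; any window below the SD threshold marks all its epochs
    for start in range(0, vm_30s.size - win + 1):
        seg = vm_30s[start:start + win]
        if seg.std() < sd_threshold_mg:
            flags[start:start + win] = True
    return flags
```

Flagged segments would then be handed to the median time-of-day imputation described in the quote rather than analyzed directly.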