BackgroundAs more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs.ObjectiveTo attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence.MethodsA multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method.ResultsThe process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models.ConclusionsA set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.
Obstructive sleep apnea syndrome (OSAS) is associated with cardiovascular morbidity as well as excessive daytime sleepiness and poor quality of life. In this study, we apply a machine learning technique [support vector machines (SVMs)] for automated recognition of OSAS types from their nocturnal ECG recordings. A total of 125 sets of nocturnal ECG recordings acquired from normal subjects (OSAS - ) and subjects with OSAS (OSAS +), each of approximately 8 h in duration, were analyzed. Features extracted from successive wavelet coefficient levels after wavelet decomposition of signals due to heart rate variability (HRV) from RR intervals and ECG-derived respiration (EDR) from R waves of QRS amplitudes were used as inputs to the SVMs to recognize OSAS +/- subjects. Using leave-one-out technique, the maximum accuracy of classification for 83 training sets was found to be 100% for SVMs using a subset of selected combination of HRV and EDR features. Independent test results on 42 subjects showed that it correctly recognized 24 out of 26 OSAS + subjects and 15 out of 16 OSAS - subjects (accuracy = 92.85%; Cohen's kappa value of 0.85). For estimating the relative severity of OSAS, the posterior probabilities of SVM outputs were calculated and compared with respective apnea/hypopnea index. These results suggest superior performance of SVMs in OSAS recognition supported by wavelet-based features of ECG. The results demonstrate considerable potential in applying SVMs in an ECG-based screening device that can aid a sleep specialist in the initial assessment of patients with suspected OSAS.
Background: Poincaré plot is one of the important techniques used for visually representing the heart rate variability. It is valuable due to its ability to display nonlinear aspects of the data sequence. However, the problem lies in capturing temporal information of the plot quantitatively. The standard descriptors used in quantifying the Poincaré plot (SD1, SD2) measure the gross variability of the time series data. Determination of advanced methods for capturing temporal properties pose a significant challenge. In this paper, we propose a novel descriptor "Complex Correlation Measure (CCM)" to quantify the temporal aspect of the Poincaré plot. In contrast to SD1 and SD2, the CCM incorporates point-to-point variation of the signal.
Current research applying variability measures of gait parameters has demonstrated promise for helping to solve one of the "holy grails" of geriatric research by defining markers that can be used to prospectively identify persons at risk of falling . The minimum toe clearance (MTC) event occurs during the leg swing phase of the gait cycle and is a task highly sensitive to the spatial and balance control properties of the locomotor system. The aim of this study is to build upon the current state of research by investigating the magnitude and dynamic structure from the MTC time series fluctuations due to aging and locomotor disorder. Thirty healthy young (HY), 27 healthy elderly (HE), and 10 falls risk (FR) elderly individuals (who presented a prior history of trip-related falls) participated in treadmill walking for at least 10 min at their preferred speed. Continuous MTC data were collected and the first 512 data points were analyzed. The following variability indices were quantified: 1) MTC mean and standard deviation (SD), 2) PoincarE plot indices of MTC variability (SD1, SD2, SD1/SD2), 3) a wavelet based multiscale exponent beta to describe the dynamic structure of MTC fluctuations, and 4) detrended fluctuation analysis exponent alpha to investigate the presence of long-range correlations in MTC time series data. Results showed that stride-to-stride MTC time series has a nonlinear structure in all three groups when compared against randomly shuffled surrogate MTC data. Test on aging effects showed the MTC central tendency was significantly lower (p < 0.01) and the magnitude of the MTC variability significantly higher (p < 0.01). This trend changed when comparing FR subjects against age-matched HE as both the central tendency (p < 0.01) and magnitude of the variability (p < 0.01) increased significantly in FR. Although the magnitude of MTC variability increased with age, the nonlinear indices represented by alpha, beta, and SD1/SD2 demonstrated that the nonlinear structure of MTC does not change significantly due to aging (p > 0.05). There were, however, significant differences between HY and FR for beta (between scale 1 and 2; p < 0.01) and alpha (p < 0.05). Out of all the variability measures applied, beta(Wv2-4), SD1/SD2, SD2 of critical MTC parameter were found to be potential markers to be able to reliably identify FR from HE subjects. Further research is required to understand the mechanisms underlying the cause of MTC variability.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.