2020
DOI: 10.1038/s41598-020-77220-w
|View full text |Cite
|
Sign up to set email alerts
|

A comparison of machine learning methods for survival analysis of high-dimensional clinical data for dementia prediction

Abstract: Data collected from clinical trials and cohort studies, such as dementia studies, are often high-dimensional, censored, heterogeneous and contain missing information, presenting challenges to traditional statistical analysis. There is an urgent need for methods that can overcome these challenges to model this complex data. At present there is no cure for dementia and no treatment that can successfully change the course of the disease. Machine learning models that can predict the time until a patient develops d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

9
148
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 167 publications
(173 citation statements)
references
References 37 publications
9
148
0
Order By: Relevance
“…This leads to conclusions that can be completed with the results of this work, since this will give information of when that hospital admission will take place. However, machine learning-based approaches are not the best option when working with censored data, although they are specially helpful when handling high-dimensional clinical data [ 64 ]. Note that, in a context with censored data, it is not possible to apply directly machine learning classical models since they do not account for censored observations.…”
Section: Discussionmentioning
confidence: 99%
“…This leads to conclusions that can be completed with the results of this work, since this will give information of when that hospital admission will take place. However, machine learning-based approaches are not the best option when working with censored data, although they are specially helpful when handling high-dimensional clinical data [ 64 ]. Note that, in a context with censored data, it is not possible to apply directly machine learning classical models since they do not account for censored observations.…”
Section: Discussionmentioning
confidence: 99%
“…Because of the curse of dimensionality, the high-dimensional data from gene expression profiles present challenges for the use of traditional feature selection methods, including overfitting, weak generalization ability, and high variance [ 36 ]. The relationship between the samples and features of the cancer datasets is formulated by the following matrix: where x m is defined as the m link of the characteristic vector, and y m describes the column vector representing the sample categories.…”
Section: Methodsmentioning
confidence: 99%
“…Integrative Feature Selection Scheme (FRL) for Identifying Multiple Genomic Biomarkers. Because of the curse of dimensionality, the high-dimensional data from gene expression profiles present challenges for the use of traditional feature selection methods, including overfitting, weak generalization ability, and high variance [36]. The relationship between the samples and features of the cancer datasets is formulated by the following matrix:…”
mentioning
confidence: 99%
“…Without the presence of censoring, standard LR could be used. Traditionally, the Cox proportional hazard (CPH) model has been the most widely used model to analyse censored data, but the CPH model often works for small datasets and does not scale well to high dimensions and large volumes of clinical data [ 26 ].…”
Section: Modeling the Likelihood Of Clinical Outcomesmentioning
confidence: 99%