2018
DOI: 10.1007/s10916-018-0940-7
|View full text |Cite
|
Sign up to set email alerts
|

Accurate Diabetes Risk Stratification Using Machine Learning: Role of Missing Value and Outliers

Abstract: Diabetes mellitus is a group of metabolic diseases in which blood sugar levels are too high. About 8.8% of the world was diabetic in 2017. It is projected that this will reach nearly 10% by 2045. The major challenge is that when machine learning-based classifiers are applied to such data sets for risk stratification, leads to lower performance. Thus, our objective is to develop an optimized and robust machine learning (ML) system under the assumption that missing values or outliers if replaced by a median conf… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
83
0
3

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 182 publications
(86 citation statements)
references
References 29 publications
(32 reference statements)
0
83
0
3
Order By: Relevance
“…V 6 } with 6 measurements per 24-h sampling window for each feature. Missing values were imputed by using the median value of the variable over the entire cohort at the sampling time point [23]. each as a true positive, false positive, true negative or false negative.…”
Section: Defining Optimal Prediction Timementioning
confidence: 99%
See 1 more Smart Citation
“…V 6 } with 6 measurements per 24-h sampling window for each feature. Missing values were imputed by using the median value of the variable over the entire cohort at the sampling time point [23]. each as a true positive, false positive, true negative or false negative.…”
Section: Defining Optimal Prediction Timementioning
confidence: 99%
“…The result was a time series V = {V1, V2, … V6} with 6 measurements per 24-h sampling window for each feature. Missing values were imputed by using the median value of the variable over the entire cohort at the sampling time point [23]. Performance of the models were compared, and an optimal prediction time of 6 h was chosen based on clinical and operational considerations such as anticipated impact on nursing and clinician workload.…”
Section: Defining Optimal Prediction Timementioning
confidence: 99%
“…However, for feature subset selection, ECAs lack in reducing residual features from final selection and they are costly too. In [28] development of a robust optimized machine learning -ML system is presented. The aim is to improve risk stratification accuracy by replacing outliers with median configuration, which is based on assumption.…”
Section: Multi-objective Optimization Techniquesmentioning
confidence: 99%
“…Type 2 diabetes is a metabolic disease that causes glucose to accumulate in the blood. When a person eats food it gets broken down into glucose which enters into cells for the functioning of the body [6]. A hormone called insulin is secreted by the pancreas, which is a chemical messenger that allows glucose to enter into cells.…”
Section: Fig 1projection Status Of T2d Worldwide: 2010-2030mentioning
confidence: 99%
“…The majority of studies shows that diabetes mellitus has proven to affect the growth in the health department over the last few years [6]. It has been observed that in 2013, normal Indian suffering from diabetes mellitus was estimated to have spent Rs 4,493 (US$95) a yearly treatment cost.…”
Section: Fig 1projection Status Of T2d Worldwide: 2010-2030mentioning
confidence: 99%