2023
DOI: 10.3390/info14070376
|View full text |Cite
|
Sign up to set email alerts
|

A Comparison of Machine Learning Techniques for the Detection of Type-2 Diabetes Mellitus: Experiences from Bangladesh

Abstract: Diabetes is a chronic disease caused by a persistently high blood sugar level, causing other chronic diseases, including cardiovascular, kidney, eye, and nerve damage. Prompt detection plays a vital role in reducing the risk and severity associated with diabetes, and identifying key risk factors can help individuals become more mindful of their lifestyles. In this study, we conducted a questionnaire-based survey utilizing standard diabetes risk variables to examine the prevalence of diabetes in Bangladesh. To … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 14 publications
(9 citation statements)
references
References 71 publications
0
3
0
Order By: Relevance
“…Conversely, when the p -value falls below 0.05, it indicates a probable correlation between the category attributes and the dependent variable. The equation for χ 2 is given below: where the observed frequencies are denoted as , the predicted frequencies are denoted as , and the sample size is denoted as n [ 39 ].…”
Section: Methodsmentioning
confidence: 99%
“…Conversely, when the p -value falls below 0.05, it indicates a probable correlation between the category attributes and the dependent variable. The equation for χ 2 is given below: where the observed frequencies are denoted as , the predicted frequencies are denoted as , and the sample size is denoted as n [ 39 ].…”
Section: Methodsmentioning
confidence: 99%
“…The evaluation techniques used in this study are based on measures obtained from [62], namely Accuracy (AY), Precision (PN), Recall (RL), F-Measure (FE), Kappa (KA), Log-Loss (LS) and class-specific AUC ROC curves, and Confusion Matrix. These metrics serve as significant benchmarks for assessing the results of the experiment.…”
Section: Performance Measurementioning
confidence: 99%
“…The most significant probability-based order unit of measurement is log-loss. The log-loss metric quantifies the uncertainty of a probabilistic approach by evaluating its accuracy in predicting true labels [62]. A low log-loss value suggests an accurate prediction.…”
Section: Performance Measurementioning
confidence: 99%
“…In order to improve the survival rate of heart failure patients, the extra tree classifier (ETC) was proposed; it uses SMOTE to balance the data [25]. Also, the authors used SMOTE to classify diabetes and reliable stress levels [26,27]. Fitriyani et al proposed using extreme gradient boosting with SMOTE-ENN to solve the cardiovascular prediction problem [28].…”
Section: Introductionmentioning
confidence: 99%
“…Classification on imbalanced datasets can result in biased outcomes, as most standard classification algorithms favor the majority class, leading to poor prediction accuracy for the minority class. To balance the data distribution, most prior studies employed the SMOTE method [24][25][26][27][28], which has some disadvantages. The quality of the samples generated by SMOTE depends on the parameter k, which is difficult to determine due to the variety of datasets.…”
Section: Introductionmentioning
confidence: 99%