An Ensemble Approach for the Prediction of Diabetes Mellitus Using a Soft Voting Classifier with an Explainable AI

Kibria, Hafsa Binte; Nahiduzzaman, Md.; Goni, Md. Omaer Faruq; Ahsan, Mominul; Haider, Julfikar

doi:10.3390/s22197268

Cited by 35 publications

(13 citation statements)

References 39 publications

“…Remarkably, Glucose, BMI, and Age were recognized as the most salient features [68]. Similarly, another study employed similar methods including RF and XGBoost, and employed LIM and SHAP as explainers [69].…”

Section: Analysis of the XAI Evaluation (mentioning)

confidence: 95%

Explainable AI Evaluation: A Top-Down Approach for Selecting Optimal Explanations for Black Box Models

Mirzaei, Mao, Al-Nima et al. 2023

Information

Explainable Artificial Intelligence (XAI) evaluation has grown significantly due to its extensive adoption, and the catastrophic consequence of misinterpreting sensitive data, especially in the medical field. However, the multidisciplinary nature of XAI research resulted in diverse scholars possessing significant challenges in designing proper evaluation methods. This paper proposes a novel framework of a three-layered top-down approach on how to arrive at an optimal explainer, accenting the persistent need for consensus in XAI evaluation. This paper also investigates a critical comparative evaluation of explanations in both model agnostic and specific explainers including LIME, SHAP, Anchors, and TabNet, aiming to enhance the adaptability of XAI in a tabular domain. The results demonstrate that TabNet achieved the highest classification recall followed by TabPFN, and XGBoost. Additionally, this paper develops an optimal approach by introducing a novel measure of relative performance loss with emphasis on faithfulness and fidelity of global explanations by quantifying the extent to which a model’s capabilities diminish when eliminating topmost features. This addresses a conspicuous gap in the lack of consensus among researchers regarding how global feature importance impacts classification loss, thereby undermining the trust and correctness of such applications. Finally, a practical use case on medical tabular data is provided to concretely illustrate the findings.
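
The abstract above describes a relative performance loss measure only in words: how far a model's capability drops once its topmost features are eliminated. A minimal sketch of one plausible reading follows. The formula (drop in score divided by the baseline score), the function name, and the example scores are all assumptions for illustration; the citing paper's exact definition is not given here.

```python
def relative_performance_loss(baseline_score: float, ablated_score: float) -> float:
    """Fraction of the baseline score lost after removing top-ranked features.

    Assumed formula (not taken from the paper): (baseline - ablated) / baseline.
    """
    if baseline_score <= 0:
        raise ValueError("baseline_score must be positive")
    return (baseline_score - ablated_score) / baseline_score


# Hypothetical recall values: 0.80 with all features, 0.60 after dropping
# the features an explainer ranked highest.
loss = relative_performance_loss(0.80, 0.60)
print(f"relative loss: {loss:.2f}")
```

Under this reading, an explainer whose top-ranked features cause a larger relative loss when removed would be judged more faithful to the model.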

“…For model development, the study cohort was randomly divided to create a 70%:30% training set to test set ratio. Because the number of ESRD cases was much smaller than the number of non-ESRD cases, we performed the synthetic minority over-sampling technique (SMOTE)-Tomek algorithms to balance the number of samples taken for imbalanced data [18,19]. Six machine learning models, including logistic regression, extra trees [20], random forest [21], gradient boosting decision tree (GBDT) [22], extreme gradient boosting models (XGBoost) [23], and light gradient boosting machine (LGBM) [24], are performed.…”

Section: Data Cleaning and Machine Learning Model Development (mentioning)

confidence: 99%

Prediction of the risk of developing end-stage renal diseases in newly diagnosed type 2 diabetes mellitus using artificial intelligence algorithms

Tsai, Lee et al. 2023

BioData Mining

Objectives: Type 2 diabetes mellitus (T2DM) imposes a great burden on healthcare systems, and these patients experience higher long-term risks for developing end-stage renal disease (ESRD). Managing diabetic nephropathy becomes more challenging when kidney function starts declining. Therefore, developing predictive models for the risk of developing ESRD in newly diagnosed T2DM patients may be helpful in clinical settings. Methods: We established machine learning models constructed from a subset of clinical features collected from 53,477 newly diagnosed T2DM patients from January 2008 to December 2018 and then selected the best model. The cohort was divided, with 70% and 30% of patients randomly assigned to the training and testing sets, respectively. Results: The discriminative ability of our machine learning models, including logistic regression, extra tree classifier, random forest, gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and light gradient boosting machine were evaluated across the cohort. XGBoost yielded the highest area under the receiver operating characteristic curve (AUC) of 0.953, followed by extra tree and GBDT, with AUC values of 0.952 and 0.938 on the testing dataset. The SHapley Additive explanation summary plot in the XGBoost model illustrated that the top five important features included baseline serum creatinine, mean serum creatine within 1 year before the diagnosis of T2DM, high-sensitivity C-reactive protein, spot urine protein-to-creatinine ratio and female gender. Conclusions: Because our machine learning prediction models were based on routinely collected clinical features, they can be used as risk assessment tools for developing ESRD. By identifying high-risk patients, intervention strategies may be provided at an early stage.
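
The 70%/30% random assignment described in this abstract can be sketched in plain Python. The sketch below splits within each class so that a rare outcome keeps the same share in both sets; that stratification is an assumption added here (the abstract only says "randomly assigned"), and the function name, seed, and toy labels are hypothetical. Resampling steps such as SMOTE-Tomek are not shown and, if used, would normally be applied to the training indices only.

```python
import random
from collections import defaultdict


def stratified_split(labels, test_fraction=0.3, seed=0):
    """Return (train_idx, test_idx), keeping each class's share in both sets."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)

    train_idx, test_idx = [], []
    for idxs in by_class.values():
        rng.shuffle(idxs)  # random assignment within the class
        n_test = round(len(idxs) * test_fraction)
        test_idx.extend(idxs[:n_test])
        train_idx.extend(idxs[n_test:])
    return sorted(train_idx), sorted(test_idx)


# Toy imbalanced labels: 10 positive cases among 100 (made-up numbers).
labels = [1] * 10 + [0] * 90
train_idx, test_idx = stratified_split(labels)
print(len(train_idx), len(test_idx))
```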

“…As a result, Choudary focuses on Precision and Recall. Hafsa Binte Kibria used a ratio of training data : test data is 7:3. In her research, it was found that when using the Random Forest method, the irrelevant feature was glucose, blood pressure, and pregnancy [21]. Vaishali, tried to find important features by using feature selection based on genetic algorithms.…”

Section: Research on Important Features in the Pima Indian Database (mentioning)

confidence: 99%

Analyze Important Features of PIMA Indian Database For Diabetes Prediction Using KNN

2023

Diabetes is a chronic, non-communicable disease, and a long-term health condition that affects how the body uses glucose, the type of sugar that gives energy. In Indonesia, diabetes ranks as the sixth highest cause of death, following conditions related to childbirth. In 2021, Indonesia has a total of 19.5 million diabetes patients, making it the fifth-highest in the world. Some machine learning research has used data from the PIDD (PIMA Indian Diabetes Dataset) to predict diabetes. In this research, in addition to prediction accuracy, data complexity is also important. This research analyzes important features in the PIMA Indian database using the KNN (k-nearest neighbor) method for classification. The results show that using KNN with k=22 value results in the highest accuracy of 83.12%. The analysis also found that the important features required by the KNN method to achieve high accuracy from the PIMA Indian database, in order of importance, are glucose, age, insulin, blood pressure, Body Mass Index, pregnancy, skin thickness, and diabetes pedigree function. However, when used in the KNN classification method, the diabetes pedigree function feature was found to be unnecessary, not relevant, and can be reduced.
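
The k-nearest-neighbor classification this abstract relies on can be illustrated with a minimal sketch: a query point receives the majority label of its k closest training points. The two-feature rows below (glucose, BMI) are invented for illustration and are not taken from the PIMA dataset; the toy example uses k=3 because it has only six rows, whereas the abstract reports k=22 on the full data. Euclidean distance on unscaled features is also an assumption.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Majority label among the k training points closest to `query`."""
    # Pair each training point's Euclidean distance with its label, nearest first.
    neighbors = sorted(
        (math.dist(x, query), y) for x, y in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in neighbors[:k])
    return votes.most_common(1)[0][0]


# Hypothetical (glucose, BMI) rows; label 1 = diabetic, 0 = non-diabetic.
train_X = [(85, 22.0), (90, 24.5), (95, 23.0), (150, 33.0), (160, 35.5), (170, 31.0)]
train_y = [0, 0, 0, 1, 1, 1]

print(knn_predict(train_X, train_y, (155, 34.0)))
print(knn_predict(train_X, train_y, (88, 23.0)))
```

In practice features on different scales are usually standardized first, since otherwise the feature with the largest numeric range dominates the distance.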
