Background and Aims Chronic Kidney Disease (CKD) is a common and debilitating condition that affects over 850 million people worldwide. The disease is associated with high mortality rates that can reach up to 10-15% per year, multiple complications, among which cardiovascular ones stand out. These complications can contribute to the progression of CKD, and this in turn to the appearance of complications, feeding each other. Despite the availability of clinical guidelines and predictive models, accurately predicting disease progression and identifying risk factors for progression in CKD patients remains a challenge. The limitations of current methodologies, including simplifying complex relationships and relying on linear assumptions, have hindered progress in this area. The advancement of Artificial Intelligence and Machine Learning has provided a new opportunity to address these challenges. The goal of this study was to evaluate the performance of gradient boosting algorithms in predicting the progression of renal disease in a large dataset of 1327 patients with a follow up of 10 years. Method Design: Retrospective analysis of a historical cohort from the Register of Renal Patients of Catalonia (RMRC) and the Data analytics program for health research and innovation (PADRIS) from Health Quality and Assessment Agency of Catalonia (AQuAS). Inclusion Criteria: > 18 y.o. CKD stages from 2 to Renal Replacement Therapy (RRT) and adequate data after pre-processing the sample. N = 1.327 patients with 27.572 records. Follow up of 10 years (January 2010 - December 2020). Variables: Age, gender, BMI, Diagnoses (ICD-10) = 95, Transplant waiting list status; RRT status; Laboratory variables = 77; f) Pharmacological treatment = 100. Method By using Light Gradient-Boosting Machine (LGBM) testing CKD progression prediction horizon in quarterly windows for multiple periods. Methodology: 1. Pre-processing of the sample and data. 2. Training and testing for variables exploration. 3. Dataset structuring in quarterly windows. 4. Samples randomization and data separation for a 5-fold cross-validation (20% test - 80% training). 5. Training and tuning of LGBM model for different prediction horizons. Results Age: 62 ± 13 years; Gender: 34% female, 66% male. Best prediction horizon was for 8 quarters (2 years), with a ROC curve of 0.967 and accuracy of 0.860. The 10 variables with major relevance in the model in order were estimated Glomerular Filtration Rate, Age, Microalbuminuria, BMI, HDL, Glucose, Urea, Platelets, Triglycerides and Sodium. Conclusion 1. The prediction of CKD progression can benefit from the use of Machine Learning with results that outperform methods based on classical statistics. 2. It can allow the individualization of the prognosis and thus be able to carry out early interventions to improve the prognosis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.