Background
Machine learning (ML) has shown exceptional promise in various domains of medical research. However, its application in predicting subsequent fragility fractures is still largely unknown. In this study, we aim to evaluate the predictive power of different ML algorithms in this area and identify key features associated with the risk of subsequent fragility fractures in osteoporotic patients.
Methods
We retrospectively analyzed data from patients presented with fragility fractures at our Fracture Liaison Service, categorizing them into index fragility fracture (n = 905) and subsequent fragility fracture groups (n = 195). We independently trained ML models using 27 features for both male and female cohorts. The algorithms tested include Random Forest, XGBoost, CatBoost, Logistic Regression, LightGBM, AdaBoost, Multi-Layer Perceptron, and Support Vector Machine. Model performance was evaluated through 10-fold cross-validation.
Results
The CatBoost model outperformed other models, achieving 87% accuracy and an AUC of 0.951 for females, and 93.4% accuracy with an AUC of 0.990 for males. The most significant predictors for females included age, serum C-reactive protein (CRP), 25(OH)D, creatinine, blood urea nitrogen (BUN), parathyroid hormone (PTH), femoral neck Z-score, menopause age, number of pregnancies, phosphorus, calcium, and body mass index (BMI); for males, the predictors were serum CRP, femoral neck T-score, PTH, hip T-score, BMI, BUN, creatinine, alkaline phosphatase, and spinal Z-score.
Conclusion
ML models, especially CatBoost, offer a valuable approach for predicting subsequent fragility fractures in osteoporotic patients. These models hold the potential to enhance clinical decision-making by supporting the development of personalized preventative strategies.