Background
We constructed a recency‐frequency (RF) model for predicting the loss to follow‐up (LTFU) in HIV/AIDS patients in China.
Methods
Data on HIV/AIDS outpatients in the research unit from 1 August 2009 to 30 September 2020 and from 1 October to 31 December 2020 were exported as the observation and prediction datasets, respectively. The classic recency‐frequency‐monetary (RFM) model was expanded into RFm, RF, RFL and RFmL models. In the observation dataset, the best predictive model was obtained using k‐means clustering and C5.0 verification. Then, two rounds of k‐means modelling were performed on the best model: data with R ≤ 6 months were retained, randomly divided into a training set (70%) and a testing set (30%) and used to perform the second round of modelling to subdivide the churn and non‐churn patients. Next, an ANN algorithm was used to predict LTFU, and the confusion matrix with prediction datasets was constructed.
Results
The observation and prediction datasets included 16 949 and 10 748 samples, respectively. The RF model with three clusters and a quality of 0.82 was the best predictive model. From the observation set, 13 799 samples were retained, and the model accuracy was 100% on the training and testing sets. These 13 799 samples were subdivided into 1563 samples of churn patients and 12 216 samples of non‐churn patients. The accuracy of ANN prediction was 99.89%. The accuracy and precision of the confusion matrix were 85.41% and 99.76%, respectively.
Conclusion
The RF model is effective in predicting the LTFU of HIV/AIDS patients in China and preventing its occurrence.