PURPOSE Patients with myelodysplastic syndromes (MDS) have a survival that can range from months to decades. Prognostic systems that incorporate advanced analytics of clinical, pathologic, and molecular data have the potential to more accurately and dynamically predict survival in patients receiving various therapies. METHODS A total of 1,471 MDS patients with comprehensively annotated clinical and molecular data were included in a training cohort and analyzed using machine learning techniques. A random survival algorithm was used to build a prognostic model, which was then validated in external cohorts. The accuracy of the proposed model, compared with other established models, was assessed using a concordance (c)index. RESULTS The median age for the training cohort was 71 years. Commonly mutated genes included SF3B1, TET2, and ASXL1. The algorithm identified chromosomal karyotype, platelet, hemoglobin levels, bone marrow blast percentage, age, other clinical variables, seven discrete gene mutations, and mutation number as having prognostic impact on overall and leukemia-free survivals. The model was validated in an independent external cohort of 465 patients, a cohort of patients with MDS treated in a prospective clinical trial, a cohort of patients with paired samples at different time points during the disease course, and a cohort of patients who underwent hematopoietic stem-cell transplantation. CONCLUSION A personalized prediction model on the basis of clinical and genomic data outperformed established prognostic models in MDS. The new model was dynamic, predicting survival and leukemia transformation probabilities at different time points that are unique for a given patient, and can upstage and downstage patients into more appropriate risk categories.
The volume and complexity of scientific and clinical data in oncology have grown markedly over recent years, including but not limited to the realms of electronic health data, radiographic and histologic data, and genomics. This growth holds promise for a deeper understanding of malignancy and, accordingly, more personalized and effective oncologic care. Such goals require, however, the development of new methods to fully make use of the wealth of available data. Improvements in computer processing power and algorithm development have positioned machine learning, a branch of artificial intelligence, to play a prominent role in oncology research and practice. This review provides an overview of the basics of machine learning and highlights current progress and challenges in applying this technology to cancer diagnosis, prognosis, and treatment recommendations, including a discussion of current takeaways for clinicians.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.