Search citation statements
Paper Sections
Citation Types
Year Published
Publication Types
Relationship
Authors
Journals
BACKGROUND There has been a rapid growth in the application of structured Electronic Health Records (EHRs) to healthcare systems, where huge amounts of diagnosis codes presenting the temporal event of the patient are collected. In the era of artificial intelligence, many models, especially Deep Learning (DL), are applied for patient outcome prediction. This systematic review aimed to identify DL models developed for sequential diagnosis codes for patient outcome prediction. OBJECTIVE The main objective of this systematic review is to identify and summarise existing DL studies predicting patient outcome using sequences of diagnosis codes, as a key part of their predictors. Additionally, this study also investigates the challenge of generalisability and explainability of the predictive models. METHODS In this review, we identified all relevant studies by using the following four databases: PubMed, Embase, IEEE Xplore, and Web of Science. After that, we evaluated the included papers in various aspects: Deep learning techniques, characteristics of the dataset, prediction tasks, performance evaluation, generalizability, and explainability. We also assessed the risk of bias (PROBAST) and the concern of applicability. RESULTS In this review, 84 papers met the eligibility criteria and were selected, which showed the growing trend in this research area. Recurrent neural networks (RNN) (and their derivatives) (n = 47; 57.3%) and Transformers (n = 22; 26.8%) were the most popular architectures in DL-based models. Most studies present their input feature as sequence of visit embedding (n = 45; 53.6%). For the prediction tasks, the most common one is next visit diagnosis (n = 30; 23.4%), followed by heart failure (18; 14.1%), and mortality (n = 17; 13.3%). Only 7 studies evaluated their models in terms of generalisability. A positive correlation was observed between training sample size and model performance (AUROC) (p-value < 0.05). However, about 70% of included studies were found to have high risk of bias. CONCLUSIONS The application of deep learning in sequence of diagnosis has demonstrated remarkable promise in predicting patient outcomes. Using multiple types of features and integration of time intervals was found to improve the predictive performance. Addressing challenges related to generalisation and explainability will be instrumental in unlocking the full potential of DL for enhancing healthcare outcomes and patient care. CLINICALTRIAL This review was registered on PROSPERO (CRD42023434032).
BACKGROUND There has been a rapid growth in the application of structured Electronic Health Records (EHRs) to healthcare systems, where huge amounts of diagnosis codes presenting the temporal event of the patient are collected. In the era of artificial intelligence, many models, especially Deep Learning (DL), are applied for patient outcome prediction. This systematic review aimed to identify DL models developed for sequential diagnosis codes for patient outcome prediction. OBJECTIVE The main objective of this systematic review is to identify and summarise existing DL studies predicting patient outcome using sequences of diagnosis codes, as a key part of their predictors. Additionally, this study also investigates the challenge of generalisability and explainability of the predictive models. METHODS In this review, we identified all relevant studies by using the following four databases: PubMed, Embase, IEEE Xplore, and Web of Science. After that, we evaluated the included papers in various aspects: Deep learning techniques, characteristics of the dataset, prediction tasks, performance evaluation, generalizability, and explainability. We also assessed the risk of bias (PROBAST) and the concern of applicability. RESULTS In this review, 84 papers met the eligibility criteria and were selected, which showed the growing trend in this research area. Recurrent neural networks (RNN) (and their derivatives) (n = 47; 57.3%) and Transformers (n = 22; 26.8%) were the most popular architectures in DL-based models. Most studies present their input feature as sequence of visit embedding (n = 45; 53.6%). For the prediction tasks, the most common one is next visit diagnosis (n = 30; 23.4%), followed by heart failure (18; 14.1%), and mortality (n = 17; 13.3%). Only 7 studies evaluated their models in terms of generalisability. A positive correlation was observed between training sample size and model performance (AUROC) (p-value < 0.05). However, about 70% of included studies were found to have high risk of bias. CONCLUSIONS The application of deep learning in sequence of diagnosis has demonstrated remarkable promise in predicting patient outcomes. Using multiple types of features and integration of time intervals was found to improve the predictive performance. Addressing challenges related to generalisation and explainability will be instrumental in unlocking the full potential of DL for enhancing healthcare outcomes and patient care. CLINICALTRIAL This review was registered on PROSPERO (CRD42023434032).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.