Background: Data sharing has been a major challenge in biomedical informatics because of privacy concerns. Contextual embedding models have demonstrated strong capability to represent medical concepts and their contexts, and they show promise as a way to support deep learning applications without disclosing the original data. However, contextual embedding models trained at individual hospitals cannot be directly combined, because their embedding spaces differ and naive pooling renders the combined embeddings useless.
Objective: We present a novel approach that addresses these issues to promote sharing representations without sharing data. We build a global model from representations learned from local private data, without sacrificing privacy, and synchronize information from multiple sources.
Methods: We propose a methodology that harmonizes different local contextual embeddings into a global model. We use Word2Vec to generate contextual embeddings from each source and Procrustes analysis to fuse the different vector models into one common space, using a list of corresponding pairs as anchor points. With the harmonized embeddings, we performed prediction analyses.
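The anchor-based Procrustes step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes each site's Word2Vec embeddings for a shared anchor vocabulary are stacked as rows of a matrix, and it solves the orthogonal Procrustes problem via SVD to rotate one site's space onto the other's. The anchor matrices below are synthetic stand-ins for real concept embeddings.

```python
import numpy as np

def procrustes_align(source, target):
    """Return the orthogonal matrix R minimizing ||source @ R - target||_F.

    source, target: (n_anchors, d) arrays holding the embeddings of the
    shared anchor concepts in the local and reference spaces, respectively.
    The closed-form solution is R = U @ Vt, where U, S, Vt is the SVD of
    source.T @ target.
    """
    u, _, vt = np.linalg.svd(source.T @ target)
    return u @ vt

# Toy demonstration: a "local" space that is an exact rotation of the
# "global" space should be recovered perfectly by the alignment.
rng = np.random.default_rng(0)
target = rng.normal(size=(20, 8))            # hypothetical anchor embeddings
true_rot, _ = np.linalg.qr(rng.normal(size=(8, 8)))
source = target @ true_rot.T                 # local space = rotated global space

R = procrustes_align(source, target)
aligned = source @ R                         # map local embeddings into the common space
print(np.allclose(aligned, target, atol=1e-6))
```

In practice the two spaces differ by more than a rotation, so the alignment is approximate and its quality depends on how many anchor pairs are available and how consistently those concepts are used across sites; `scipy.linalg.orthogonal_procrustes` provides an equivalent solver.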
Results: We used sequential medical events extracted from the Medical Information Mart for Intensive Care III (MIMIC-III) database to evaluate the proposed methodology in predicting the next likely diagnosis of a new patient, using either structured or unstructured data. Under different experimental scenarios, we confirmed that the global model built from harmonized local models achieves more accurate predictions than both the local models and a global model built by naive pooling.
Conclusions: Such aggregation of local models through our harmonization can serve as a proxy for a global model, combining information from a wide range of institutions and information sources. It allows information unique to a certain hospital to become available to other sites, increasing the fluidity of information flow in health care.