Pretrained transformer framework on pediatric claims data for population specific tasks

Zeng, Xianlong; Lin, Simon; Liu, Chang

doi:10.1038/s41598-022-07545-1

Cited by 8 publications

(7 citation statements)

References 33 publications

“…Several FEMRs have been trained on insurance claims, which are typically larger in size and more diverse than EMR data but contain less granular information 63. Examples of claims datasets include Truven Health MarketScan (170 million patients) 64 and Partners For Kids (1.8 million pediatric patients) 65. In terms of data modalities, most FEMRs are unimodal as they only consider structured codes (e.g., LOINC, SNOMED, etc.).…”

Section: State Of Published Clinical FMs (mentioning)

confidence: 99%

The shaky foundations of large language models and foundation models for electronic health records

Wornow, Thapa, et al. 2023, npj Digit. Med.

The success of foundation models such as ChatGPT and AlphaFold has spurred significant interest in building similar models for electronic medical records (EMRs) to improve patient care and hospital operations. However, recent hype has obscured critical gaps in our understanding of these models’ capabilities. In this narrative review, we examine 84 foundation models trained on non-imaging EMR data (i.e., clinical text and/or structured data) and create a taxonomy delineating their architectures, training data, and potential use cases. We find that most models are trained on small, narrowly-scoped clinical datasets (e.g., MIMIC-III) or broad, public biomedical corpora (e.g., PubMed) and are evaluated on tasks that do not provide meaningful insights on their usefulness to health systems. Considering these findings, we propose an improved evaluation framework for measuring the benefits of clinical foundation models that is more closely grounded to metrics that matter in healthcare.

“…Transformers provide a computationally efficient method for learning temporal relationships between data points. Transformers have been applied to solve a variety of predictive health care tasks, including opioid use, 35 coronavirus disease 2019, 36 suicide risk, 37 and asthma exacerbation prediction, 38 as well as an increasing number of generative tasks, such as clinical text generation. 39…”

Section: Deep Generative Models (mentioning)

confidence: 99%

Evaluating the Impact of Health Care Data Completeness for Deep Generative Models

2023

Background: Deep generative models (DGMs) present a promising avenue for generating realistic, synthetic data to augment existing healthcare datasets. However, exactly how the completeness of the original dataset affects the quality of the generated synthetic data is unclear. Objectives: In this paper, we investigate the effect of data completeness on samples generated by the most common DGM paradigms. Methods: We create both cross-sectional and panel datasets with varying missingness and subset rates and train generative adversarial networks (GANs), variational autoencoders (VAEs) and autoregressive models (Transformers) on these datasets. We then compare the distributions of generated data with original training data to measure similarity. Results: We find that increased incompleteness is directly correlated with increased dissimilarity between original and generated samples produced through DGMs. Conclusions: Care must be taken when using DGMs to generate synthetic data as data completeness issues can affect the quality of generated data in both panel and cross-sectional datasets.

“…CLMBR [Steinberg et al, 2021] proposed an autoregressive, "next day" code prediction task to train RNN models. Many authors have presented work using masked language modeling objectives to predict masked or "corrupted" tokens in an input stream [Li et al, 2019, Rasmy et al, 2021b, Pang et al, 2021, Zeng et al, 2022.…”

Section: Deep Learning For Structured EHR Data (mentioning)

confidence: 99%

Self-Supervised Time-to-Event Modeling with Structured Medical Records

Steinberg, Xu, Fries, et al. 2023

Preprint

Time-to-event models (also known as survival models) are used in medicine and other fields for estimating the probability distribution of the time until a particular event occurs. While providing many advantages over traditional classification models, such as naturally handling censoring, time-to-event models require more parameters and are challenging to learn in settings with limited labeled training data. High censoring rates, common in events with long time horizons, further limit available training data and exacerbate the risk of overfitting. Existing methods, such as proportional hazard or accelerated failure time-based approaches, employ distributional assumptions to reduce parameter size, but they are vulnerable to model misspecification. In this work, we address these challenges with MOTOR, a self-supervised model that leverages temporal structure found in large-scale collections of timestamped, but largely unlabeled events, typical of electronic health record data. MOTOR defines a time-to-event pretraining task that naturally captures the probability distribution of event times, making it well-suited to applications in medicine. After pretraining on 8,192 tasks auto-generated from 2.7M patients (2.4B clinical events), we evaluate the performance of our pretrained model after fine-tuning to unseen time-to-event tasks. MOTOR-derived models improve upon current state-of-the-art C statistic performance by 6.6% and decrease training time (in wall time) by up to 8.2 times. We further improve sample efficiency, with adapted models matching current state-of-the-art performance using 95% less training data.
