2023
DOI: 10.3389/fphar.2023.1180962
|View full text |Cite
|
Sign up to set email alerts
|

Approach to machine learning for extraction of real-world data variables from electronic health records

Blythe Adamson,
Michael Waskom,
Auriane Blarre
et al.

Abstract: Background: As artificial intelligence (AI) continues to advance with breakthroughs in natural language processing (NLP) and machine learning (ML), such as the development of models like OpenAI’s ChatGPT, new opportunities are emerging for efficient curation of electronic health records (EHR) into real-world data (RWD) for evidence generation in oncology. Our objective is to describe the research and development of industry methods to promote transparency and explainability.Methods: We applied NLP with ML tech… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 34 publications
0
1
0
Order By: Relevance
“… 22 Patient smoking status was extracted by natural language processing of electronic health record documents. 23 In this cohort, overall survival (OS) with routine clinical treatment was calculated from start of treatment in the metastatic setting to death from any cause, and patients without a record of mortality were right censored at the date of their last clinic visit or structured activity. Because patients could not enter the database until a CGP report was delivered, OS risk intervals were left truncated to the date of report to account for immortal time.…”
Section: Methodsmentioning
confidence: 99%
“… 22 Patient smoking status was extracted by natural language processing of electronic health record documents. 23 In this cohort, overall survival (OS) with routine clinical treatment was calculated from start of treatment in the metastatic setting to death from any cause, and patients without a record of mortality were right censored at the date of their last clinic visit or structured activity. Because patients could not enter the database until a CGP report was delivered, OS risk intervals were left truncated to the date of report to account for immortal time.…”
Section: Methodsmentioning
confidence: 99%
“…Традиционные методы, такие как ручной просмотр карт пациентов, требуют много времени и ресурсов, что ограничивает число пациентов, доступных для исследования. ИИ может эффективно обрабатывать большие объёмы данных, позволяя исследователям получить доступ к более широкой популяции пациентов и более эффективно генерировать доказательства, полученные на основе данных РКП [6].…”
Section: адекватность релевантность и качество данных реальной клинич...unclassified
“…Плюсы использования ИИ для оценки и извлечения данных ЭМК [6]: • Эффективность и масштабируемость: методы ИИ позволяют быстро и точно обрабатывать большие объёмы неструктурированных данных, обеспечивая эффективное извлечение клинически значимой информации. Такая масштабируемость позволяет исследователям анализировать данные большего числа пациентов, что ведёт к получению более надёжных и репрезентативных доказательств.…”
Section: адекватность релевантность и качество данных реальной клинич...unclassified