Yeju Park scite author profile

Yeju Park

2Publications

0Citation Statements Received

82Citation Statements Given

How they've been cited

How they cite others

Affiliations

Kangbuk Samsung Hospital, Seoul National University Hospital, Sungkyunkwan University

Publications

Order By: Most citations

Identifying Alcohol-Related Information From Unstructured Bilingual Clinical Notes With Multilingual Transformers

Kim

Park

et al. 2023

IEEE Access

View full text Add to dashboard Cite

As a key modifiable risk factor, alcohol consumption is clinically crucial information that allows medical professionals to further understand their patients' medical conditions and suggest appropriate lifestyle modifying interventions. However, identifying alcohol-related information from unstructured freetext clinical notes is often challenging. Not only are the formats of the notes inconsistent, but they also include a massive amount of non-alcohol-related information. Furthermore, for medical institutions outside of English-speaking countries, these clinical notes contain both a mixture of English and local languages, inducing additional difficulty in the extraction. Thanks to the increasing availability of electronic medical record (EMR), several previous works explored the idea of using natural language processing (NLP) to train machine learning models that automatically identify alcohol-related information from unstructured clinical notes. However, all these previous works are limited to English clinical notes, thereby able to leverage various large-scale external ontologies during the text preprocessing. Furthermore, they rely on simple NLP techniques such as the bag-of-words models that suffer from high dimensionality and out-ofvocabulary issues. Addressing these issues, we adopt fine-tuning multilingual transformers. By leveraging their linguistically rich contextual information learned during their pre-training, we are able to extract alcohol-related information from unstructured clinical notes without preprocessing the clinical notes on any external ontologies. Furthermore, our work is the first to explore the use of transformers in bilingual clinical notes to extract alcohol-related information. Even with minimal text preprocessing, we achieve extraction accuracy of 84.70% in terms of macro F-1 score. INDEX TERMS clinical informatics, alcohol information extraction, natural language processing, information extraction from clinical notes, multilingual transformers I. INTRODUCTIONAs medical institutions worldwide are widely adopting electronic medical record (EMR), vast amounts of healthcare data are produced and stored electronically [1-5]. As a significant component of EMR, clinical notes, which record patients' conditions in free text, provide essential information such as patients' medical history, social history, or lifestyle patterns. Despite being a vital data source, its practical use in medical decision support systems is hampered by challenges in extracting key information from its unstructured text format [6][7][8][9]. For medical institutions in non-English speaking countries, these challenges are compounded by the use of both English and local languages in their clinical notes. Besides standardizing the notes to resolve any inconsistent formats or structures, they need to handle the multilingual aspect of their notes simultaneously. Due to this additional

show abstract

Implementation of Interoperable Healthcare Standards for Community Healthcare

Bae

Park

Lee

et al. 2023

View full text Add to dashboard Cite

Building an integrated data model that includes not only clinical data but also personal health records has become increasingly important. We aimed to build a big data healthcare platform by developing a common data model that can be utilized in the healthcare field. To this end, we acquired health data from various communities to establish community care digital healthcare service models. Further, to improve personal health data interoperability, we ensured conformance to international standards, namely, the Systemized Nomenclature of Medicine Clinical Terms (SNOMED-CT) and transmission standards, namely, Health Level 7 Fast Healthcare Interoperability Resource (HL7 FHIR). Furthermore, FHIR resource profiling was designed to transmit and receive data, following the HL7 FHIR R4 guidelines.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yeju Park

Identifying Alcohol-Related Information From Unstructured Bilingual Clinical Notes With Multilingual Transformers

Implementation of Interoperable Healthcare Standards for Community Healthcare

Contact Info

Product

Resources

About