Yinghan Ma scite author profile

Yinghan Ma

5Publications

10Citation Statements Received

77Citation Statements Given

How they've been cited

How they cite others

126

Affiliations

Nanjing Tech University, University of Florida

Publications

Order By: Most citations

Measurement of Semantic Textual Similarity in Clinical Texts: Comparison of Transformer-Based Models

Yang¹,

He²,

Zhang³

et al. 2020

JMIR Med Inform

View full text Add to dashboard Cite

Background Semantic textual similarity (STS) is one of the fundamental tasks in natural language processing (NLP). Many shared tasks and corpora for STS have been organized and curated in the general English domain; however, such resources are limited in the biomedical domain. In 2019, the National NLP Clinical Challenges (n2c2) challenge developed a comprehensive clinical STS dataset and organized a community effort to solicit state-of-the-art solutions for clinical STS. Objective This study presents our transformer-based clinical STS models developed during this challenge as well as new models we explored after the challenge. This project is part of the 2019 n2c2/Open Health NLP shared task on clinical STS. Methods In this study, we explored 3 transformer-based models for clinical STS: Bidirectional Encoder Representations from Transformers (BERT), XLNet, and Robustly optimized BERT approach (RoBERTa). We examined transformer models pretrained using both general English text and clinical text. We also explored using a general English STS dataset as a supplementary corpus in addition to the clinical training set developed in this challenge. Furthermore, we investigated various ensemble methods to combine different transformer models. Results Our best submission based on the XLNet model achieved the third-best performance (Pearson correlation of 0.8864) in this challenge. After the challenge, we further explored other transformer models and improved the performance to 0.9065 using a RoBERTa model, which outperformed the best-performing system developed in this challenge (Pearson correlation of 0.9010). Conclusions This study demonstrated the efficiency of utilizing transformer-based models to measure semantic similarity for clinical text. Our models can be applied to clinical applications such as clinical text deduplication and summarization.

show abstract

In situ real-time sequential potentiometric determinations of potassium concentrations from three cochlear regions in noise-exposed rats

Gerhardt

Rybak

et al. 1996

Eur Arch Otorhinolaryngol

View full text Add to dashboard Cite

Double-barrelled potassium selective microelectrodes (K-ISME) were used in situ for real-time sequential determinations of potassium concentrations (CK+) in endolymph, marginal cells and the spiral ligaments of rats exposed to moderate noise at 100 dB for 30 min (NE) and control (CTL) animals. CK+ in NE animals at these sites did not differ significantly when compared to CK+ in CTL animals. However, there was a slight decrease in CK+ in marginal cells in the noise-exposed animals.

show abstract

Effects of cooling rate and cryogenic temperature on the mechanical properties and deformation characteristics of an Al-Mg-Si-Fe-Cr alloy

Liu

Miao

et al. 2023

Journal of Alloys and Compounds

View full text Add to dashboard Cite

Identify diabetic retinopathy-related clinical concepts and their attributes using transformer-based natural language processing methods

Yang

Sweeting

et al. 2022

BMC Med Inform Decis Mak

View full text Add to dashboard Cite

Background Diabetic retinopathy (DR) is a leading cause of blindness in American adults. If detected, DR can be treated to prevent further damage causing blindness. There is an increasing interest in developing artificial intelligence (AI) technologies to help detect DR using electronic health records. The lesion-related information documented in fundus image reports is a valuable resource that could help diagnoses of DR in clinical decision support systems. However, most studies for AI-based DR diagnoses are mainly based on medical images; there is limited studies to explore the lesion-related information captured in the free text image reports. Methods In this study, we examined two state-of-the-art transformer-based natural language processing (NLP) models, including BERT and RoBERTa, compared them with a recurrent neural network implemented using Long short-term memory (LSTM) to extract DR-related concepts from clinical narratives. We identified four different categories of DR-related clinical concepts including lesions, eye parts, laterality, and severity, developed annotation guidelines, annotated a DR-corpus of 536 image reports, and developed transformer-based NLP models for clinical concept extraction and relation extraction. We also examined the relation extraction under two settings including ‘gold-standard’ setting—where gold-standard concepts were used–and end-to-end setting. Results For concept extraction, the BERT model pretrained with the MIMIC III dataset achieve the best performance (0.9503 and 0.9645 for strict/lenient evaluation). For relation extraction, BERT model pretrained using general English text achieved the best strict/lenient F1-score of 0.9316. The end-to-end system, BERT_general_e2e, achieved the best strict/lenient F1-score of 0.8578 and 0.8881, respectively. Another end-to-end system based on the RoBERTa architecture, RoBERTa_general_e2e, also achieved the same performance as BERT_general_e2e in strict scores. Conclusions This study demonstrated the efficiency of transformer-based NLP models for clinical concept extraction and relation extraction. Our results show that it’s necessary to pretrain transformer models using clinical text to optimize the performance for clinical concept extraction. Whereas, for relation extraction, transformers pretrained using general English text perform better.

show abstract

Identify Diabetic Retinopathy-related Clinical Concepts Using Transformer-based Natural Language Processing Methods

Yang

Sweeting

et al. 2021

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yinghan Ma

Measurement of Semantic Textual Similarity in Clinical Texts: Comparison of Transformer-Based Models

In situ real-time sequential potentiometric determinations of potassium concentrations from three cochlear regions in noise-exposed rats

Effects of cooling rate and cryogenic temperature on the mechanical properties and deformation characteristics of an Al-Mg-Si-Fe-Cr alloy

Identify diabetic retinopathy-related clinical concepts and their attributes using transformer-based natural language processing methods

Identify Diabetic Retinopathy-related Clinical Concepts Using Transformer-based Natural Language Processing Methods

Contact Info

Product

Resources

About