2020
DOI: 10.1093/jamia/ocaa106
|View full text |Cite|
|
Sign up to set email alerts
|

The 2019 n2c2/UMass Lowell shared task on clinical concept normalization

Abstract: Objective The 2019 National Natural language processing (NLP) Clinical Challenges (n2c2)/Open Health NLP (OHNLP) shared task track 3, focused on medical concept normalization (MCN) in clinical records. This track aimed to assess the state of the art in identifying and matching salient medical concepts to a controlled vocabulary. In this paper, we describe the task, describe the data set used, compare the participating systems, present results, identify the strengths and limitations of the cur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
13
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 31 publications
(17 citation statements)
references
References 58 publications
(65 reference statements)
0
13
0
Order By: Relevance
“…There are a few similar works to our vector space model, CNN-triplet (Mondal et al, 2019), BIOSYN (Sung et al, 2020), RoBERTa-Node2Vec (Pattisapu et al, 2020), and TTI (Henry et al, 2020). CNN-triplet is a two-step approach, requiring a generator to generate candidates for training the triplet network, and requiring various embedding resources as input to CNN-based encoder.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…There are a few similar works to our vector space model, CNN-triplet (Mondal et al, 2019), BIOSYN (Sung et al, 2020), RoBERTa-Node2Vec (Pattisapu et al, 2020), and TTI (Henry et al, 2020). CNN-triplet is a two-step approach, requiring a generator to generate candidates for training the triplet network, and requiring various embedding resources as input to CNN-based encoder.…”
Section: Related Workmentioning
confidence: 99%
“…Research on concept normalization has grown thanks to shared tasks such as disorder normalization in the 2013 ShARe/CLEF (Suominen et al, 2013), chemical and disease normalization in BioCreative V Chemical Disease Relation (CDR) Task , and medical concept normalization in 2019 n2c2 shared task (Henry et al, 2020), and to the availability of annotated data (Dogan et al, 2014;Luo et al, 2019). Existing approaches can be divided into three categories: rule-based approaches using string-matching or dictionary look-up (Leal et al, 2015;D'Souza and Ng, 2015;Lee et al, 2016), which rely heavily on handcrafted rules and domain knowledge; supervised multi-class classifiers (Limsopatham and Collier, 2016;Lee et al, 2017;Tutubalina et al, 2018;Niu et al, 2019;Li et al, 2019), which cannot generalize to concept types not present in their training data; and two-step frameworks based on a nontrained candidate generator and a supervised candidate ranker (Leaman et al, 2013;Li et al, 2017;Liu and Xu, 2017;Nguyen et al, 2018;Murty et al, 2018;Mondal et al, 2019;Ji et al, 2020;Xu et al, 2020), which require complex pipelines and fail if the candidate generator does not find the gold truth concept.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, a new corpus, called MCN [ 3 ], was created exclusively for the clinical term normalization task, which also includes clinical terms of other semantic types. This corpus was provided as the data set for 2019 n2c2 Track 3 [ 17 ], a shared task for clinical term normalization. In this paper, we describe our system that we had submitted for this shared task.…”
Section: Introductionmentioning
confidence: 99%
“…Our system, UWM, achieved an accuracy of 80.79% on the test data set of the MCN corpus, which ranked sixth among the 33 system submissions and was behind by only 1.15% (absolute) to the second ranked system (81.94%) and was well above the mean (74.26%) and the median (77.33%) of all the participating systems [ 17 ]. The top system scored 85.26% and used a massive end-to-end deep learning architecture.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, a new corpus, called MCN [3], was created exclusively for the clinical term normalization task, which also includes clinical terms of other semantic types. This corpus was provided as the data set for 2019 n2c2 Track 3 [17], a shared task for clinical term normalization. In this paper, we describe our system that we had submitted for this shared task.…”
Section: Introductionmentioning
confidence: 99%