Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua 2022
DOI: 10.18653/v1/2022.naacl-industry.21
|View full text |Cite
|
Sign up to set email alerts
|

Medical Coding with Biomedical Transformer Ensembles and Zero/Few-shot Learning

Abstract: Medical coding (MC) is an essential prerequisite for reliable data retrieval and reporting. Given a free-text reported term (RT) such as "pain of right thigh to the knee", the task is to identify the matching lowest-level term (LLT) -in this case "unilateral leg pain"-from a very large and continuously growing repository of standardized medical terms. However, automating this task is challenging due to a large number of LLT codes (as of writing over 80 000), limited availability of training data for long tail/… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 14 publications
0
2
0
Order By: Relevance
“…The number of code representations stored in memory increases linearly as the number of candidate codes to assign increases. The significant increase in memory makes it hard to deploy in the real-world auto ICD classification setting (Ziletti et al 2022;Yan et al 2022).…”
Section: Code Frequencymentioning
confidence: 99%
“…The number of code representations stored in memory increases linearly as the number of candidate codes to assign increases. The significant increase in memory makes it hard to deploy in the real-world auto ICD classification setting (Ziletti et al 2022;Yan et al 2022).…”
Section: Code Frequencymentioning
confidence: 99%
“…Medical text classification is widely recognized as an urgent yet challenging problem due to its extremely imbalanced data distribution, large variety of rare labels (Johnson et al, 2016;Ziletti et al, 2022), and complicated label relationship (Tsai et al, 2021;Vu et al, 2021). Various downstream clinical tasks have been derived from this problem, including ICD coding (Mullenbach et al, 2018;Yuan et al, 2022;Yang et al, 2022) and automated diagnosis (Chen et al, 2020b), showcasing its potential values in modern clinical practice with machine learning approaches (Berner, 2007).…”
Section: Introductionmentioning
confidence: 99%