Automatic Diagnosis Coding of Radiology Reports: A Comparison of
            Deep Learning and Conventional Classification Methods

Karimi, Sarvnaz; Dai, Xiang; Hassanzadeh, Hamed; Nguyen, Anthony

doi:10.18653/v1/w17-2342

Cited by 52 publications

(55 citation statements)

References 11 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…The colon and brain cancer THYME corpus was used in several general domain conference and workshop articles (37,38,40,(67)(68)(69), whereas a radiology report dataset from a 2007 challenge (available from ref. 70) was used in another (71), and SEER-provided (although unshared thus not available for distribution) corpus was used in yet another (72). Other work using ad hoc resources has been used for methods development but this is a less sustainable model due to the rarity of expertise in both cancer and NLP (73)(74)(75).…”

Section: Shareable Resources For Nlp In Oncologymentioning

confidence: 99%

Use of Natural Language Processing to Extract Clinical Cancer Phenotypes from Electronic Medical Records

et al. 2019

View full text Add to dashboard Cite

Current models for correlating electronic medical records with-omics data largely ignore clinical text, which is an important source of phenotype information for patients with cancer. This data convergence has the potential to reveal new insights about cancer initiation, progression, metastasis, and response to treatment. Insights from this real-world data will catalyze clinical care, research, and regulatory activities. Natural language processing (NLP) methods are needed to extract these rich cancer phenotypes from clinical text. Here, we review the advances of NLP and information extraction methods relevant to oncology based on publications from PubMed as well as NLP and machine learning conference proceedings in the last 3 years. Given the interdisciplinary nature of the fields of oncology and information extraction, this analysis serves as a critical trail marker on the path to higher fidelity oncology phenotypes from real-world data.

show abstract

Section: Shareable Resources For Nlp In Oncologymentioning

confidence: 99%

Use of Natural Language Processing to Extract Clinical Cancer Phenotypes from Electronic Medical Records

et al. 2019

View full text Add to dashboard Cite

show abstract

“…A good scope review into radiology report-processing efforts is also presented in [4]. A more recent work involving the use of artificial neural networks and word embeddings for automated diagnosis coding of radiology reports is reported in [6]. A study used an emergency department's earlier medical records to predict and reduce its overcrowding [7].…”

Section: Introductionmentioning

confidence: 99%

“…With respect to automated text classification, in this work, we compared the approaches from the two main paradigms: (1) symbolic text classification, in which texts are represented with sparse vectors of TF-IDF weights, used as input features for traditional machine learning algorithms, such as Logistic Regression (LR) or Support Vector Machine (SVM); and (2) a more recent semantic text classification paradigm, in which dense semantic representations of words-word embeddings-are introduced as input to a neural architecture. Different deep learning architectures have been tried in a number of medical text classification tasks [25][26][27], including automated classification of radiology reports [6,28,29]. While recurrent [29,30] and attention-based neural networks [27,31] may present a viable solution, convolutional neural networks (CNN) seem to generally offer an edge in classification performance as well as faster training times [6,29].…”

Section: Introductionmentioning

confidence: 99%

“…Different deep learning architectures have been tried in a number of medical text classification tasks [25][26][27], including automated classification of radiology reports [6,28,29]. While recurrent [29,30] and attention-based neural networks [27,31] may present a viable solution, convolutional neural networks (CNN) seem to generally offer an edge in classification performance as well as faster training times [6,29]. Furthermore, due to their efficiency and being less data-hungry than, e.g., recurrent networks, CNNs have profiled themselves as a go-to text classification architecture in general-purpose natural language processing [32][33][34].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Automatic Annotation of Narrative Radiology Reports

et al. 2020

View full text Add to dashboard Cite

Narrative texts in electronic health records can be efficiently utilized for building decision support systems in the clinic, only if they are correctly interpreted automatically in accordance with a specified standard. This paper tackles the problem of developing an automated method of labeling free-form radiology reports, as a precursor for building query-capable report databases in hospitals. The analyzed dataset consists of 1295 radiology reports concerning the condition of a knee, retrospectively gathered at the Clinical Hospital Centre Rijeka, Croatia. Reports were manually labeled with one or more labels from a set of 10 most commonly occurring clinical conditions. After primary preprocessing of the texts, two sets of text classification methods were compared: (1) traditional classification models—Naive Bayes (NB), Logistic Regression (LR), Support Vector Machine (SVM), and Random Forests (RF)—coupled with Bag-of-Words (BoW) features (i.e., symbolic text representation) and (2) Convolutional Neural Network (CNN) coupled with dense word vectors (i.e., word embeddings as a semantic text representation) as input features. We resorted to nested 10-fold cross-validation to evaluate the performance of competing methods using accuracy, precision, recall, and F 1 score. The CNN with semantic word representations as input yielded the overall best performance, having a micro-averaged F 1 score of 86 . 7 % . The CNN classifier yielded particularly encouraging results for the most represented conditions: degenerative disease ( 95 . 9 % ), arthrosis ( 93 . 3 % ), and injury ( 89 . 2 % ). As a data-hungry deep learning model, the CNN, however, performed notably worse than the competing models on underrepresented classes with fewer training instances such as multicausal disease or metabolic disease. LR, RF, and SVM performed comparably well, with the obtained micro-averaged F 1 scores of 84 . 6 % , 82 . 2 % , and 82 . 1 % , respectively.

show abstract

“…However, there is still a lack of systematic study on how to select appropriate data to pretrain word vectors or LMs. We observe a range of heuristic strategies in the literature: (1) collecting a large amount of generic data, e.g., web crawl (Pennington et al, 2014;Mikolov et al, 2018); (2) selecting data from a similar field (the subject matter of the content being discussed), e.g., biology (Chiu et al, 2016;Karimi et al, 2017); and, (3) selecting data from a similar tenor (the participants in the discourse, their relationships to each other, and their purposes), e.g., Twitter, or online forums (Li et al, 2017;Chronopoulou et al, 2019). In all these settings, the decision is based on heuristics and varies according to the individual's experience.…”

Section: Introductionmentioning

confidence: 99%

Using Similarity Measures to Select Pretraining Data for

Dai

Karimi

Hachey

et al. 2019

Proceedings of the 2019 Conference of the North

Self Cite

View full text Add to dashboard Cite

Word vectors and Language Models (LMs) pretrained on a large amount of unlabelled data can dramatically improve various Natural Language Processing (NLP) tasks. However, the measure and impact of similarity between pretraining data and target task data are left to intuition. We propose three cost-effective measures to quantify different aspects of similarity between source pretraining and target task data. We demonstrate that these measures are good predictors of the usefulness of pretrained models for Named Entity Recognition (NER) over 30 data pairs. Results also suggest that pretrained LMs are more effective and more predictable than pretrained word vectors, but pretrained word vectors are better when pretraining data is dissimilar.

show abstract

Automatic Diagnosis Coding of Radiology Reports: A Comparison of Deep Learning and Conventional Classification Methods

Cited by 52 publications

References 11 publications

Use of Natural Language Processing to Extract Clinical Cancer Phenotypes from Electronic Medical Records

Use of Natural Language Processing to Extract Clinical Cancer Phenotypes from Electronic Medical Records

Automatic Annotation of Narrative Radiology Reports

Using Similarity Measures to Select Pretraining Data for

Contact Info

Product

Resources

About