2017
DOI: 10.1007/978-3-319-65340-2_12
|View full text |Cite
|
Sign up to set email alerts
|

A Deep Learning Method for ICD-10 Coding of Free-Text Death Certificates

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 12 publications
(13 citation statements)
references
References 10 publications
0
13
0
Order By: Relevance
“…Five, we update the existing manual coding system based on the rule base of regexps to reduce workload and improve the work quality of coders. The technical requirements and computational cost are less than those of the other methods found in most studies [7,11], [32][33][34][35][36]. CNN [18,[34][35][36] is one of the state of the art proposals to solve the problem of automatic ICD coding.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Five, we update the existing manual coding system based on the rule base of regexps to reduce workload and improve the work quality of coders. The technical requirements and computational cost are less than those of the other methods found in most studies [7,11], [32][33][34][35][36]. CNN [18,[34][35][36] is one of the state of the art proposals to solve the problem of automatic ICD coding.…”
Section: Discussionmentioning
confidence: 99%
“…For example, several studies based on machine learning approaches, such as the support vector machine (SVM) method [5][6][7][8], were proposed to automatically assign ICD-10 codes. With the extensive application of deep learning methods in various fields, these methods have also been widely used in automated ICD coding [9][10][11][12]. These studies indicate that deep learning models can produce interpretable results and can code automatically in a reasonable way.…”
Section: Introductionmentioning
confidence: 99%
“…(1) The number of words in common between (a) the headline and the body of the news article, and (b) the headline and the first two sentences of the body; (2) Refutation features, based on the presence of refuting words, listed in a given dictionary, in the headline (e.g., words like deny, doubt, fraud or debunk); the headline and the first two sentences of the body; (6) The soft cosine similarity metric [6], computed between representations leveraging word occurrences for (a) the headline and the entire body, or (b) the headline and the first two sentences of the body; (7) The BLEU score [33] computed between (a) the headline and the set of sentences from the body, and (b) the headline and the first two sentences of the body; (8) ROUGE scores [27] computed between (a) the headline and the set of sentences from the body, and (b) the headline and the first two sentences of the body; (9) The CIDEr similarity score [47], computed between (a) the headline and the set of sentences of the article, and (b) the headline and the first two sentences of the body; (10) The cosine similarity metric, computed between TF-IDF vector representations for the words occurring in the headline, and in the body of the news article. (11) A vector representation of the headline, with 50 dimensions, resulting from a Singular Value Decomposition (SVD) of a matrix with TF-IDF representations for the texts; (12) A vector representation for the body of the article, with 50 dimensions, resulting from a Singular Value Decomposition (SVD) of a matrix with TF-IDF representations for the texts; (13) The cosine similarity metric, computed between the SVD vectors for the headline and the body; (14) A vector representation for the headline, with 300 dimensions, produced by averaging the word2vec embeddings for the words occurring in the headline; (15) A vector representation for the body, with 300 dimensions, produced by averaging the word2vec embeddings for all the words occurring in the body of the article; (16) The cosine similarity metric, computed between averaged word2vec embeddings for the headline and body; (17) Sentiment polarity scores for the headline and the body, computed with basis on a word polarity lexicon. The first 5 features from the previous enumeration were taken from the official FNC-1 baseline system, provided by the organizers.…”
Section: Combining the Matched Representations With External Featuresmentioning
confidence: 99%
“…• The proposed neural network architecture leverages a hierarchical approach for modeling the body of news articles, taking inspiration on previous studies addressing the classification of long documents [14,52]. In this approach, a Recurrent Neural Network (RNN) is used for modeling the sequence of sentences, which in turn are individually modeled by a separate RNN encoding sequences of words.…”
Section: Introductionmentioning
confidence: 99%
“…Duarte et al (2017) use hierarchical recurrent neural networks (RNNs) to annotate death reports with ICD–10 codes. Vani et al (2017) introduced grounded RNNs for EMR coding.…”
Section: Related Workmentioning
confidence: 99%