2017
DOI: 10.1016/j.jbi.2017.06.007
|View full text |Cite
|
Sign up to set email alerts
|

Counting trees in Random Forests: Predicting symptom severity in psychiatric intake reports

Abstract: The CEGS N-GRID 2016 Shared Task (Filannino, Stubbs, Uzuner (2017)) in Clinical Natural Language Processing introduces the assignment of a severity score to a psychiatric symptom, based on a psychiatric intake report. We present a method that employs the inherent interview-like structure of the report to extract relevant information from the report and generate a representation. The representation consists of a restricted set of psychiatric concepts (and the context they occur in), identified using medical con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
21
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 21 publications
(22 citation statements)
references
References 19 publications
0
21
0
Order By: Relevance
“…They experimented with different classifiers and approaches. The classifiers most prominently used were Support Vector Regressors [22], Decision Trees [23], Random Forests [24] and Gradient Tree Boosting [25]. The approaches included an ensemble of Convolutional Neural Networks (CNN) with word embeddings [26] and a mixture of Regularized Multinomial Logistic Regression classifiers and Neural Networks [27].…”
Section: Track 2: Symptom Severity Classificationmentioning
confidence: 99%
See 1 more Smart Citation
“…They experimented with different classifiers and approaches. The classifiers most prominently used were Support Vector Regressors [22], Decision Trees [23], Random Forests [24] and Gradient Tree Boosting [25]. The approaches included an ensemble of Convolutional Neural Networks (CNN) with word embeddings [26] and a mixture of Regularized Multinomial Logistic Regression classifiers and Neural Networks [27].…”
Section: Track 2: Symptom Severity Classificationmentioning
confidence: 99%
“…Due to the availability of unannotated data, in addition to their supervised solutions, four teams experimented with semi-supervised approaches, e.g., [22, 24]. Only three teams involved medical experts.…”
Section: Track 2: Symptom Severity Classificationmentioning
confidence: 99%
“…The goal is to decrease the correlation between individual trees, which results in diminished variance when the trees are aggregated. Random forests accommodate sparsity [12], which is favorable in this case, due to a low percentage of patients who reached the primary outcome. The individual trees are designed to overfit on features (making very specific decisions that only account for part of the data set), whereas the voting strategy mitigates these effects by generalizing over the decisions of multiple trees.…”
Section: Introductionmentioning
confidence: 99%
“…They predicted the labels by using a SVM classifier with re-weighted responses to reflect the unbalanced nature of the data. The Antwerp Universitys research team corrected misspellings and the erroneously concatenated words by using hand-written regular expressions and mapped the words to UMLS concepts by using fuzzy matching rules [24]. Instead of using the entire set of mapped concepts, they restricted it to the ones related to psychiatric diagnoses by means of the Diagnostic & Statistical Manual of Mental Disorders (DSM).…”
Section: Resultsmentioning
confidence: 99%