2012
DOI: 10.1002/pds.3205
|View full text |Cite
|
Sign up to set email alerts
|

Automating classification of free‐text electronic health records for epidemiological studies

Abstract: Although a training set still needs to be created manually, text mining can help reduce the amount of manual work needed to incorporate narrative data in an epidemiological study and will make the data extraction more reproducible. An advantage of machine learning is that it is able to pick up specific language use, such as abbreviations and synonyms used by physicians.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
27
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 31 publications
(27 citation statements)
references
References 16 publications
(15 reference statements)
0
27
0
Order By: Relevance
“…The advantage of using the assertion filter and bag-of-words feature representation on Dutch EMRs is presented in [24]. Since the total number of features was still very high even after preprocessing, which makes machine learning computationally expensive and may also hamper the predictive accuracy of the classifier, we performed chi-square feature selection [43].…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The advantage of using the assertion filter and bag-of-words feature representation on Dutch EMRs is presented in [24]. Since the total number of features was still very high even after preprocessing, which makes machine learning computationally expensive and may also hamper the predictive accuracy of the classifier, we performed chi-square feature selection [43].…”
Section: Methodsmentioning
confidence: 99%
“…We selected the four top-performing algorithms from a previous study [24], in which many well-known machine-learning algorithms were evaluated for the classification of EMRs in a similar experimental setting.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The era of RWD and RWE calls for closer collaborations with experts from computer science, data science, informatics, genomic research, and other disciplines. Data-adaptive techniques (such as machine learning) combined with thoughtful human input are increasingly being used to mine electronic health record databases [34, 35] and improve analytic methods commonly used in pharmacoepidemiology [36, 37]. The ability to collect more data from mobile devices enables exploration of new issues, e.g., the relation between weather and joint pain in patients with rheumatoid arthritis [38].…”
Section: Boldermentioning
confidence: 99%
“…“spikes” and “sharp waves”), generalized periodic discharges, lateralized periodic discharges, and rhythmic delta activity [1]. To derive useful knowledge from these free-text reports, researchers typically have to perform weeks to months of intensive manual work to review and categorize these reports [7] [8]. Automated algorithms offer the advantages of saving human labor, increased speed and the ability to scale the process to larger datasets [11].…”
Section: Introductionmentioning
confidence: 99%