2011
DOI: 10.1136/amiajnl-2011-000155
|View full text |Cite
|
Sign up to set email alerts
|

Using machine learning for concept extraction on clinical documents from multiple data sources

Abstract: Objective Concept extraction is a process to identify phrases referring to concepts of interests in unstructured text. It is a critical component in automated text processing. We investigate the performance of machine learning taggers for clinical concept extraction, particularly the portability of taggers across documents from multiple data sources. Methods We used BioTagger-GM to train machine learning taggers, which we originally developed for the detection of gene/protein names in the biology domain. Train… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
63
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 106 publications
(63 citation statements)
references
References 32 publications
0
63
0
Order By: Relevance
“…In Expanding exploration or survey of various data source including protein, gene as well Bio Tagger was prepared on it in the domain of medical text mining. The Experimentation or Testing result demonstrated in which Bio Tagger conceivably valuable to extract the protein, gene in the form of huge dataset accommodated for the Training [12]. Content characteristics dependably assume a vital part in Named Entity Recognition; the framework"s execution can be considerably enhanced via expansion of many characteristics.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…In Expanding exploration or survey of various data source including protein, gene as well Bio Tagger was prepared on it in the domain of medical text mining. The Experimentation or Testing result demonstrated in which Bio Tagger conceivably valuable to extract the protein, gene in the form of huge dataset accommodated for the Training [12]. Content characteristics dependably assume a vital part in Named Entity Recognition; the framework"s execution can be considerably enhanced via expansion of many characteristics.…”
Section: Related Workmentioning
confidence: 99%
“…In past few researches, the use of orthographic features is widely advocated in [12]- [14]. Our used experimental orthographic features are shown in Table I.…”
Section: ) Orthographic Featuresmentioning
confidence: 99%
See 2 more Smart Citations
“…Many researchers have tried to extract concepts from texts (Gelfand et al, 1998;Hovy et al, 2009;Villalon and Calvo, 2009;Dinh and Tamine, 2011;Torii et al, 2011). Hovy narrowed the domain of interest into concepts "below" a given seed term.…”
Section: Related Workmentioning
confidence: 99%