2015
DOI: 10.1186/s13326-015-0019-z
|View full text |Cite
|
Sign up to set email alerts
|

Concept selection for phenotypes and diseases using learn to rank

Abstract: BackgroundPhenotypes form the basis for determining the existence of a disease against the given evidence. Much of this evidence though remains locked away in text – scientific articles, clinical trial reports and electronic patient records (EPR) – where authors use the full expressivity of human language to report their observations.ResultsIn this paper we exploit a combination of off-the-shelf tools for extracting a machine understandable representation of phenotypes and other related concepts that concern t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
10
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
9

Relationship

1
8

Authors

Journals

citations
Cited by 12 publications
(11 citation statements)
references
References 29 publications
(21 reference statements)
1
10
0
Order By: Relevance
“…It is interesting to find that BeCAS which uses deterministic finite automatons, performed worse than simple dictionary lookup. However, BeCAS has shown similar type of performance in previous studies ( 51 , 52 ).…”
Section: Resultssupporting
confidence: 74%
“…It is interesting to find that BeCAS which uses deterministic finite automatons, performed worse than simple dictionary lookup. However, BeCAS has shown similar type of performance in previous studies ( 51 , 52 ).…”
Section: Resultssupporting
confidence: 74%
“…"white blood cell" instead of "blood cell" or "cell"). Collier et al [26] applied MetaMap and cTAKES to the extraction of phenotypes and other related concepts that concern the diagnosis and treatment of diseases. They concluded that cTAKES performs well overall but that annotation performance varies widely across semantic types, and that MetaMap with the strict matching and word sense disambiguation features enabled can have superior precision.…”
Section: Diagnostic Knowledge Extraction Using Metamap and Ctakesmentioning
confidence: 99%
“…The majority of normalisation methods are based on matching entity mentions against concept synonyms listed in a terminological resource (e.g., [ 22 , 23 , 51 – 53 ]); more sophisticated methods combine or rank the results obtained using a number of different terminological resources [ 54 , 55 ]. Approaches based on pattern-matching or regular expressions (e.g., [ 56 – 58 ]) can account for frequently occurring variations not listed in the terminological resource (e.g., Greek or Roman suffixes for genes) and/or by helping to post-process initial normalisation output [ 59 ], in order to better handle problematic cases such as abbreviations or coordinated phrases [ 60 ].…”
Section: Related Workmentioning
confidence: 99%