2020
DOI: 10.1186/s12859-020-3517-7
|View full text |Cite
|
Sign up to set email alerts
|

Broad-coverage biomedical relation extraction with SemRep

Abstract: Background In the era of information overload, natural language processing (NLP) techniques are increasingly needed to support advanced biomedical information management and discovery applications. In this paper, we present an in-depth description of SemRep, an NLP system that extracts semantic relations from PubMed abstracts using linguistic principles and UMLS domain knowledge. We also evaluate SemRep on two datasets. In one evaluation, we use a manually annotated test collection and perform a comprehensive … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
48
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 74 publications
(57 citation statements)
references
References 102 publications
0
48
0
Order By: Relevance
“…There are methods aimed at NER that have been developing during the last years (Kaewphan et al, 2018 ; Korvigo et al, 2018 ; Hemati and Mehler, 2019 ; Hong and Lee, 2020 ; Huang et al, 2020 ; Kilicoglu et al, 2020 ). Most of them are based on algorithms for NER related either to chemicals or biological objects.…”
Section: Introductionmentioning
confidence: 99%
“…There are methods aimed at NER that have been developing during the last years (Kaewphan et al, 2018 ; Korvigo et al, 2018 ; Hemati and Mehler, 2019 ; Hong and Lee, 2020 ; Huang et al, 2020 ; Kilicoglu et al, 2020 ). Most of them are based on algorithms for NER related either to chemicals or biological objects.…”
Section: Introductionmentioning
confidence: 99%
“…The INDRA system can translate scientific prose directly into executable graphical models [108, 109]. The SemRep system (soon to be released in Java) is being upgraded with exciting features, including factuality levels (potentially useful for improving “knowledge hygiene” and identifying contradictory claims [110]) and end-user extensibility [46].…”
Section: Discussionmentioning
confidence: 99%
“…SemMedDB is a knowledge database deployed extensively in biomedical research and developed at the US National Library of Medicine. The knowledge contained in SemMedDB consists of subject-predicate-object triples (or predications) extracted from titles and abstracts in MEDLINE [44] using the SemRep biomedical NLP system [44, 45, 46]. SemRep can be thought of as a machine reading utility for transforming biomedical literature into computable knowledge.…”
Section: Introductionmentioning
confidence: 99%
“…There is disagreement as to whether cooccurrence based methods are too noisy. More complex methods such as relation extraction (Kilicoglu et al, 2020) models exist, however there is a trade-off between precision and recall with these models. Co-occurrence-based models inherently have a high recall since all co-occurrences are considered a relation, but this high recall comes at the expense of precision since many co-occurrences do not in actuality constitute a relationship.…”
Section: Limitationsmentioning
confidence: 99%