Committee-based Selection of Weakly Labeled Instances for Learning Relation Extraction

Bobić, Tamara; Klinger, Roman

doi:10.13053/rcs-70-1-14

Cited by 2 publications

(2 citation statements)

References 24 publications

(19 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Experiments on 5 PPI corpora show mixed results. Bobić and Klinger [25] proposed the use of query-by-committee to select instances instead. This approach was similar to the active learning paradigm, with a difference that unlabeled instances are weakly annotated, rather than by human experts.…”

Section: Related Workmentioning

confidence: 99%

“…Mintz et al [24] assumes that if two entities have a relationship in a known knowledge base, then all sentences that contain this pair of entities will express the relationship. Since its emergence, distant supervision has been widely adopted to information extraction in news domain [24] as well as in biomedical text mining [25–28]. However, the original assumption by Mintz et al [24] does not always hold and false-positive instances may be generated during automatic instance construction procedure.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Chemical-induced disease relation extraction via attention-based distant supervision

Sun

Qian

et al. 2019

BMC Bioinformatics

View full text Add to dashboard Cite

Background Automatically understanding chemical-disease relations (CDRs) is crucial in various areas of biomedical research and health care. Supervised machine learning provides a feasible solution to automatically extract relations between biomedical entities from scientific literature, its success, however, heavily depends on large-scale biomedical corpora manually annotated with intensive labor and tremendous investment. Results We present an attention-based distant supervision paradigm for the BioCreative-V CDR extraction task. Training examples at both intra- and inter-sentence levels are generated automatically from the Comparative Toxicogenomics Database (CTD) without any human intervention. An attention-based neural network and a stacked auto-encoder network are applied respectively to induce learning models and extract relations at both levels. After merging the results of both levels, the document-level CDRs can be finally extracted. It achieves the precision/recall/F1-score of 60.3%/73.8%/66.4%, outperforming the state-of-the-art supervised learning systems without using any annotated corpus. Conclusion Our experiments demonstrate that distant supervision is promising for extracting chemical disease relations from biomedical literature, and capturing both local and global attention features simultaneously is effective in attention-based distantly supervised learning.

show abstract

Section: Related Workmentioning

confidence: 99%