2017
DOI: 10.1101/192856
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Extracting Evidence Fragments for Distant Supervision of Molecular Interactions

Abstract: Abstract. We describe a methodology for automatically extracting 'evidence fragments' from a set of biomedical experimental research articles. These fragments provide the primary description of evidence that is presented in the papers' figures. They elucidate the goals, methods, results and interpretations of experiments that support the original scientific contributions the study being reported. Within this paper, we describe our methodology and showcase an example data set based on the European Bioinformatic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…We sought to characterize the performance of several different standard neural network approaches in order to support the use of these tools across various similar focused text classification tasks of relevance to biocuration. This directly builds on previous work performing the same classification task with non-deep learning methods (21) or with FastText classifiers as part of a broader workflow (18). We here examine classification performance for all available word embedding models under a variety of parameterizations (see Figure 2).…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…We sought to characterize the performance of several different standard neural network approaches in order to support the use of these tools across various similar focused text classification tasks of relevance to biocuration. This directly builds on previous work performing the same classification task with non-deep learning methods (21) or with FastText classifiers as part of a broader workflow (18). We here examine classification performance for all available word embedding models under a variety of parameterizations (see Figure 2).…”
Section: Resultsmentioning
confidence: 99%
“…We queried PubMed to obtain title, abstract and MeSH data for each paper. We downloaded * .nxml-formatted files from PMC and invoked the UIMA-Bioc preprocessing pipeline (as described previously) to extract figure captions and ‘evidence fragment’ text pertaining to each subfigure mentioned in the textual narrative (21). Given these data, we performed document-level text classification as a ‘triage’ task (i.e.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Integrating computational approaches into the workflow of biocuration can be seen in many applications such as constructing genomics knowledge base (Baumgartner Jr et al 2007), biomedical document classification (Cohen 2006;Shatkay, Chen, and Blostein 2006;Jiang et al 2017;Simon et al 2019), biomedical text mining (Dowell et al 2009), and human-in-the-loop curation (Lee et al 2018). Some prior works also adopt multimodal machine learning for general biomedical information extractions (Schlegl et al 2015;Eickhoff et al 2017;Zhang et al 2017), as well as textual extraction (Burns, Dasigi, and Hovy 2017), medical image captioning (Shin et al 2016), and automated diagnosis from medical images (Jing, Xie, and Xing 2018;Wang et al 2018;Liu et al 2019a).…”
Section: Related Workmentioning
confidence: 99%