2005
DOI: 10.1016/j.artmed.2004.07.016
|View full text |Cite
|
Sign up to set email alerts
|

Comparative experiments on learning information extractors for proteins and their interactions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
225
0
9

Year Published

2006
2006
2016
2016

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 297 publications
(234 citation statements)
references
References 16 publications
0
225
0
9
Order By: Relevance
“…Consistently, recent trends regarding the application of machine learning to biological IE head toward the development of public annotated corpora, targeting such binary relations to compare systems' performances (e.g. AIMed [29], Bioinfer [30], HPRD50 [10], LLL [9]). In this paper, the ontology does not limit us to the extraction of a single relation, but allows the definition of numerous relations.…”
Section: Resultsmentioning
confidence: 99%
“…Consistently, recent trends regarding the application of machine learning to biological IE head toward the development of public annotated corpora, targeting such binary relations to compare systems' performances (e.g. AIMed [29], Bioinfer [30], HPRD50 [10], LLL [9]). In this paper, the ontology does not limit us to the extraction of a single relation, but allows the definition of numerous relations.…”
Section: Resultsmentioning
confidence: 99%
“…These include the protein-interaction datasets from Ray and Craven (2001) and from Bunescu et al (2005), and we have reported elsewhere results on a recent Learning Language and Logic challenge task dataset (Goadrich et al, 2005). Other datasets outside of IE where we believe Gleaner will be useful include the nuclear smuggling dataset from Tang et al (2003), the social network dataset from Taskar et al (2003), the CiteSeer citation dataset from Popescul et al (2003), and the university relation dataset from Richardson and Domingos (2006).…”
Section: Discussionmentioning
confidence: 99%
“…Recently, biomedical journal articles have been a major source of interest in the IE community for a number of reasons: the amount of data available is enormous; the objects, proteins and genes, do not have standard naming conventions; and there is interest from biomedical practitioners to quickly find relevant information (Blaschke et al, 2002;Shatkay and Feldman, 2003;Ray and Craven, 2001;Bunescu et al, 2005). We have focused on learning multi-slot protein localization from Medline 1 abstracts, where the task is to identify links between phrases which correspond to a protein and the location of that particular protein in a cell.…”
Section: Information Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…This is why along ROC curves analysis we validate our binary classifier with the help of PrecisionRecall (PR) curves. PR curves have been mentioned as an alternative to ROC curves for tasks with a large skew in the class distribution (Craven, 2005;Bunescu et al, 2005;Goadrich et al, 2004). Indeed, when the proportion of negative samples is much greater than that of the positive ones, a large change in the fraction of false positives can lead to a minimal change in the false positive rate of the ROC analysis, because they are underrepresented in the test set.…”
Section: Assessing the Classifier Performancementioning
confidence: 99%