Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing - BioNLP '08 2008
DOI: 10.3115/1572306.1572308
|View full text |Cite
|
Sign up to set email alerts
|

A graph kernel for protein-protein interaction extraction

Abstract: In this paper, we propose a graph kernel based approach for the automated extraction of protein-protein interactions (PPI) from scientific literature. In contrast to earlier approaches to PPI extraction, the introduced alldependency-paths kernel has the capability to consider full, general dependency graphs. We evaluate the proposed method across five publicly available PPI corpora providing the most comprehensive evaluation done for a machine learning based PPI-extraction system. Our method is shown to achiev… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
93
0
9

Year Published

2008
2008
2014
2014

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 63 publications
(102 citation statements)
references
References 18 publications
0
93
0
9
Order By: Relevance
“…Fundel et al (2007) found that, in extraction of relations between genes and proteins, a system based on the SD scheme greatly outperformed the previous best system on the LLL challenge dataset (by an 18% absolute improvement in F-measure). Airola et al (2008) provide more systematic results on a number of proteinprotein interaction datasets. Their graph kernel approach uses an all-dependency-paths kernel which allows their system to consider full dependency graphs.…”
Section: The Formalism and The Toolmentioning
confidence: 99%
“…Fundel et al (2007) found that, in extraction of relations between genes and proteins, a system based on the SD scheme greatly outperformed the previous best system on the LLL challenge dataset (by an 18% absolute improvement in F-measure). Airola et al (2008) provide more systematic results on a number of proteinprotein interaction datasets. Their graph kernel approach uses an all-dependency-paths kernel which allows their system to consider full dependency graphs.…”
Section: The Formalism and The Toolmentioning
confidence: 99%
“…To obtain AUC iP/R curves, the highest precision at each recall point is calculated. The interested reader can find a detailed discussion regarding f-measure versus AUC scores (for PPI extraction systems) in [36].…”
Section: Ranking Of Extracted Pairs and Proteinsmentioning
confidence: 99%
“…Features used for classifier training are normally syntactic and lexical patterns derived from dependency relations between individual words in sentences which are revealed automatically by syntactic parsers. Various kernels have been proposed to calculate similarity between syntactic structures, including subsequence kernel [10], tree kernels [11], shortest path kernel [12], graph kernel [13], or a combination of them [14]. Under this kind of problem setting, one sentence in the dataset yields C 2 n distinct instances, where n is the number of proteins in the sentence and each instance represents a pairwise combination of proteins.…”
Section: Work On Protein-protein Interactions Extractionmentioning
confidence: 99%