2006
DOI: 10.1007/s10994-006-8958-3
|View full text |Cite
|
Sign up to set email alerts
|

Gleaner: Creating ensembles of first-order clauses to improve recall-precision curves

Abstract: Many domains in the field of Inductive Logic Programming (ILP) involve highly unbalanced data. A common way to measure performance in these domains is to use precision and recall instead of simply using accuracy. The goal of our research is to find new approaches within ILP particularly suited for large, highly-skewed domains. We propose Gleaner, a randomized search method that collects good clauses from a broad spectrum of points along the recall dimension in recall-precision curves and employs an "at least L… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
28
0
1

Year Published

2010
2010
2021
2021

Publication Types

Select...
7

Relationship

2
5

Authors

Journals

citations
Cited by 41 publications
(31 citation statements)
references
References 37 publications
(34 reference statements)
0
28
0
1
Order By: Relevance
“…Interpolation Estimators As suggested by Davis and Goadrich [6] and Goadrich et al [1], we use PR space interpolation as the basis for several estimators. These methods use the non-linear interpolation between known points in PR space derived from a linear interpolation in ROC space.…”
Section: Point Estimatorsmentioning
confidence: 99%
See 2 more Smart Citations
“…Interpolation Estimators As suggested by Davis and Goadrich [6] and Goadrich et al [1], we use PR space interpolation as the basis for several estimators. These methods use the non-linear interpolation between known points in PR space derived from a linear interpolation in ROC space.…”
Section: Point Estimatorsmentioning
confidence: 99%
“…These methods use the non-linear interpolation between known points in PR space derived from a linear interpolation in ROC space. Davis and Goadrich [6] and Goadrich et al [1] examine the interpolation in terms of the number of true positives and false positives corresponding to each PR point. Here we perform the same interpolation, but use the recall and precision of the PR points directly, which leads to the surprising observation that the interpolation (from the same PR points) does not depend on π. Theorem 1.…”
Section: Point Estimatorsmentioning
confidence: 99%
See 1 more Smart Citation
“…One method of developing more complex rstorder models is to retain more than just the best clause found during search. Taking an idea from the Gleaner algorithm [9] which retains an entire set of rules found during search that span the range of recall values, we have developed a second weak learner that retains a set of the best rules found during search. This weak learner, PRankBoost.Path, contains all rules along the path from the most general rule to the highest-scoring rule found during search.…”
Section: Weak Learnersmentioning
confidence: 99%
“…In the Inductive Logic Programming [6] domain ensembles have been successfully used to increase performance [5,9,10]. Successful ensemble approaches must both learn individual classiers that work well with a set of other classiers as well as combine those classiers in a way that maximizes performance.…”
Section: Introductionmentioning
confidence: 99%