2006
DOI: 10.1007/s10994-006-8713-9

First order random forests: Learning relational classifiers with complex aggregates

Abstract: In relational learning, predictions for an individual are based not only on its own properties but also on the properties of a set of related individuals. Relational classifiers differ in how they handle these sets: some use properties of the set as a whole (via aggregation), some refer to properties of specific individuals in the set, but most classifiers do not combine both. This imposes an undesirable bias on these learners. This article describes a learning approach that avoids this bias…

Cited by 50 publications (56 citation statements). References 30 publications.
“…Van Assche et al [21] have made the first implementation of combined aggregates and selections in an ILP system. They have extended the refinement operator of the relational decision tree learner Tilde [3] to include so-called complex aggregates: literals of the form F (V, Q, R), where F is an aggregate function (e.g., count), V is an aggregate variable occurring in the aggregate query Q, and R is the result of applying F to the set of all answer substitutions for V that Q results in (we will call this set the result set of Q).…”
Section: Related Work (mentioning, confidence: 99%)
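The complex-aggregate literal F(V, Q, R) described above can be sketched in a few lines. This is an illustrative assumption of the semantics, not the Tilde implementation: the aggregate query Q is modelled as a predicate over candidate bindings for the aggregate variable V, and R is the result of applying F to the resulting answer set.

```python
# Hedged sketch of a complex-aggregate literal F(V, Q, R):
# F is an aggregate function (e.g. count), Q an aggregate query in which
# the aggregate variable V occurs, and R the result of applying F to the
# set of answer substitutions for V (the "result set" of Q).
# The data and helper names below are toy assumptions, not the Tilde code.

def complex_aggregate(F, V, Q, domain):
    """Evaluate F over the result set of query Q.

    Q is modelled as a predicate over candidate bindings for V;
    the result set is the list of V-values in `domain` satisfying Q."""
    result_set = [v for v in domain if Q(v)]  # answer substitutions for V
    return F(result_set)                      # R = F(result set of Q)

# Example: count the atoms of a toy molecule (element, partial charge).
atoms = [("c", 0.02), ("c", 0.08), ("o", -0.10), ("h", 0.06)]

# Most general query: every atom qualifies, so F = count yields 4.
R = complex_aggregate(len, "A", lambda atom: True, atoms)
print(R)  # → 4
```

Here `len` plays the role of the `count` aggregate function; any function over the result set (min, max, average) would fit the same interface.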
“…A complex aggregate can be constructed by iterative refinement of Q, starting with a general query (e.g., the number of atoms of a molecule) and ending with a very specific one (e.g., the number of carbon atoms bound with an aromatic bond type to an atom with charge larger than 0.06). The feature explosion resulting from combining aggregate functions with selection conditions was handled by upgrading Tilde to a random forest [21] and taking advantage of the feature sampling applied at each node of the trees. Charnay et al [6] have recently proposed an alternative solution, by introducing a hill-climbing approach to build complex aggregates incrementally.…”
Section: Related Work (mentioning, confidence: 99%)
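The iterative refinement of Q described above can be illustrated with a toy molecular schema (assumed here for illustration; this is not the Mutagenesis data or the Tilde refinement operator): the inner query is specialised from "all atoms" down to "carbon atoms aromatically bound to an atom with charge larger than 0.06", and the count aggregate is re-evaluated at each step.

```python
# Illustrative sketch of refining a complex aggregate's inner query Q,
# from a general query (count all atoms) to a very specific one (count
# carbon atoms with an aromatic bond to an atom of charge > 0.06).
# The toy atom/bond tables below are assumptions for illustration.

atoms = {1: ("c", 0.07), 2: ("c", 0.02), 3: ("o", -0.04), 4: ("c", 0.10)}
bonds = [(1, 4, "aromatic"), (2, 3, "single"), (1, 2, "aromatic")]

def neighbours(a, bond_type):
    """Atoms bound to atom `a` by a bond of the given type (either direction)."""
    return [y for (x, y, t) in bonds if t == bond_type and x == a] + \
           [x for (x, y, t) in bonds if t == bond_type and y == a]

# Step 0 — most general query: every atom of the molecule.
q0 = [a for a in atoms]

# Refined query: carbon atoms aromatically bound to an atom with charge > 0.06.
q2 = [a for a in atoms
      if atoms[a][0] == "c"
      and any(atoms[n][1] > 0.06 for n in neighbours(a, "aromatic"))]

print(len(q0), len(q2))  # → 4 3
```

Each refinement step adds a selection condition inside Q, which is exactly the feature explosion the random-forest feature sampling mentioned above is meant to tame.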
“…The formatting of the feature value emphasizes the hierarchical, combinatorial structure of these count of count values (which can also be nested deeper into count of count of count values, etc.). Note that count of count features in our sense are quite different from nested aggregates in the sense of [1]: unlike the latter, our counts of counts do not aggregate a multiset of values into a single number at each level of nesting.…”
Section: Introduction (mentioning, confidence: 97%)
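The distinction drawn above can be made concrete: a count-of-count feature in this sense keeps the whole multiset of inner counts per individual rather than collapsing it to one number per nesting level. The following toy data and names are assumptions for illustration, not drawn from the cited work.

```python
# Hedged sketch of a "count of count" feature: per author, the multiset of
# per-paper citation counts is kept as-is, instead of being aggregated to a
# single number (as a nested aggregate would do). Toy data assumed below.

from collections import Counter

citations_per_paper = {"p1": 3, "p2": 3, "p3": 0, "p4": 7}
papers_per_author = {
    "a1": ["p1", "p2"],
    "a2": ["p3", "p4"],
}

# The feature value for each author is a multiset (Counter) of inner counts,
# not a single aggregated number.
count_of_counts = {
    author: Counter(citations_per_paper[p] for p in papers)
    for author, papers in papers_per_author.items()
}
print(count_of_counts["a1"])  # → Counter({3: 2})
```

A nested aggregate, by contrast, would map each author's multiset to one value (e.g. its sum or count) before aggregating again at the outer level.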
“…Perhaps the count of actors with a proportion of ≥ 50% award winning movies among the movies they participated in is the more relevant feature (here evaluating to 0 and 2, respectively, assuming that the movie to be classified is not considered in the count). Nested aggregates like this have been used in conjunction with first-order decision trees [1]. This very simple example illustrates three interesting aspects of relational features.…”
Section: Introduction (mentioning, confidence: 99%)
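The actor/movie feature quoted above can be sketched as a two-level nested aggregate. The data below is a toy assumption chosen so the feature evaluates to 2; it is not the example data from the cited paper.

```python
# Hedged sketch of the nested aggregate described above: count the actors
# whose proportion of award-winning movies, among the movies they appeared
# in, is at least 50%. Toy relational data assumed for illustration.

acted_in = {
    "alice": ["m1", "m2"],
    "bob":   ["m2", "m3", "m4"],
    "carol": ["m4"],
}
award_winning = {"m1", "m4"}

def proportion_awarded(movies):
    """Inner aggregate: fraction of an actor's movies that won an award."""
    return sum(m in award_winning for m in movies) / len(movies)

# Outer aggregate: count of actors for which the inner aggregate >= 0.5.
# alice: 1/2 = 0.5 (counts), bob: 1/3 (does not), carol: 1/1 (counts).
feature = sum(proportion_awarded(ms) >= 0.5 for ms in acted_in.values())
print(feature)  # → 2
```

Each level here does aggregate its multiset to a single number, which is precisely what distinguishes nested aggregates from the count-of-count features discussed in the preceding citation statement.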