2010
DOI: 10.1186/1471-2105-11-2
|View full text |Cite
|
Sign up to set email alerts
|

Predicting gene function using hierarchical multi-label decision tree ensembles

Abstract: BackgroundS. cerevisiae, A. thaliana and M. musculus are well-studied organisms in biology and the sequencing of their genomes was completed many years ago. It is still a challenge, however, to develop methods that assign biological functions to the ORFs in these genomes automatically. Different machine learning methods have been proposed to this end, but it remains unclear which method is to be preferred in terms of predictive performance, efficiency and usability.ResultsWe study the use of decision tree base… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
116
0
3

Year Published

2010
2010
2020
2020

Publication Types

Select...
5
5

Relationship

2
8

Authors

Journals

citations
Cited by 154 publications
(120 citation statements)
references
References 35 publications
(86 reference statements)
1
116
0
3
Order By: Relevance
“…CLUS-HMC can exploit the hierarchical relationships in GO to achieve higher predictive performance (Blockeel et al, 2006;Vens et al, 2008) and was previously used for AFP tasks (Schietgat et al, 2010;Slavkov et al, 2010;Škunca et al, 2013). For each OG and GO term pair, the classifier outputs a score ranging between 0 and 1 that indicates confidence in assignment of that function to the OG.…”
Section: Integrating Across Genomes In a Single Massive Afp Analysismentioning
confidence: 99%
“…CLUS-HMC can exploit the hierarchical relationships in GO to achieve higher predictive performance (Blockeel et al, 2006;Vens et al, 2008) and was previously used for AFP tasks (Schietgat et al, 2010;Slavkov et al, 2010;Škunca et al, 2013). For each OG and GO term pair, the classifier outputs a score ranging between 0 and 1 that indicates confidence in assignment of that function to the OG.…”
Section: Integrating Across Genomes In a Single Massive Afp Analysismentioning
confidence: 99%
“…when composed with online learning [11]; Bayesian learning [10]; support vector machines [6]; and decision tree ensembles [39]. Our embedding approach neither requires nor exploits such side information, and is therefore applicable to different scenarios, but is potentially suboptimal when side information is present.…”
Section: Related Workmentioning
confidence: 99%
“…Table 1 shows how our work fits into the context of related work on hierarchical classification according to these two issues. [15] In addition to the use of our ABC-Miner for hierarchical protein function prediction in a new ageing-related dataset, what is novel in this work are the various methods we employ to combine the outputs of an ensemble's classifiers built with different protein representations in hierarchical classification. As shown in Table 1, three of the four cells marked with the keyword "this work" involve a new combination of the technique of building classifiers with different protein representations (proposed in [18]) with a technique for combining the classifiers' predictions.…”
Section: Proposed Methods For An Ensemble Of Classifiersmentioning
confidence: 99%