2010
DOI: 10.1007/s12293-010-0045-4
|View full text |Cite
|
Sign up to set email alerts
|

A hierarchical multi-label classification ant colony algorithm for protein function prediction

Abstract: This paper proposes a novel Ant Colony Optimisation algorithm (ACO) tailored for the hierarchical multilabel classification problem of protein function prediction. This problem is a very active research field, given the large increase in the number of uncharacterised proteins available for analysis and the importance of determining their functions in order to improve the current biological knowledge. Since it is known that a protein can perform more than one function and many protein functional-definition sche… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
53
0
7

Year Published

2011
2011
2022
2022

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 47 publications
(60 citation statements)
references
References 30 publications
0
53
0
7
Order By: Relevance
“…The authors later extended this method [23] to allow multi-label classification, considering both tree-and DAG-structured hierarchies.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The authors later extended this method [23] to allow multi-label classification, considering both tree-and DAG-structured hierarchies.…”
Section: Related Workmentioning
confidence: 99%
“…The three methods are detailed in [31]. We also employ the Ant Colony Optimization-based method hmAnt-Miner [23], which is a global method that obtained competitive results when compared to Clus-HMC. We decided to select these algorithms because they were all applied to the same protein function prediction datasets used in the experiments (both for tree and DAG structures).…”
Section: Baseline Algorithmsmentioning
confidence: 99%
“…Firstly, as the hierarchy is traversed down to the leaves, classes at lower levels of the hierarchy usually have very fewer positive instances, requiring base classifiers at these hierarchy levels to be learned from very skewed data sets [19]. In such imbalanced data set learning scenarios, standard classifiers might be overwhelmed by the majority class and can not detect the minority one.…”
Section: Introductionmentioning
confidence: 99%
“…Meanwhile, from the view of protein function prediction, a protein annotated with several GO terms can be viewed as a sample and each GO term corresponds to a label. Thus, the solutions of multi-label classification have great potential to protein function prediction [10][11][12][13][14]: Celine Vens et al [15] discussed three hierarchical multi-label classifiers, which are based on the decision tree algorithm; then, the experiments were carried on 24 datasets from functional genomics; Yu [16] proposed a multiple kernels (ProMK) method to process multiple heterogeneous protein data sources for predicting protein functions; a novel ant colony algorithm was proposed in [17] for hierarchical multi-label classification. Although good results of multi-label classification are achieved by methods mentioned above, the outputs of them are the corresponding label sets for samples in testing set, and the relationships between individual attributes and their most appropriate labels are unknown.…”
Section: Introductionmentioning
confidence: 99%