2012
DOI: 10.1186/1471-2105-13-157
|View full text |Cite
|
Sign up to set email alerts
|

Minimalist ensemble algorithms for genome-wide protein localization prediction

Abstract: BackgroundComputational prediction of protein subcellular localization can greatly help to elucidate its functions. Despite the existence of dozens of protein localization prediction algorithms, the prediction accuracy and coverage are still low. Several ensemble algorithms have been proposed to improve the prediction performance, which usually include as many as 10 or more individual localization algorithms. However, their performance is still limited by the running complexity and redundancy among individual … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
27
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 36 publications
(27 citation statements)
references
References 23 publications
0
27
0
Order By: Relevance
“…This insight opens avenues for new algorithms for heterogeneous ensemble learning that can balance diversity and performance in a structured and mathematically rigorous way. Lin et al (2012) made a commendable beginning in this direction by proposing such an algorithm for enhancing the prediction of protein subcellular localization. However they applied these algorithms to a set of only about ten base predictors for this problem, and the true potential of these algorithms will be realized when they can be applied to the scale (number and type) of base predictors tested in our study.…”
Section: Results: Ensemble Characteristicsmentioning
confidence: 99%
See 1 more Smart Citation
“…This insight opens avenues for new algorithms for heterogeneous ensemble learning that can balance diversity and performance in a structured and mathematically rigorous way. Lin et al (2012) made a commendable beginning in this direction by proposing such an algorithm for enhancing the prediction of protein subcellular localization. However they applied these algorithms to a set of only about ten base predictors for this problem, and the true potential of these algorithms will be realized when they can be applied to the scale (number and type) of base predictors tested in our study.…”
Section: Results: Ensemble Characteristicsmentioning
confidence: 99%
“…Diversity measures, such as the Q statistic discussed earlier, might play an important role here. Approaches such as those proposed by Lin et al (2012) can be an important step forward in this direction. On the implementation front, although our DataSink framework exploits several parallelization opportunities to make the heterogeneous ensemble learning process more efficient, there are potentially many more avenues to improve this efficiency further.…”
Section: Discussionmentioning
confidence: 99%
“…Prediction of cellular localization was performed using Wolf PSort (https://wolfpsort.hgc.jp) and LOCALIZER (https://localizer.csiro.au; Sperschneider et al, ). Prediction of the presence and localization of the NLS was performed with SeqNLS (Lin et al, ). The coding region of Mg16820 was blasted using default parameters with a cut‐off value of 50 (bitscore) against the M. graminicola genomics resource database (MGRAMBASE; https://insilico.iari.res.in/mgram/), the non‐redundant protein and nucleotide databases for all organisms of the National Center for Biotechnology Information (NCBI), fungal and oomycete genomes (FungiDB; https://fungidb.org/fungidb/), bacterial genomes (Microbial nucleotide blast on NCBI), and the four genomic datasets of the plant‐parasitic nematodes Bursaphelenchus xylophilus , Globodera pallida , M. incognita and M. hapla.…”
Section: Methodsmentioning
confidence: 99%
“…The binding capability of PHB2 with each KPNA was comparable with that of the Ala-replaced NLS mutant PHB2 (R86A, R88A and K89A) ( Fig 1C ). Notably, the scores of NLS sequences within PHB2 predicted using the SeqNLS algorithm [ 25 ] and cNLS Mapper [ 26 ] were substantially less than the cut-off value (data not shown). In recent years, however, it has become increasingly apparent that a number of proteins do not follow these canonical pathways and instead utilize non-conventional mechanisms [ 27 ].…”
Section: Discussionmentioning
confidence: 99%