2009
DOI: 10.1186/1471-2105-10-s1-s47
|View full text |Cite
|
Sign up to set email alerts
|

Semi-supervised protein subcellular localization

Abstract: Background: Protein subcellular localization is concerned with predicting the location of a protein within a cell using computational method. The location information can indicate key functionalities of proteins. Accurate predictions of subcellular localizations of protein can aid the prediction of protein function and genome annotation, as well as the identification of drug targets. Computational methods based on machine learning, such as support vector machine approaches, have already been widely used in the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
16
0

Year Published

2009
2009
2022
2022

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 28 publications
(16 citation statements)
references
References 54 publications
(39 reference statements)
0
16
0
Order By: Relevance
“…In SubLoc, a query sequence is converted to 20-dim amino-acid composition vector for classification by SVMs. Recently, Xu et al [20] proposed a semi-supervised learning technique (a kind of transductive learning) that makes use of unlabelled test data to boost the classification performance of SVMs. One limitation of composition-based methods is that information about the sequence order is not easy to represent.…”
Section: B Approaches To Subcellular Localization Predictionmentioning
confidence: 99%
“…In SubLoc, a query sequence is converted to 20-dim amino-acid composition vector for classification by SVMs. Recently, Xu et al [20] proposed a semi-supervised learning technique (a kind of transductive learning) that makes use of unlabelled test data to boost the classification performance of SVMs. One limitation of composition-based methods is that information about the sequence order is not easy to represent.…”
Section: B Approaches To Subcellular Localization Predictionmentioning
confidence: 99%
“…The first group is based on the basic machine learning algorithms, such as neural network [3], k-nearest neighbor (KNN) [5], support vector machine [6], [13], [14], and so forth. The other group contains several novel machine learning paradigms, such as ensemble learning [15], [16], semi-supervised learning [17], multitask learning [18], transfer learning [19], boosted association rules [20], and so on, which substantially improve the performance of prediction of subcellular localization of proteins.…”
Section: Introductionmentioning
confidence: 99%
“…Semi-supervised learning has mostly been studied on protein classification (Weston et al, 2005;Weston et al, 2006;Kall et al, 2007;Craig and Liao, 2007;Xu et al, 2009) and efforts on semi-supervised learning from imbalanced distributions have focused on protein datasets with relatively low imbalance degrees. For example, Kondratovich et al (2013) address the problem of molecule activity prediction and they experiment with ten relatively small (3,000 instances) molecule activity datasets with imbalance degrees no larger than to 1-to-40.…”
Section: Related Workmentioning
confidence: 99%