2011
DOI: 10.1016/j.jtbi.2010.10.037
|View full text |Cite
|
Sign up to set email alerts
|

AFP-Pred: A random forest approach for predicting antifreeze proteins from sequence-derived properties

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
174
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
5
1
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 221 publications
(175 citation statements)
references
References 85 publications
1
174
0
Order By: Relevance
“…The problems can be understood as how to construct an effective feature model, and whether the algorithm performance can be improved. This is because the existing classifier algorithm, such as neural network, (SVM) [8], random forest [9], K-nearest neighbor (KNN) [10], can highly effectively deal with biological problems.…”
Section: Methodsmentioning
confidence: 99%
“…The problems can be understood as how to construct an effective feature model, and whether the algorithm performance can be improved. This is because the existing classifier algorithm, such as neural network, (SVM) [8], random forest [9], K-nearest neighbor (KNN) [10], can highly effectively deal with biological problems.…”
Section: Methodsmentioning
confidence: 99%
“…We obtained AFP dataset used for the development of AFP-Pred [5]. Briefly, antifreeze protein sequences were collected from seed proteins of the Pfam database [18], enriched by performing Position Specific Iteration -Basic Local Alignment Search Tool (PSI-BLAST) [19] with a string threshold (E-value) of 0.001, and followed by manual inspection for presence of AFPs.…”
Section: Datasetmentioning
confidence: 99%
“…Since these proteins hold a promising scope for wide range of biotechnological applications in industry, medicine, food technology, cell lines and organ preservation, cryosurgery and transgenics, gaining knowledge into their functional mechanisms has become increasingly essential [5]. 4 for prediction.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…This is because almost all the existing machine-learning algorithms, such as "Neural Network" or NN algorithm [1][2][3] "Support Vector Machine" or SVM algorithm [4][5][6][7][8][9][10][11][12] "Nearest Neighbor" or NN algorithm [13,14] and "Random Forest" algorithm [15][16][17][18][19][20][21][22] can only handle vectors but not sequence samples as elucidated in a review paper [23]. Unfortunately, if using the sequential model, i.e., the model in which all the samples are represented by their original sequences, it is hardly able to train a machine learning model that can cover all the possible cases concerned, as elaborated in [24].…”
Section: Introductionmentioning
confidence: 99%