2014
DOI: 10.1186/1471-2105-15-s11-s14
|View full text |Cite
|
Sign up to set email alerts
|

Improving protein fold recognition by random forest

Abstract: BackgroundRecognizing the correct structural fold among known template protein structures for a target protein (i.e. fold recognition) is essential for template-based protein structure modeling. Since the fold recognition problem can be defined as a binary classification problem of predicting whether or not the unknown fold of a target protein is similar to an already known template protein structure in a library, machine learning methods have been effectively applied to tackle this problem. In our work, we de… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
37
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
4
3
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 61 publications
(37 citation statements)
references
References 46 publications
0
37
0
Order By: Relevance
“…The server performs up to eight iterative psi‐blast searches through filtered versions of the nonredundant (nr) database from NCBI. Using the final target alignment, a hidden Markov model (HMM) profile is calculated. Homologous templates are identified by searching through a database containing HMMs for a representative subset of PDB sequences.…”
Section: Methodsmentioning
confidence: 99%
“…The server performs up to eight iterative psi‐blast searches through filtered versions of the nonredundant (nr) database from NCBI. Using the final target alignment, a hidden Markov model (HMM) profile is calculated. Homologous templates are identified by searching through a database containing HMMs for a representative subset of PDB sequences.…”
Section: Methodsmentioning
confidence: 99%
“…RF is an ensemble method of binary decision trees that are trained separately, and it is appropriate for classification and regression problems [54]. The fundamental approach used for classification problems by RF is based on training separately each decision tree, whereas the final outcome is estimated by taking into account the results obtained by each decision tree [55].…”
Section: Random Forest (Rf)mentioning
confidence: 99%
“…RF-Fold uses random forests, a highly scalable classification method, to recognize protein folds [35]. A random forest is composed of many decision trees that are each trained on datasets of target-template protein pairs.…”
Section: Protein Fold Recognitionmentioning
confidence: 99%