2014
DOI: 10.1021/ci400647u
|View full text |Cite
|
Sign up to set email alerts
|

Context-Based Features Enhance Protein Secondary Structure Prediction Accuracy

Abstract: We report a new approach of using statistical context-based scores as encoded features to train neural networks to achieve secondary structure prediction accuracy improvement. The context-based scores are pseudo-potentials derived by evaluating statistical, high-order inter-residue interactions, which estimate the favorability of a residue adopting certain secondary structure conformation within its amino acid environment. Encoding these context-based scores as important training and prediction features provid… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
61
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
5
1
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 59 publications
(61 citation statements)
references
References 39 publications
0
61
0
Order By: Relevance
“…In this work, similar to our previous work employed in DINOSOLVE [29], SCORPION [30], and CASA, we collect statistics of singlets (), doublets (), and triplets () residues at different positions in protein chains in a window of size 7 residues (). These statistics represent approximations of the possibilities of residues adopting certain flexibility states when none, one, or two neighboring residues are considered.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…In this work, similar to our previous work employed in DINOSOLVE [29], SCORPION [30], and CASA, we collect statistics of singlets (), doublets (), and triplets () residues at different positions in protein chains in a window of size 7 residues (). These statistics represent approximations of the possibilities of residues adopting certain flexibility states when none, one, or two neighboring residues are considered.…”
Section: Methodsmentioning
confidence: 99%
“…We use the methods SCORPION [30] and CASA for secondary structure and solvent accessibility predictions, respectively.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Computational methods for PSSP, mostly based on machine learning methods, can be schematically grouped in the following three families: sequence- based methods; network-based methods [9]; hierarchical ensemble methods [10]. Some methods provided predictions of a relatively small set of functional classes [11], while others considered predictions extended to larger sets, using Support Vector machines [12] , HMM Algorithm, artificial neural networks, Bayesian networks, Decision Trees or methods that combine functional linkage networks with learning machines using a logistic regression model or simple algebraic operators.…”
Section: Structure Prediction: Concepts and Techniquesmentioning
confidence: 99%
“…Sometimes it is essential to know protein 3D structures to identify the protein functions at a molecular level. Reliably and accurately predicting protein 3D structure form sequences of proteins is one of the most challenging issues in computational biology [1]. Protein secondary structure prediction is a vital step towards to predict protein tertiary (3D) structure [2], prediction of protein disorder [3], and solvent accessibility prediction [4].…”
Section: Introductionmentioning
confidence: 99%