2011
DOI: 10.1002/jcc.21968
|View full text |Cite
|
Sign up to set email alerts
|

SPINE X: Improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles

Abstract: Accurate prediction of protein secondary structure is essential for accurate sequence alignment, three-dimensional structure modeling, and function prediction. The accuracy of ab initio secondary structure prediction from sequence, however, has only increased from around 77% to 80% over the past decade. Here, we developed a multi-step neural-network algorithm by coupling secondary structure prediction with prediction of solvent accessibility and backbone torsion angles in an iterative manner. Our method called… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

4
218
0
1

Year Published

2014
2014
2018
2018

Publication Types

Select...
6
2

Relationship

3
5

Authors

Journals

citations
Cited by 227 publications
(226 citation statements)
references
References 45 publications
4
218
0
1
Order By: Relevance
“…Yu et al (2017) use Chous pseudo amino acid composition and wavelet denoising to prediction structural class. From 2014 to now, several papers (Dehzangi et al, 2014;Wang et al, 2014;Jones, 1999;Faraggi et al, 2012) show that the protein secondary structure is significanc to predict protein structural classes. Firstly the features are extracted, secondly all kinds of algorithms can be used to implement the classification prediction, such as Fisher's linear discriminant algorithm (Yang et al, 2009), Support Vector Machine (SVM) (Cai et al, 2003) and so on.…”
Section: Introductionmentioning
confidence: 99%
“…Yu et al (2017) use Chous pseudo amino acid composition and wavelet denoising to prediction structural class. From 2014 to now, several papers (Dehzangi et al, 2014;Wang et al, 2014;Jones, 1999;Faraggi et al, 2012) show that the protein secondary structure is significanc to predict protein structural classes. Firstly the features are extracted, secondly all kinds of algorithms can be used to implement the classification prediction, such as Fisher's linear discriminant algorithm (Yang et al, 2009), Support Vector Machine (SVM) (Cai et al, 2003) and so on.…”
Section: Introductionmentioning
confidence: 99%
“…We also use predicted secondary structure using SPINE-X which was recently proposed by [46] and attained better results (especially for the coded area) than PSIPRED on predicting protein secondary structure [47]. Given a protein sequence, it returns an L Â 3 matrix (which will be referred to as SPINE-M for the rest of this study) consisting of the normalized probability of contribution of a given amino acid based on its position along the protein sequence to build one of the three secondary structure elements namely, a-helix, b-strands, and coils.…”
Section: Feature Extraction Methodsmentioning
confidence: 99%
“…In this study, we will refer to this sequence as the structural consensus sequence. It is expected that predicted secondary structure using SPINE-X provides significant structural information for the PFR similar to or even better than PSIPRED due to its better performance [17], [23], [30], [46].…”
Section: Feature Extraction Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The residue level information includes: (a) single valued amino acid type (all the necessary information for the correct folding of a protein is encoded in its amino acid sequence [26]); (b) seven physicochemical properties of amino acid (different types, short or long, disordered regions in protein are found to have distinguished physicochemical properties); (c) twenty PSSM's (position specific scoring matrix) indicating the evolutionary information accumulated in each residue position of a protein sequence; (d) three predicted secondary structure (helix, strand and coil) probabilities from SPINE-X [27], one predicted accessible surface area (ASA) normalized by the ASA of an extended conformation (Ala-XAla) [28] and two predicted backbone torsion angle (phi, psi) fluctuations [29] since disordered residues are characterized by lack of stable secondary structure [30], highly exposed area and angle fluctuations; (e) one monogram and twenty bigrams computed from PSSM [31] representing the conserved evolutionary information of PSSM transformed from primary structure level to three dimensional structure level, which are normalized by the median of normal density distribution of monogram and bigram values in their logarithmic space; (f) one indicator for terminal residues (five residues from Nterminal as {−1.0, −0.8, −0.6, −0.4, −0.2}, five residue from C-terminal from {+1.0, +0.8, +0.6, +0.4, +0.2} respectively, with the rest as 0.0). Finally, before feeding the features into the classifier, neighboring residue's information is aggregated using a sliding window of 21 residues (10 residues on each residue to be predicted), resulting in 21 × 56 = 1176 features per residue.…”
Section: B Input Featuresmentioning
confidence: 99%