2015
DOI: 10.1038/srep11476
|View full text |Cite
|
Sign up to set email alerts
|

Improving prediction of secondary structure, local backbone angles and solvent accessible surface area of proteins by iterative deep learning

Abstract: Direct prediction of protein structure from sequence is a challenging problem. An effective approach is to break it up into independent sub-problems. These sub-problems such as prediction of protein secondary structure can then be solved independently. In a previous study, we found that an iterative use of predicted secondary structure and backbone torsion angles can further improve secondary structure and torsion angle prediction. In this study, we expand the iterative features to include solvent accessible s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
322
1
1

Year Published

2015
2015
2023
2023

Publication Types

Select...
8
1

Relationship

2
7

Authors

Journals

citations
Cited by 328 publications
(335 citation statements)
references
References 45 publications
2
322
1
1
Order By: Relevance
“…Three predictive methods were used: NetsurfP 50 , I-TASSER 51 and SPIDER2 (ref. 52). Residues altered in both disorders (136–137) were excluded from this analysis.…”
Section: Figurementioning
confidence: 99%
“…Three predictive methods were used: NetsurfP 50 , I-TASSER 51 and SPIDER2 (ref. 52). Residues altered in both disorders (136–137) were excluded from this analysis.…”
Section: Figurementioning
confidence: 99%
“…49, 50 Specifically, the advantages of simultaneously training a neural network to predict multiple outcome variables (disease severity, electrophysiological parameters, etc.) may enable a more accurate prediction of phenotype traits as well.…”
Section: Discussionmentioning
confidence: 99%
“…Covering the entire sequence for small RNAs by an optimized window size of 40 ensures the best learning for small RNAs while optimizing the performance for large RNAs. This window size of 40 is much larger than a typical window size (8-10) of a protein used for predicting protein secondary structure and ASA (Zhou and Faraggi 2010;Heffernan et al 2015). This is because 20 different amino acids of proteins with a typical window size of eight can generate 340 features [20×(2 × 8 + 1)], whereas four-letter-coded RNAs require us to expand the window size to 40 in order to achieve a similar number of features [324 = 4×(2 × 40 + 1)] so that a similar amount of information content can be learned from RNAs or from proteins.…”
Section: Discussionmentioning
confidence: 99%
“…To address this question, we focus on the solvent accessibility or the fraction of solvent accessible surface area (ASA) of nucleotides in the tertiary structure of an RNA chain. RNA solvent accessibility, reflecting the level of exposure of a base to solvent, is a one-dimensional structural property of tertiary structure that is more amenable for computational prediction than three-dimensional structure, as demonstrated in predicting the solvent accessibility of proteins (Zhou and Faraggi 2010;Heffernan et al 2015). The ability to predict RNA accessibility of functional tertiary structures directly from RNA sequences would support the notion that functional RNA structures are encoded in their nucleotide sequences alone (i.e., without inputting information from their binding partners).…”
Section: Introductionmentioning
confidence: 99%