2011
DOI: 10.1186/1471-2105-12-154
|View full text |Cite
|
Sign up to set email alerts
|

Learning sparse models for a dynamic Bayesian network classifier of protein secondary structure

Abstract: BackgroundProtein secondary structure prediction provides insight into protein function and is a valuable preliminary step for predicting the 3D structure of a protein. Dynamic Bayesian networks (DBNs) and support vector machines (SVMs) have been shown to provide state-of-the-art performance in secondary structure prediction. As the size of the protein database grows, it becomes feasible to use a richer model in an effort to capture subtle correlations among the amino acids and the predicted labels. In this co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
27
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 36 publications
(28 citation statements)
references
References 47 publications
1
27
0
Order By: Relevance
“…Although PSIPRED had been trained on PSI-BLAST MSAs, HHblits MSAs improved the Q3 score (fraction of correctly predicted secondary structure states) for proteins from the PDBselect 2007 dataset (Online Methods) from 80.4% to 81.3% and the secondary structure segment overlap (SOV) score from 77.5% to 78.6% (Supplementary Table 1). These results, obtained without training a large parameter set, are among the best achieved at present 14 .…”
supporting
confidence: 57%
“…Although PSIPRED had been trained on PSI-BLAST MSAs, HHblits MSAs improved the Q3 score (fraction of correctly predicted secondary structure states) for proteins from the PDBselect 2007 dataset (Online Methods) from 80.4% to 81.3% and the secondary structure segment overlap (SOV) score from 77.5% to 78.6% (Supplementary Table 1). These results, obtained without training a large parameter set, are among the best achieved at present 14 .…”
supporting
confidence: 57%
“…For example, the prediction of protein structure might benefit from large-scale protein functional data that reveal amino acid preferences within particular structural elements (e.g., the paucity of proline residues in β-strands) and the functional effects of mutations that occur at spatially proximal positions. The feasibility of this approach is illustrated by existing structural prediction methods that are founded on these concepts but require extensive existing sequence alignment or structural training data (34,35). Another example relates to the understanding of enzyme mechanism, which might be uncovered by an analysis of the pattern of mutations that increase or decrease catalytic activity in large-scale protein functional data.…”
Section: Discussionmentioning
confidence: 99%
“…Typically, algorithms base predictions on the amino acid preferences in each type of secondary structure (α-helix, β-sheet or loop) in a training set of proteins with known structures 18,19 . As an alternative, large-scale mutational data on proteins with known structures could also reveal amino acid preferences within structural elements, and the resulting preferences used to enhance structure prediction algorithms.…”
Section: Inference Of Fundamental Protein Propertiesmentioning
confidence: 99%