1995
DOI: 10.1073/pnas.92.19.8700
|View full text |Cite
|
Sign up to set email alerts
|

Prediction of protein folding class using global description of amino acid sequence.

Abstract: We present a method for predicting protein folding class based on global protein chain description and a voting process. Selection of the best descriptors was achieved by a computer-simulated neural network trained on a data base consisting of 83 folding classes. Protein-chain descriptors include overall composition, transition, and distribution of amino acid attributes, such as relative hydrophobicity, predicted secondary structure, and predicted solvent exposure. Cross-validation testing was performed on 15 … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
386
0
5

Year Published

1998
1998
2017
2017

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 515 publications
(393 citation statements)
references
References 23 publications
2
386
0
5
Order By: Relevance
“…Each protein pair (t,t′) is also subjected to pairwise local sequence alignment using the SmithWaterman algorithm implemented in EMBOSS, and the alignment scores S SW (t,t′) are expressed in logarithmic form (d) PROFEAT kernel. The PROFEAT server 16 computes 1447 protein descriptors from protein sequence including descriptors developed by Dubchak et al 17 that account for the composition, transition, and distribution of structural and physicochemical properties such as hydrophobicity, polarity, charge, and solvent accessibility. Each descriptor is separately normalized to the value range [0,1], and each target t is represented by a vector Φ P (t) of 1447 normalized descriptor values.…”
Section: Support Vector Machine Theorymentioning
confidence: 99%
“…Each protein pair (t,t′) is also subjected to pairwise local sequence alignment using the SmithWaterman algorithm implemented in EMBOSS, and the alignment scores S SW (t,t′) are expressed in logarithmic form (d) PROFEAT kernel. The PROFEAT server 16 computes 1447 protein descriptors from protein sequence including descriptors developed by Dubchak et al 17 that account for the composition, transition, and distribution of structural and physicochemical properties such as hydrophobicity, polarity, charge, and solvent accessibility. Each descriptor is separately normalized to the value range [0,1], and each target t is represented by a vector Φ P (t) of 1447 normalized descriptor values.…”
Section: Support Vector Machine Theorymentioning
confidence: 99%
“…In that work, they used the least free energy principle to solve the problem. Dubchak et al used a neural network-based method containing three depictors related to five amino acid attributes called composition, transition, and distribution [4,13]. One of the basic studies related to fold classification was performed by Ding and Dubchak.…”
Section: Introductionmentioning
confidence: 99%
“…The global description of amino acid sequence [13] has been widely used in various fields. It uses three descriptors composition, transition and distribution that are deprived for the physicochemical properties to generate the sequence descriptors.…”
Section: Introductionmentioning
confidence: 99%