2000
DOI: 10.1007/pl00007958
|View full text |Cite
|
Sign up to set email alerts
|

Parallel cascade identification as a means for automatically classifying protein sequences into structure/function groups

Abstract: Current methods for automatically classifying protein sequences into structure/function groups, based on their hydrophobicity profiles, have typically required large training sets. The most successful of these methods are based on hidden Markov models, but may require hundreds of exemplars for training in order to obtain consistent results. In this paper, we describe a new approach, based on nonlinear system identification, which appears to require little training data to achieve highly promising results.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
16
0

Year Published

2001
2001
2006
2006

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 14 publications
(17 citation statements)
references
References 12 publications
1
16
0
Order By: Relevance
“…To increase the accuracy of estimating the coefficients, the impulse response function h k ( j) can first be scaled so that the linear element's output u k (i) has unity mean square [40]. The new residual is then calculated from Eq.…”
Section: Parallel Cascade Identificationmentioning
confidence: 99%
See 3 more Smart Citations
“…To increase the accuracy of estimating the coefficients, the impulse response function h k ( j) can first be scaled so that the linear element's output u k (i) has unity mean square [40]. The new residual is then calculated from Eq.…”
Section: Parallel Cascade Identificationmentioning
confidence: 99%
“…For example, consider the problem of predicting the structure/function family of a novel protein given only its primary amino acid sequence. The amino acid sequences can be regarded as the inputs and their corresponding families as the outputs [40,51]. First, some means is used to map the amino acid sequence into a corresponding numerical sequence and similarly to numerically designate the families to be distinguished.…”
Section: Constructing Class Predictorsmentioning
confidence: 99%
See 2 more Smart Citations
“…7 Parallel Cascade Identification (PCI), a method of nonlinear system identification, 6 has shown itself to be a powerful tool in the identification of such systems, which are often dynamic, complex, and have high order nonlinearities. Recent applications of PCI to bioinformatics include the prediction of protein coding potential from DNA sequences, 9 the identification of adenosine triphosphate (ATP) binding sites on proteins, 4 and the classification of protein sequences into structure/function families.…”
Section: Introductionmentioning
confidence: 99%