1993
DOI: 10.1093/bioinformatics/9.6.671
|View full text |Cite
|
Sign up to set email alerts
|

A hybrid method to cluster protein sequences based on statistics and artificial neural networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

1994
1994
2009
2009

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 12 publications
(11 citation statements)
references
References 0 publications
0
11
0
Order By: Relevance
“…Although self-organized neural networks have already been used for the classification of proteins (Ferran et al 1994;Ferran and Pflugfelder 1993;Ferrara 1991, 1992;Andrade et al 1996) using sequence data, the approach presented here is completely different in the sense that a new type of self-organizing structure has been developed which grows according to the hypothetical pattern of speciation which would have given rise to the set of present-day sequences analyzed. Direct application of the Kohonen algorithm (Kohonen 1990) to data whose internal relationships are described by means of a binary tree may produce a correct segregation into the main groups but lacks a natural way to represent the taxonomic relationships among the individuals.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Although self-organized neural networks have already been used for the classification of proteins (Ferran et al 1994;Ferran and Pflugfelder 1993;Ferrara 1991, 1992;Andrade et al 1996) using sequence data, the approach presented here is completely different in the sense that a new type of self-organizing structure has been developed which grows according to the hypothetical pattern of speciation which would have given rise to the set of present-day sequences analyzed. Direct application of the Kohonen algorithm (Kohonen 1990) to data whose internal relationships are described by means of a binary tree may produce a correct segregation into the main groups but lacks a natural way to represent the taxonomic relationships among the individuals.…”
Section: Discussionmentioning
confidence: 99%
“…In this study we will concentrate on a type of unsupervised neural network known as ''Kohonen self-organizing maps'' (Kohonen 1990). Previous works on sequence analysis have used this algorithm to classify protein sequences into groups (Ferran et al 1994;Ferran and Pflugfelder 1993;Ferrara 1991, 1992) based on their dipeptide compositions.…”
Section: Introductionmentioning
confidence: 99%
“…In addition, we have recently shown that the results issued from the statistical methods can be used not only to validate the classifications obtained with the ANN approach, but also to choose, in a more reasonable way, the number of neurons of the network (Ferran & Pflugfelder, 1993). This can be done by relating a statistical determination of the optimal number of clusters with the number of neurons of the network.…”
Section: Discussionmentioning
confidence: 99%
“…Some other examples of networks recognizing functional motifs were presented by Sternberg (1991, 1992); Ladunga et al (1991); Schneider and Wrede (1993); Hansen et al (1998);Nielsen et al (1997). The second approach is based on using the frequency with which any of the 20 * 20 possible amino acid pairs occurs in the sequence (Ferrán and Pflugfelder 1993), or on using the information extracted from database annotations (Andrade and Valencia 1997).…”
Section: Functional Predictionmentioning
confidence: 99%
“…For example, a neural network system for predicting various aspects of 1D structure based on evolutionary information is by far the most widely used prediction method (Rost et al 1994). Other network-based methods are unique, or superior in their field (Ferrán and Pflugfelder 1993;Riis and Krogh 1996;Andrade and Valencia 1997;Hansen et al 1998;Nielsen et al 1997). Furthermore, neural networks revealed data base errors, and principles underlying protein structures (Brunak 1991;Rost et al 1994;Tolstrup et al 1994;Blom et al 1996).…”
Section: Multiple Sequence Alignmentmentioning
confidence: 99%