2013
DOI: 10.1080/1062936x.2013.773378

An alignment-free method to find similarity among protein sequences via the general form of Chou’s pseudo amino acid composition

Abstract: In this paper, we propose a method to create the 60-dimensional feature vector for protein sequences via the general form of pseudo amino acid composition. The construction of the feature vector is based on the contents of amino acids, total distance of each amino acid from the first amino acid in the protein sequence and the distribution of 20 amino acids. The obtained cosine distance metric (also called the similarity matrix) is used to construct the phylogenetic tree by the neighbour joining method. In orde…
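
The abstract names three groups of 20 features and a cosine distance, but it does not give the formulas. The following is a minimal Python sketch under stated assumptions, not the authors' implementation: "content" is taken as the raw count of each amino acid, "total distance" as the sum of 0-based positions (so the first residue is at distance 0), and "distribution" as the variance of those positions. The names `feature_vector`, `cosine_distance` and `distance_matrix` are hypothetical. The resulting matrix could then be passed to any neighbour joining implementation, which is not shown here.

```python
import math

# The 20 standard amino acids, in a fixed (alphabetical) order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def feature_vector(seq):
    """Return 60 numbers: 20 contents, 20 total distances, 20 distributions.

    All three definitions are assumptions made for illustration only.
    """
    positions = {aa: [] for aa in AMINO_ACIDS}
    for index, residue in enumerate(seq.upper()):
        if residue in positions:  # skip non-standard symbols such as X
            positions[residue].append(index)  # index == distance from first residue

    contents, totals, spreads = [], [], []
    for aa in AMINO_ACIDS:
        pos = positions[aa]
        n = len(pos)
        total = sum(pos)
        mean = total / n if n else 0.0
        # Assumed "distribution": variance of the positions of this amino acid.
        spread = sum((p - mean) ** 2 for p in pos) / n if n else 0.0
        contents.append(float(n))
        totals.append(float(total))
        spreads.append(spread)
    return contents + totals + spreads


def cosine_distance(u, v):
    """Return 1 - cos(angle between u and v), clamped at 0 against rounding."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return max(0.0, 1.0 - dot / (norm_u * norm_v))


def distance_matrix(sequences):
    """Pairwise cosine distances between the feature vectors of the sequences."""
    vectors = [feature_vector(s) for s in sequences]
    return [[cosine_distance(u, v) for v in vectors] for u in vectors]
```

Because the three feature groups live on very different scales, a real analysis would probably normalise them before taking the cosine; the sketch leaves them raw to keep the construction visible.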

Cited by 56 publications (20 citation statements)
References 25 publications
“…As a comparison, we apply the method in Ref. [21] to analyze the dataset in Table 2. The results are shown in Fig.…”
Section: Nd5 Protein Sequences Of 22 Species | Citation type: mentioning
confidence: 99%
“…We use some coronavirus spike proteins as inputs to test our method. All the data comes from the Table 3 in reference [21]. The relations revealed in Fig.…”
Section: Coronavirus Spike Proteins | Citation type: mentioning
confidence: 99%
“…However, as elucidated in [38] and demonstrated by Eqs. (28)-(32) of [38], among the three cross-validation methods, the jackknife test is deemed the least arbitrary (most objective) that can always yield a unique result for a given benchmark dataset, and hence has been increasingly used and widely recognized by investigators to examine the accuracy of various predictors (see, e.g., [46, 52–55, 61–70]). Accordingly, the jackknife test is also adopted here to examine the quality of the present predictor.…”
Section: Prediction Assessment | Citation type: mentioning
confidence: 99%
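
The jackknife test mentioned in the statement above is leave-one-out cross-validation: each sample is held out once and predicted from all the others, so no random split is involved and the result for a given dataset is unique. A minimal Python sketch follows. The quoted text does not say which predictor was used, so the nearest-neighbour rule under cosine distance and the name `jackknife_accuracy` are assumptions made only for illustration.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cos(angle between u and v); assumes non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def jackknife_accuracy(vectors, labels):
    """Leave-one-out accuracy of a 1-nearest-neighbour rule (assumed predictor)."""
    correct = 0
    for i, query in enumerate(vectors):
        # Hold out sample i and find its nearest neighbour among the rest.
        others = [j for j in range(len(vectors)) if j != i]
        nearest = min(others, key=lambda j: cosine_distance(query, vectors[j]))
        if labels[nearest] == labels[i]:
            correct += 1
    return correct / len(vectors)
```

Every sample is tested exactly once against a fixed training set, which is why repeated runs on the same benchmark give the same score.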
“…The concept of Chou's PseAAC was presented in 2001 and then it quickly pierced into many areas of computational proteomics [23, 24, 58–61]. A flexible web server creates a variety of protein PseAACs (http://chou.med.harvard.edu/bioinf/PseAAC).…”
Section: Producing Chou's PseAAC | Citation type: mentioning
confidence: 99%