2018
DOI: 10.1101/276915
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Chaos Game Representation: An Alignment-Free Technique for Exploring Evolutionary Relationships of Protein Sequences

Abstract: Chaos Game Representation (CGR) is an iterative mapping technique, which shows patterns in amino acids or nucleotide sequences. Here we present a method for using CGR to explore evolutionary relationships of protein sequences based on amino acid properties and illustrate the approach with complete sets of protein translations from viral genomes. In an analysis of complete polyprotein sequences from the viral family Flaviviridae, the CGR method was able to cluster members of major viral groups together, but rel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 28 publications
0
1
0
Order By: Relevance
“…Considering the limitation that a 20-vertex CGR cannot be used to demonstrate the similarity of protein sequences with conservative substitution, Basu [14] proposed a 12-vertex CGR, with each vertex of a regular 12-sided polygon representing an amino acid with its conservative substitutions. The number of the vertices in CGR M. D. was then reduced to four [15] [16], with each vertex of the square representing one of the four groups of amino acids, that is, the non-polar, uncharged polar, negative polar, and positive polar groups. The reduction in the vertices of CGR images can help represent the similarity in protein sequences.…”
Section: Introductionmentioning
confidence: 99%
“…Considering the limitation that a 20-vertex CGR cannot be used to demonstrate the similarity of protein sequences with conservative substitution, Basu [14] proposed a 12-vertex CGR, with each vertex of a regular 12-sided polygon representing an amino acid with its conservative substitutions. The number of the vertices in CGR M. D. was then reduced to four [15] [16], with each vertex of the square representing one of the four groups of amino acids, that is, the non-polar, uncharged polar, negative polar, and positive polar groups. The reduction in the vertices of CGR images can help represent the similarity in protein sequences.…”
Section: Introductionmentioning
confidence: 99%