2003
DOI: 10.1023/a:1024553109779
|View full text |Cite
|
Sign up to set email alerts
|

Untitled

Abstract: We introduce a novel, linguistic-like method of genome analysis. We propose a natural approach to characterizing genomic sequences based on occurrences of fixed length words from a predefined, sufficiently large set of words (strings over the alphabet [A, C, G, T]). A measure based on this approach is called compositional spectrum and is actually a histogram of imperfect word occurrences. Our results assert that the compositional spectrum is an overall characteristic of a long sequence i.e., a complete genome … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2003
2003
2021
2021

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 20 publications
(3 citation statements)
references
References 16 publications
0
3
0
Order By: Relevance
“…Properties like conserved sequence repeats [58], “periodicity signatures” – the formal representation of periodic sequence patterns related to DNA curvature [59] and compositional spectra based on imperfect occurrences of long olignucleotide words [60, 61] are also potentially characteristic of different ecological groups of microbes. For instance, the archaea of the order Halobacteriaceae displayed the “periodicity signatures” distinct from other archaeal species, which might be due to their early divergence from other archaeal lineages, extensive lateral gene transfer or adaptation to high salt environments [59].…”
Section: Sequence Features Of Microbial Genomes Influenced By Lifestylementioning
confidence: 99%
“…Properties like conserved sequence repeats [58], “periodicity signatures” – the formal representation of periodic sequence patterns related to DNA curvature [59] and compositional spectra based on imperfect occurrences of long olignucleotide words [60, 61] are also potentially characteristic of different ecological groups of microbes. For instance, the archaea of the order Halobacteriaceae displayed the “periodicity signatures” distinct from other archaeal species, which might be due to their early divergence from other archaeal lineages, extensive lateral gene transfer or adaptation to high salt environments [59].…”
Section: Sequence Features Of Microbial Genomes Influenced By Lifestylementioning
confidence: 99%
“…It can be seen that data errors, even as high as in case B, still do not prevent the evaluation of the number of different genomes. It was shown by us earlier [3,25], that a fragment spectrum usually almost coincides with the whole-genome spectrum for a large number of genomes. The exception from this finding is a small number of certain bacteria that have significant differences in the spectra of their genome fragments, namely, Borrelia burgdorferi [3,26], and some Spirochete bacteria.…”
Section: Design Of the Numerical Experimentsmentioning
confidence: 86%
“…[22,31,34,36,38,50], and genome informatics, cf. [4,27,33,41]. Moreover, particle physicists are currently utilizing clustering techniques when the ouput-data of large particle detectors, such as ATLAS, CMS 2 or ALICE, cf.…”
Section: Introductionmentioning
confidence: 99%