2008
DOI: 10.1186/1471-2105-9-546
|View full text |Cite
|
Sign up to set email alerts
|

Barcodes for genomes and applications

Abstract: Background: Each genome has a stable distribution of the combined frequency for each k-mer and its reverse complement measured in sequence fragments as short as 1000 bps across the whole genome, for 1

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
94
0
1

Year Published

2008
2008
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 91 publications
(95 citation statements)
references
References 21 publications
(25 reference statements)
0
94
0
1
Order By: Relevance
“…For example, N (4) = 136. Our first observation is that the combined k-mer frequency distribution is highly stable across the whole genome, for any fixed k-mer; and this is true for any sequenced genome, prokaryotic or eukaryotic, chromosomal or organelle [55] .…”
Section: Genome Visualization In Support Of Knowledge Discoverymentioning
confidence: 94%
See 2 more Smart Citations
“…For example, N (4) = 136. Our first observation is that the combined k-mer frequency distribution is highly stable across the whole genome, for any fixed k-mer; and this is true for any sequenced genome, prokaryotic or eukaryotic, chromosomal or organelle [55] .…”
Section: Genome Visualization In Support Of Knowledge Discoverymentioning
confidence: 94%
“…Various computational techniques have been developed for characterizing and identifying these mobile elements [19][20][52][53][54] . While the prediction of the MGEs has reached a good level of maturity, reliable prediction of HTGs remains a very challenging problem [55][56][57][58] .…”
Section: What Is Known About Bacterial Genomes In Generalmentioning
confidence: 99%
See 1 more Smart Citation
“…Vì vậy, để áp dụng mô hình ẩn cho việc phân tích trình tự metagenomic, cần chuyển đổi trình tự (là một dạng một chuỗi ký tự hợp thành từ 4 ký tự A, G, T, C) thành các từ có độ dài k-mer, ứng với mỗi từ trong tài liệu. Theo [14,18], k=4 được đánh giá là phù hợp.…”
Section: B Chuyển Trình Tự Thành Tài Liệuunclassified
“…Some of the GS are based on information theory concepts [7], others rely on statistical properties [9,8,11,12], and some others state that what is important is the spatial information between sequences [4,13]. All GS proposals depend on the idea that phylogenetically close genomes have similar GS, while phylogenetically distant organisms present different GS.…”
Section: Introductionmentioning
confidence: 99%