1999
DOI: 10.1093/bioinformatics/15.12.974
|View full text |Cite
|
Sign up to set email alerts
|

SEGMENT: identifying compositional domains in DNA sequences

Abstract: Here we describe a heuristic segmentation algorithm for DNA sequences, which was implemented on a Windows program (SEGMENT). The program divides a DNA sequence into compositionally homogeneous domains by iterating a local optimization procedure at a given statistical significance. Once a sequence is partitioned into domains, a global measure of sequence compositional complexity (SCC), accounting for both the sizes and compositional biases of all the domains in the sequence, is derived. SEGMENT computes SCC as … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
42
0
3

Year Published

2000
2000
2017
2017

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 48 publications
(45 citation statements)
references
References 11 publications
0
42
0
3
Order By: Relevance
“…Other DNA segmentation algorithms-for example, into compositionally homogeneous DNA domains (Oliver et al 1999) or regions with similar combinatorial features (Chrochemore and Vérin 1998)-have been proposed. The topic is reviewed in Braun and Mueller (1998).…”
mentioning
confidence: 99%
“…Other DNA segmentation algorithms-for example, into compositionally homogeneous DNA domains (Oliver et al 1999) or regions with similar combinatorial features (Chrochemore and Vérin 1998)-have been proposed. The topic is reviewed in Braun and Mueller (1998).…”
mentioning
confidence: 99%
“…For a single Dl\A sequence, our method is equivalent to a generalization of the Jensen-Shannon divergence based segmentation method (Oliver et aL, 1999) except that we take into account the possible overdispersion effect on the choice of the significance leveL Here the overdispersion means that in reality there is greater variability among the columns in (1.1) than would be expected from a statistical model (e.g., product multinomial model), because we can not expect each domain to be completely homogeneous. There are many different segmentation methods in literature.…”
Section: Relation To the Other Methodsmentioning
confidence: 99%
“…Oliver et al 1999;Wan and Wootton 1998). Such measures are sensitive to the presence/absence of certain sub-sequences or consensus patterns in a DNA sequence but fail to take the linguistic structure of the sequence into account.…”
Section: Complexity Analysismentioning
confidence: 99%