2014
DOI: 10.1371/journal.pcbi.1003628
|View full text |Cite
|
Sign up to set email alerts
|

Genomic Characterization of Large Heterochromatic Gaps in the Human Genome Assembly

Abstract: The largest gaps in the human genome assembly correspond to multi-megabase heterochromatic regions composed primarily of two related families of tandem repeats, Human Satellites 2 and 3 (HSat2,3). The abundance of repetitive DNA in these regions challenges standard mapping and assembly algorithms, and as a result, the sequence composition and potential biological functions of these regions remain largely unexplored. Furthermore, existing genomic tools designed to predict consensus-based descriptions of repeat … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
143
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
6
3

Relationship

1
8

Authors

Journals

citations
Cited by 107 publications
(157 citation statements)
references
References 70 publications
3
143
0
Order By: Relevance
“…There is resurgent interest in the repeated sequence elements of the human genome as their "post-Human Genome Project" characterization continues (21). For many applications, we envision the need to identify sets of repeated DNA sequences that are unique to a single locus on a chromosome or are present at multiple sites on only one chromosome in the complement (i.e., as a bar code).…”
Section: Discussionmentioning
confidence: 99%
“…There is resurgent interest in the repeated sequence elements of the human genome as their "post-Human Genome Project" characterization continues (21). For many applications, we envision the need to identify sets of repeated DNA sequences that are unique to a single locus on a chromosome or are present at multiple sites on only one chromosome in the complement (i.e., as a bar code).…”
Section: Discussionmentioning
confidence: 99%
“…But the observation that at least a subset of satDNA families are involved in important biological roles makes the study of these highly abundant and fast-evolving components of the eukaryote genome highly relevant for structural, functional and evolutionary genomics. It is also worth mentioning that in the past few years, new and efficient bioinformatic tools are becoming available for the identification of satDNAs from sequenced genomes (Novák et al, 2014), whereas longtemplate sequencing and new computational approaches (Altemose et al, 2014) have been fostering the assembly of satDNA repeats. With all these new findings and tools, genomic studies, including those related to wholesequenced genomes, will certainly benefit by a 'satDNA recall'.…”
mentioning
confidence: 99%
“…The normal distribution is a real valued distribution whose mean, median and mode are equal [2]. This is symmetrical about mean; and not suitable for the variables which are inherently positive or negatively skewed.…”
Section: Properties Of Normal Distributionmentioning
confidence: 99%