2018
DOI: 10.1038/s41467-018-05691-7
|View full text |Cite
|
Sign up to set email alerts
|

Decoding topologically associating domains with ultra-low resolution Hi-C data by graph structural entropy

Abstract: Submegabase-size topologically associating domains (TAD) have been observed in high-throughput chromatin interaction data (Hi-C). However, accurate detection of TADs depends on ultra-deep sequencing and sophisticated normalization procedures. Here we propose a fast and normalization-free method to decode the domains of chromosomes (deDoc) that utilizes structural information theory. By treating Hi-C contact matrix as a representation of a graph, deDoc partitions the graph into segments with minimal structural … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
102
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 63 publications
(105 citation statements)
references
References 40 publications
(81 reference statements)
3
102
0
Order By: Relevance
“…2f). Notably, 1.5 Mb, the average size of CD that we can best read off from the 50-kb resolution Hi-C data [17] used in this study, is also similar to the domain size detected by a recently proposed TAD detection algorithm called deDoC [53]. In essence, the concept of "graph structural entropy" used in deDoC is also based on global pattern recognition.…”
Section: Discussionsupporting
confidence: 60%
“…2f). Notably, 1.5 Mb, the average size of CD that we can best read off from the 50-kb resolution Hi-C data [17] used in this study, is also similar to the domain size detected by a recently proposed TAD detection algorithm called deDoC [53]. In essence, the concept of "graph structural entropy" used in deDoC is also based on global pattern recognition.…”
Section: Discussionsupporting
confidence: 60%
“…We next determined if the methods for defining individual TADs influenced the accuracy in detecting TAD splits and mergers. We replaced our default TAD identification method with each of the four state-of-the-art methods [21], including the HiCseq [22], TopDom [16], DomainCaller [4], and IC-finder [15]. The results indicated that the default method in TADsplimer outperformed HiCseq, TopDom, DomainCaller, and IC-finder ( Fig.…”
Section: Simulation Data Demonstrated Superior Performance Of Tadsplimermentioning
confidence: 99%
“…Many algorithms have been developed to define TADs in a single Hi-C sample [4,5,[14][15][16][17]]. However, systematically analyzing the reorganization of TADs in response to biological stimuli remains a technical challenge.…”
Section: Introductionmentioning
confidence: 99%
“…We next asked if there is also genome widely changes of the TADs in response to the thermal stresses, as observed in Drosophila (Djekidel et al, 2015). By applying a recently developed algorithm deDoc to our Hi-C data (Li et al, 2018), we identified 2107, 2207 and 1967 TADs in NM, HS and CS, at 10Kb resolution respectively. The accuracy of TADs boundaries was evidenced by the enrichment of CTCF, and SMC3 ChIP-seq peaks.…”
Section: Tad Structure Are Stable After the Thermal Stressesmentioning
confidence: 99%
“…A/B compartment profile is called by analyzing the first eigevector of KR normalized contact maps at 100Kb resolution (Lieberman-Aiden et al, 2009b). The compartment with higher H3K27ac ChIP-Seq signals were determined as compartment A. TADs were called using deDoc at 10Kb resolutions (Li et al, 2018).…”
Section: Hi-c and Hichip Data Processingmentioning
confidence: 99%