2019
DOI: 10.1186/s13059-019-1658-7
|View full text |Cite
|
Sign up to set email alerts
|

Measuring the reproducibility and quality of Hi-C data

Abstract: BackgroundHi-C is currently the most widely used assay to investigate the 3D organization of the genome and to study its role in gene regulation, DNA replication, and disease. However, Hi-C experiments are costly to perform and involve multiple complex experimental steps; thus, accurate methods for measuring the quality and reproducibility of Hi-C data are essential to determine whether the output should be used further in a study.ResultsUsing real and simulated data, we profile the performance of several rece… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
128
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 132 publications
(133 citation statements)
references
References 52 publications
(65 reference statements)
1
128
0
Order By: Relevance
“…FreeHi-C takes as input raw Hi-C sequencing reads in FASTQ format and estimates the frequency of genomic fragment interactions. This is fundamentally different from existing methods that simulate Hi-C contact matrices under a series of assumptions 9,11,14,15 . Subsequently, FreeHi-C generates pairs of sequencing reads that represent the interacting fragment pairs with embedded random nucleotide mutations and indels while conserving the proportion of chimeric reads.…”
Section: Resultsmentioning
confidence: 83%
See 2 more Smart Citations
“…FreeHi-C takes as input raw Hi-C sequencing reads in FASTQ format and estimates the frequency of genomic fragment interactions. This is fundamentally different from existing methods that simulate Hi-C contact matrices under a series of assumptions 9,11,14,15 . Subsequently, FreeHi-C generates pairs of sequencing reads that represent the interacting fragment pairs with embedded random nucleotide mutations and indels while conserving the proportion of chimeric reads.…”
Section: Resultsmentioning
confidence: 83%
“…Recent maturation of chromosome conformation capture (3C) 1 and Hi-C sequencing technologies 2,3 led to high-throughput profiling of three-dimensional chromatin architecture and revealed transformative insights on long-range gene regulation [4][5][6] . Alongside the technological breakthroughs, a growing number of methodologies and algorithms [7][8][9][10][11][12][13][14][15][16] emerged for the analysis of Hi-C and other 3C-derived data types. These methods are developed and benchmarked on disparate biological and simulated or computationally-constructed datasets that are often customized for the methods under the study.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…To do so, we first identified alternative ways to estimate Contact in the ABC model; although maps of chromatin accessibility and histone modifications are available in many cell types, maps of 3D contacts are not. Because contact frequencies in Hi-C data correlate well across cell types (Note S3) (38,39), we compared versions of the ABC model in which we estimated Contact for each DE-G pair using either K562 Hi-C data or the average Hi-C contact frequency from 8 other human cell types. Both approaches performed similarly at predicting our CRISPR data in K562 cells (AUPRC = 0.66 and 0.68 respectively; Fig.…”
Section: Fig 1 Crispri-flowfish Identifies Regulatory Elements Formentioning
confidence: 99%
“…Previously, similarity measures for comparing Hi-C contact matrices mostly focus on bulk Hi-C data (Yardımcı et al, 2019). These methods evaluate how likely two bulk Hi-C experiments are generated from the same biological sample.…”
Section: Introductionmentioning
confidence: 99%