2012
DOI: 10.1093/bioinformatics/bts570
|View full text |Cite
|
Sign up to set email alerts
|

HiCNorm: removing biases in Hi-C data via Poisson regression

Abstract: Supplementary data are available at Bioinformatics online.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
297
0
1

Year Published

2015
2015
2023
2023

Publication Types

Select...
4
3
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 240 publications
(299 citation statements)
references
References 6 publications
(8 reference statements)
1
297
0
1
Order By: Relevance
“…Mapped reads were assigned to restriction fragments with BEDtools 67 , tabulated with custom AWK scripts and imported into R (https://www.r-project.org/). Raw counts of Hi-C links were aggregated in 1 Mb bins and normalized separately for intra-and interchromosomal contacts using HiCNorm 68 . Contact probability matrices were plotted using standard R functions 69 .…”
Section: Methodsmentioning
confidence: 99%
“…Mapped reads were assigned to restriction fragments with BEDtools 67 , tabulated with custom AWK scripts and imported into R (https://www.r-project.org/). Raw counts of Hi-C links were aggregated in 1 Mb bins and normalized separately for intra-and interchromosomal contacts using HiCNorm 68 . Contact probability matrices were plotted using standard R functions 69 .…”
Section: Methodsmentioning
confidence: 99%
“…In the development of this new version of the software, particular attention has been paid at optimising the data structures employed for the construction of the graph, in order to facilitate the parallel implementation of the algorithm. This novel implementation refines the normalisation routine, which relies on a modified version of the Hu et al approach [10]: while the original prototype used the Poisson regression model to provide a score to each read, NuChart-II exploits the same regression analysis to assign a confidence score to each edge of the neighbourhood graph, so that the user can evaluate the reliability of each contact. The engineering of the new software has been conducted on top of FastFlow, using the ParallelFor pattern discussed above (see section II-B): FastFlow aims at simplifying the programmers life in developing complex parallel applications, while providing high runtime efficiency.…”
Section: A Nuchart-iimentioning
confidence: 99%
“…This approach can remove the majority of systematic biases, at the expense of very high computational costs, due to the observation of paired-end reads spanning all possible fragment end pairs. Hu et al [10] proposed a parametric model based on a Poisson regression. This is a simplified, and less computationally intensive normalisation procedure than the one described by Yaffe and Tanay, since it corrects the systematic biases in Hi-C contact maps at the desired resolution level, instead of modelling Hi-C data at the fragment end level.…”
Section: Introductionmentioning
confidence: 99%
“…Mostly during the past five years, several software applications have been developed to analyze and visualize genomic interaction data, with different characteristics and outputs. If we consider Hi-C, available analysis software applications are chromoR [6], HiCdat [7], HiCNorm [8], Hi-Corrector [9], Hi-C Pipeline [10], HiC-Pro [11], HiCUP [12], HiFive [13], HIPPIE [14], HiTC [15], HOMER [16], ICE [17]. However, besides correcting biases and generating contact maps, most of them do not provide the entire pipeline to pre-process the raw data (downloadable files) or to compute topological domain coordinates.…”
Section: Introductionmentioning
confidence: 99%