2010
DOI: 10.1101/gr.112623.110
|View full text |Cite
|
Sign up to set email alerts
|

Accurate inference of transcription factor binding from DNA sequence and chromatin accessibility data

Abstract: Accurate functional annotation of regulatory elements is essential for understanding global gene regulation. Here, we report a genome-wide map of 827,000 transcription factor binding sites in human lymphoblastoid cell lines, which is comprised of sites corresponding to 239 position weight matrices of known transcription factor binding motifs, and 49 novel sequence motifs. To generate this map, we developed a probabilistic framework that integrates cell- or tissue-specific experimental data such as histone modi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

18
609
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 514 publications
(627 citation statements)
references
References 44 publications
(49 reference statements)
18
609
0
Order By: Relevance
“…4). The high GC content in these states is consistent with high gene density (32), and their open chromatin architecture enables high mutagenesis as well as coding function and associated regulation (21,44). The deletion/substitution-warm and cold autosomal states show trends opposite to those of hot and insertion-warm states, and the microsatellite state is depleted in genes and functional marks (Fig.…”
Section: Resultsmentioning
confidence: 87%
“…4). The high GC content in these states is consistent with high gene density (32), and their open chromatin architecture enables high mutagenesis as well as coding function and associated regulation (21,44). The deletion/substitution-warm and cold autosomal states show trends opposite to those of hot and insertion-warm states, and the microsatellite state is depleted in genes and functional marks (Fig.…”
Section: Resultsmentioning
confidence: 87%
“…Therefore, dPCA analysis of multiple surrogate datasets (e.g., HM ChIP-seq) provides a solution to unsupervised characterization of gene regulation dynamics, and it allows one to infer differential binding of many TFs simultaneously using the same set of experiments. Unlike several recent studies that use surrogates to predict TF binding in one condition (19,20), dPCA allows one to predict dynamic changes of TF binding across conditions.…”
Section: Example I: Analysis Of Differential Chromatin Patterns At Tfmentioning
confidence: 99%
“…Many studies have noted the difficulty of accurately predicting transcription factor binding from binding site models alone, with few differences in the number or strength of binding sites between regions that are weakly or strongly bound Pique-Regi et al 2011). For each of the 14 factors with welldefined binding sites, we computed the total number of recognition motifs for these factors within 500 bp of each ChIP peak summit as a measure of its naïve binding potential.…”
Section: Tagteam Motifs Demarcate Highly Bound Clusters Of Transcriptmentioning
confidence: 99%