2019
DOI: 10.1093/nar/gkz540
|View full text |Cite
|
Sign up to set email alerts
|

Co-SELECT reveals sequence non-specific contribution of DNA shape to transcription factor binding in vitro

Abstract: Understanding the principles of DNA binding by transcription factors (TFs) is of primary importance for studying gene regulation. Recently, several lines of evidence suggested that both DNA sequence and shape contribute to TF binding. However, the following compelling question is yet to be considered: in the absence of any sequence similarity to the binding motif, can DNA shape still increase binding probability? To address this challenge, we developed Co-SELECT, a computational approach to analyze the results… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 15 publications
(7 citation statements)
references
References 39 publications
0
7
0
Order By: Relevance
“…However, because SELEX libraries are often only weakly enriched with true binding sequences (Additional file 1: Fig.S1, see also Fig. 1 in [24]), we take only a top percentile of the positive and negative scores (e.g., the top 10%) for ROC AUC value computation. Importantly, before PWM scoring, we extend the random insert sequences obtained from the sequence repository with the primer and barcode sequences that were present (and thus accessible to proteins) during the SELEX experiments.…”
Section: Protocol For Ht-selex Datamentioning
confidence: 99%
“…However, because SELEX libraries are often only weakly enriched with true binding sequences (Additional file 1: Fig.S1, see also Fig. 1 in [24]), we take only a top percentile of the positive and negative scores (e.g., the top 10%) for ROC AUC value computation. Importantly, before PWM scoring, we extend the random insert sequences obtained from the sequence repository with the primer and barcode sequences that were present (and thus accessible to proteins) during the SELEX experiments.…”
Section: Protocol For Ht-selex Datamentioning
confidence: 99%
“…The identified motifs provide a set of novel human regulatory lexicons and can be used to construct a gene regulatory atlas for the human genome. Specifically, shape motifs can potentially contribute to the interpretation of indiscriminate binding behavior of human TFs (76). Co-regulated gene groups, revealed by identification of motifs may define cell-type specific regulons and thus, provide critical biological insights to the cell heterogeneity mechanisms (77).…”
Section: Discussionmentioning
confidence: 99%
“…Though for the simple example in Fig. 1 JUDI takes almost same number of lines as Snakemake, for a larger pipeline as in Pal et al (2018) JUDI requires far less scripting. For example, the file name expansion due to the masked parameter 'group' needed hard-coding in lines 11-12, Listing 1 of Köster and Rahmann (2012), imagine the effort required if there were more masked parameters and one or more parameters had a relatively large number of possible values!…”
Section: Discussionmentioning
confidence: 99%
“…We applied JUDI to the Co-SELECT tool (Pal et al, 2018) developed for the analysis of HT-SELEX data for transcription factor (TF) DNA binding. The tool analyzed 5 rounds of data from 81 TF experiments for 3 families by dividing the sequencing reads into two populations to find statistically significant shape-strings which were enriched at 5 possible thresholds in both populations.…”
Section: A Case Studymentioning
confidence: 99%