2017
DOI: 10.1371/journal.pcbi.1005176
|View full text |Cite
|
Sign up to set email alerts
|

Automated incorporation of pairwise dependency in transcription factor binding site prediction using dinucleotide weight tensors

Abstract: Gene regulatory networks are ultimately encoded by the sequence-specific binding of (TFs) to short DNA segments. Although it is customary to represent the binding specificity of a TF by a position-specific weight matrix (PSWM), which assumes each position within a site contributes independently to the overall binding affinity, evidence has been accumulating that there can be significant dependencies between positions. Unfortunately, methodological challenges have so far hindered the development of a practical … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
8
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(8 citation statements)
references
References 32 publications
0
8
0
Order By: Relevance
“…An intrinsic limitation to PFMs/PWMs is that they ignore inter-nucleotide dependencies within TFBSs ( 9 13 ). TF–DNA interaction data derived from next-generation sequencing assays has improved the computational modeling of TF binding ( 14 19 ). For example, the TF flexible models (TFFMs) ( 14 ), based on first-order hidden Markov models, capture dinucleotide dependencies within TFBSs and were introduced in the 2016 release of the JASPAR database.…”
Section: Introductionmentioning
confidence: 99%
“…An intrinsic limitation to PFMs/PWMs is that they ignore inter-nucleotide dependencies within TFBSs ( 9 13 ). TF–DNA interaction data derived from next-generation sequencing assays has improved the computational modeling of TF binding ( 14 19 ). For example, the TF flexible models (TFFMs) ( 14 ), based on first-order hidden Markov models, capture dinucleotide dependencies within TFBSs and were introduced in the 2016 release of the JASPAR database.…”
Section: Introductionmentioning
confidence: 99%
“…It is widely recognized that PWMs are not adequate representations of TFBS complexity, which contain significant positional correlations [18] and multiple efforts have been made to go beyond the PWM model [2][3][4][5]. As a complementary approach, we show that positional dependency effects can be examined by conditioning PSSVs on specific ancestral nucleotides.…”
Section: Multiple Tfbs Exhibit Positional Correlations Based On Ances...mentioning
confidence: 99%
“…However, positional dependencies are known to occur in binding sites of many TFs. Extensions to the basic PWM format have been proposed to account for positional dependencies [2][3][4][5]. Nevertheless, PWMs persist as the dominant motif representation format, because of ease of calculation and because they are easily visualized via sequence logos [6].…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…Position weight matrices assume each substitution at a base pair position has an independent effect on the binding affinity of the protein to the motif and the magnitude of the effect is related to conservation of the base pair position in the frequency matrix (Stormo, 2000). There is no shortage of programs that can search sequences based on the position weight matrix (Frith, Li, & Weng, 2003;Kel et al, 2003;Tan & Lenhard, 2016;Wang, Martins, & Danko, 2016); however, the generation of a position frequency matrix can involve bias (Teytelman, Thurtle, Rine, & van Oudenaarden, 2013) and the assumption of independence of base pairs may often be unwarranted (Bulyk, Johnson, & Church, 2002;Man & Stormo, 2001;Omidi et al, 2017). The lack of independence is especially nontrivial in EREs, as loss of a perfect half site has a larger effect on the binding affinity than point mutations after the half site is lost (Deegan et al, 2011;Tyulmenkov & Klinge, 2001).…”
mentioning
confidence: 99%