2020
DOI: 10.1534/genetics.120.303597
|View full text |Cite
|
Sign up to set email alerts
|

Motto: Representing Motifs in Consensus Sequences with Minimum Information Loss

Abstract: Sequence analysis frequently requires intuitive understanding and convenient representation of motifs. Typically, motifs are represented as position weight matrices (PWMs) and visualized using sequence logos. However, in many scenarios, in order to interpret the motif information or search for motif matches, it is compact and sufficient to represent motifs by wildcard-style consensus sequences (such as [GC][AT]GATAAG[GAC]). Based on mutual information theory and Jensen-Shannon Divergence, we propose a mathemat… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 30 publications
0
2
0
Order By: Relevance
“…Core promoters contain specific transcription factor binding-sites (TFBs), such as the TATA box or the Initiator sequence , that are recognized by transcription factors (TFs) ( Boeva 2016 ). Binding motifs with low identity to the consensus sequence are referred to as “minimal motif” ( Wang et al 2020a ). Transcription factors recruit the protein complexes required for transcription ( Butler and Kadonaga 2002 ).…”
Section: Introductionmentioning
confidence: 99%
“…Core promoters contain specific transcription factor binding-sites (TFBs), such as the TATA box or the Initiator sequence , that are recognized by transcription factors (TFs) ( Boeva 2016 ). Binding motifs with low identity to the consensus sequence are referred to as “minimal motif” ( Wang et al 2020a ). Transcription factors recruit the protein complexes required for transcription ( Butler and Kadonaga 2002 ).…”
Section: Introductionmentioning
confidence: 99%
“…Core promoters contain specific binding motifs, such as the TATA box or the Initiator sequence , that are recognized by transcription factors (tFs) (Boeva, 2016). Binding motifs with low identity to the consensus sequence are referred as minimal motif (Wang et al, 2020a). Transcription factors then recruit the protein complexes required for transcription (Butler and Kadonaga, 2002).…”
Section: Introductionmentioning
confidence: 99%