2017
DOI: 10.1186/s12920-017-0264-3
|View full text |Cite
|
Sign up to set email alerts
|

Taking promoters out of enhancers in sequence based predictions of tissue-specific mammalian enhancers

Abstract: BackgroundMany genetic diseases are caused by mutations in non-coding regions of the genome. These mutations are frequently found in enhancer sequences, causing disruption to the regulatory program of the cell. Enhancers are short regulatory sequences in the non-coding part of the genome that are essential for the proper regulation of transcription. While the experimental methods for identification of such sequences are improving every year, our understanding of the rules behind the enhancer activity has not p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 35 publications
0
6
0
Order By: Relevance
“…In this study, a two-layer classifier was proposed for predicting fertility-related protein. The high efficiency of this method has been reported in previous studies such as predicting membrane proteins 92 , enhancer prediction 93 , remote protein homology detection 94 , identifying piwi-interacting RNAs 34 and miRNA Drosha processing site detection 95 .…”
Section: Resultsmentioning
confidence: 73%
“…In this study, a two-layer classifier was proposed for predicting fertility-related protein. The high efficiency of this method has been reported in previous studies such as predicting membrane proteins 92 , enhancer prediction 93 , remote protein homology detection 94 , identifying piwi-interacting RNAs 34 and miRNA Drosha processing site detection 95 .…”
Section: Resultsmentioning
confidence: 73%
“…For straightforward comparison and interpretation of k-mer count vectors, we standardized the length of both types of regulatory regions at 1 kb, centered at the TSS or at the middle genomic position of the enhancer. Guided by our previous experience [ 8 ], we used k-mers of the length 4 because the shorter k-mers are less informative, while the longer are represented in too few data elements, given the data size. The steps of our sequence and graph analysis are illustrated in Figure S1 .…”
Section: Resultsmentioning
confidence: 99%
“…We assessed the sequence composition and the sequence similarity of interacting regions through analysis of their DNA sequence k-mer content, for k = 4, with selected results confirmed for k = 1 and k = 2. The k-mer content was used before for identifying enhancers, for distinguishing enhancers from promoters and distinguishing among enhancers with different tissue specificity [ 7 , 8 , 9 ]. K-mers, or a related operation of sequence convolution, were also used for prediction of individual promoter–enhancer interactions [ 10 , 11 , 12 ].…”
Section: Introductionmentioning
confidence: 99%
“…Enhancers are DNA elements of up to 50–1500 base pairs (bp) [13]. They interact with their target promoters irrespective of their position to regulate downstream gene expression [14].…”
Section: Introductionmentioning
confidence: 99%