2018
DOI: 10.1101/gr.227819.117
|View full text |Cite
|
Sign up to set email alerts
|

Sequential regulatory activity prediction across chromosomes with convolutional neural networks

Abstract: Models for predicting phenotypic outcomes from genotypes have important applications to understanding genomic function and improving human health. Here, we develop a machine-learning system to predict cell-type-specific epigenetic and transcriptional profiles in large mammalian genomes from DNA sequence alone. By use of convolutional neural networks, this system identifies promoters and distal regulatory elements and synthesizes their content to make effective gene expression predictions. We show that model pr… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

3
405
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 384 publications
(439 citation statements)
references
References 59 publications
3
405
0
1
Order By: Relevance
“…The standard approach for mapping motif instances bound by TFs in vivo is to extract bound regions from chromatin immunoprecipitation experiments coupled to sequencing (ChIP-seq) using peak-callers [34][35][36][37][38][39] and identify over-represented motifs in these sequences as position weight matrix models (PWM) [40][41][42][43] .While CNNs are ideally suited to model TF binding from motif combinations and their syntax, current models have limited resolution. State-of-the-art CNN models of TF binding predict binary binding events [25][26][27] or low-resolution continuous binding signal averaged across 100-200 bp windows 44 .…”
Section: Introductionmentioning
confidence: 99%
“…The standard approach for mapping motif instances bound by TFs in vivo is to extract bound regions from chromatin immunoprecipitation experiments coupled to sequencing (ChIP-seq) using peak-callers [34][35][36][37][38][39] and identify over-represented motifs in these sequences as position weight matrix models (PWM) [40][41][42][43] .While CNNs are ideally suited to model TF binding from motif combinations and their syntax, current models have limited resolution. State-of-the-art CNN models of TF binding predict binary binding events [25][26][27] or low-resolution continuous binding signal averaged across 100-200 bp windows 44 .…”
Section: Introductionmentioning
confidence: 99%
“…Each layer had 6 distinct computational operations: 1D convolution with filter size 4 or 8 (conv4, conv8), dilated 1D convolution with rate 10 and filter size 4 or 8 (dconv4, dconv8), max-pooling or average pooling with size 4 (maxpool, avgpool). These hyperparameters for computational operations were selected based on previous works 4, 10 . Moreover, we added an identity mapping to each layer that maps input identically to output without any computations (identity), for potentially reducing the child model complexity.…”
Section: Designing Model Search Spacementioning
confidence: 99%
“…The successful applications of CNNs have been largely attributed to their corresponding architectures. Indeed, for CNN applications in genomics and biomedicine, numerous efforts have been devoted to the development of architectures, such as in DeepSEA 4 , Basenji 10 and SpliceAI 7 . This is similar to the extensive efforts in architecture designs for tackling computer vision problems, for example VGG 11 , Inception 12 , and ResNet 13 .…”
mentioning
confidence: 99%
“…Las redes neuronales convolucionales son redes neuronales completamente conectadas que utilizan matrices bidimensionales, llamadas ventanas, para realizar mapeo de los datos, intentado imitar las neuronas de las cortezas visual de cerebro humano (Figura 5) (Eraslan et al, 2019;Wainberg et al, 2018). Estas redes han sido utilizadas para la clasificación de los sitios de unión de factores de transcripción (Zou et al, 2016); Wang, Tai, E, & Wei, 2018), la predicción de fenotipos moleculares (Kelley et al, 2018), metilación de ADN (Zhou et al, 2018), análisis de la expresión génica y microARN (Budach & Marsico, 2018). Las redes neuronales recurrentes son utilizadas cuando se trabaja con datos dinámicos que pueden cambiar en el tiempo.…”
Section: Aprendizaje Profundo Supervisadounclassified