2019
DOI: 10.1371/journal.pcbi.1007560
|View full text |Cite
|
Sign up to set email alerts
|

Representation learning of genomic sequence motifs with convolutional neural networks

Abstract: Although convolutional neural networks (CNNs) have been applied to a variety of computational genomics problems, there remains a large gap in our understanding of how they build representations of regulatory genomic sequences. Here we perform systematic experiments on synthetic sequences to reveal how CNN architecture, specifically convolutional filter size and max-pooling, influences the extent that sequence motif representations are learned by first layer filters. We find that CNNs designed to foster hierarc… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
118
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 82 publications
(122 citation statements)
references
References 23 publications
1
118
0
Order By: Relevance
“…To generate data-driven hypotheses, first-order and secondorder attribution methods can be employed to identify important local features. Because attribution maps can be noisy, it may be beneficial to employ CNNs that are designed to learn more interpretable representations in first layer filters (Koo & Eddy, 2019). It turns out that CNNs designed to learn interpretable filters also yield more reliable representations with attribution methods .…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…To generate data-driven hypotheses, first-order and secondorder attribution methods can be employed to identify important local features. Because attribution maps can be noisy, it may be beneficial to employ CNNs that are designed to learn more interpretable representations in first layer filters (Koo & Eddy, 2019). It turns out that CNNs designed to learn interpretable filters also yield more reliable representations with attribution methods .…”
Section: Resultsmentioning
confidence: 99%
“…Recent advances have made it possible to intentionally design CNNs to learn more human-interpretable patterns in convolutional filters. This includes design principles based on spatial information flow through the network and employing highly divergent activation functions such as the exponential function (Koo & Eddy, 2019;Koo & Ploenzke, 2019). In parallel, advances have been developed to make direct weight visualization more interpretable (Ploenzke & Irizarry, 2018).…”
Section: Global Interpretabilitymentioning
confidence: 99%
“…These results suggest that while the use of rational features may facilitate the abstraction of potentially relevant information of toehold switch function, the one-hot sequence-only MLP model can recover such information without a priori hypothesisdriven assumptions built into the model if given sufficient training data. 40 In order to evaluate the degree of biological generalization in our sequence-only MLP model, we performed two additional rounds of validation. First, we iteratively withheld each of the 23 tiled viral genomes in the dataset during training and predicted their function as test sets, resulting in a 0.82-0.98 AUROC range (average 0.87, Fig.…”
Section: Improved Prediction Using Sequence-based Multilayer Perceptrmentioning
confidence: 99%
“…By contrast, mechanistic hypothesis-driven models can more directly inform which aspects of a biological theory best explain the observations. Various methods have been established to address this limitation, including alternative network 30 architectures (39), and the use of saliency maps (40,41), which reveal the regions of an input that deep learning models weigh most heavily and therefore pay the most attention to when making predictions. While saliency maps have been previously used to visualize model attention in one-hot representations of sequence data (10,17,18,20,40), such implementations focus only on the primary sequence and have not been developed to identify secondary structure 35 interactions, which are especially relevant in the operation of RNA synthetic biology elements.…”
Section: Visualizing Learned Rna Secondary Structure Motifs With Vis4mentioning
confidence: 99%
See 1 more Smart Citation