2021
DOI: 10.1371/journal.pcbi.1009282
|View full text |Cite
|
Sign up to set email alerts
|

Discovering differential genome sequence activity with interpretable and efficient deep learning

Abstract: Discovering sequence features that differentially direct cells to alternate fates is key to understanding both cellular development and the consequences of disease related mutations. We introduce Expected Pattern Effect and Differential Expected Pattern Effect, two black-box methods that can interpret genome regulatory sequences for cell type-specific or condition specific patterns. We show that these methods identify relevant transcription factor motifs and spacings that are predictive of cell state-specific … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 41 publications
(57 reference statements)
0
5
0
Order By: Relevance
“…To identify putative regulators of maturation, we performed motifenrichment analysis on accessible regions using two distinct but complementary approaches. We used a newly developed convolutional neural network-based method called DeepAccess 65,66 which learns the relationship between primary DNA sequence and chromatin accessibility at different ages and then predicts transcription factor motifs that drive differential chromatin accessibility at one age compared to another. In addition, we used HOMER, a de novo motif discovery method that identifies enriched motifs in provided sets of age-specific accessible peaks 67 (Methods).…”
Section: Motif Analysis Identifies Putative Regulators Of Maturationmentioning
confidence: 99%
See 1 more Smart Citation
“…To identify putative regulators of maturation, we performed motifenrichment analysis on accessible regions using two distinct but complementary approaches. We used a newly developed convolutional neural network-based method called DeepAccess 65,66 which learns the relationship between primary DNA sequence and chromatin accessibility at different ages and then predicts transcription factor motifs that drive differential chromatin accessibility at one age compared to another. In addition, we used HOMER, a de novo motif discovery method that identifies enriched motifs in provided sets of age-specific accessible peaks 67 (Methods).…”
Section: Motif Analysis Identifies Putative Regulators Of Maturationmentioning
confidence: 99%
“…Chromosomes 18 and 19 are held out for validation and testing. Methods for computing differential expected pattern effects between cell types are described in 65 . Briefly, we compute a differential expected pattern effect as the ratio between the effect that the presence of a transcription factor motif has on the predicted accessibility of a DNA sequence in one cell type relative to another cell type within a DeepAccess model.…”
Section: Non-negative Least Squares Deconvolution Of Bulk P56 Datamentioning
confidence: 99%
“…Hammelman and Gifford created such a deep learning approach and used it for the identification of cell state‐specific TFs in chromatin accessibility data [79, 80]. Their method is part of the framework DeepAccess and is based on an ensemble of convolutional neural networks.…”
Section: Computational Tools For Tfa Inferencementioning
confidence: 99%
“…Hammelman and Gifford created such a deep learning approach and used it for the identification of cell state-specific TFs in chromatin accessibility data [79,80]. Their method is part of the frame- Worth mentioning is also BPNet which does not predict accessibility, but the binding profile of specific TFs from CHIP-exo data [82].…”
Section: Sequence-based Deep Learning Modelsmentioning
confidence: 99%
“…Conversely, prevailing post hoc model interpretability methods concentrate primarily on the analysis of motifs [12][13][14][15][16][17][18][19][20][21][22] , short DNA sequences associated with regulatory functions. As sequence inputs for DNNs grow longer, deciphering the complex coordination of motifs at a scale of hundreds of kilobases (kb) becomes increasingly difficult to interpret.…”
Section: Introductionmentioning
confidence: 99%