2019
DOI: 10.1016/j.cell.2019.04.046
|View full text |Cite
|
Sign up to set email alerts
|

A Deep Neural Network for Predicting and Engineering Alternative Polyadenylation

Abstract: Highlights d Trained a neural network to predict APA using data from over 3 million reporters d Visualized learned features to reveal a rich cis-regulatory code for APA d Developed and tested an algorithm to accurately engineer polyadenylation signals d Predicted and experimentally characterized over 12,000 human APA variants

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

6
244
0
2

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 169 publications
(280 citation statements)
references
References 89 publications
6
244
0
2
Order By: Relevance
“…The 100 toeholds with the lowest ON/OFF ratio were one-hot encoded and fed as inputs to the static model. Target ON and OFF values of 0.99 and 0.001, respectively, were set and supplied to SeqProp, an open-source python package that enables streamlined development 22 of gradient ascent pipelines for genomic and RNA biology applications 22 . Toehold design constraints were incorporated into the loss function, such that the modified toehold switch contained the conserved sequences and base pairing within the hairpin was preserved.…”
Section: Storm: a Sequence-based Toehold Optimization And Redesign Modelmentioning
confidence: 99%
See 1 more Smart Citation
“…The 100 toeholds with the lowest ON/OFF ratio were one-hot encoded and fed as inputs to the static model. Target ON and OFF values of 0.99 and 0.001, respectively, were set and supplied to SeqProp, an open-source python package that enables streamlined development 22 of gradient ascent pipelines for genomic and RNA biology applications 22 . Toehold design constraints were incorporated into the loss function, such that the modified toehold switch contained the conserved sequences and base pairing within the hairpin was preserved.…”
Section: Storm: a Sequence-based Toehold Optimization And Redesign Modelmentioning
confidence: 99%
“…To improve toehold switch design and prediction, we took inspiration from the field of machine learning and deep learning. Machine learning approaches have been applied successfully to synthetic biology 1,19 , as exemplified in a recent study by Yang et al 19 using a 'white box' approach to extract antibiotic mechanisms of action, and in motif finding and DNA sequence prediction tasks [20][21][22] . As typical deep learning approaches require large amounts of training examples, we used a dataset from Angenent-Mari et al (2019) 23 that experimentally characterized 91,534 toehold switches, in both active and repressed states.…”
Section: Introductionmentioning
confidence: 99%
“…Convolutional neural networks (CNNs) in particular can learn the combinatorial patterns embedded within input examples without the need for alignment of examples. Recent studies have begun to take advantage of CNNs to tackle aspects of gene regulation 6 , including models that predict chromatin state [7][8][9] , TF binding 10,11 , polyadenylation 12 , or gene expression 7,13 solely on the basis of DNA (100bp-1Mb) or RNA sequences, with the potential to ferret out relevant motifs.…”
Section: Introductionmentioning
confidence: 99%
“…Indeed, [24] has recently explored the possibility of using recurrent neural networks for biophysically modeling gene expression in flies. In principle, it should be possible to apply similar strategies to MPRAs performed on genome-wide or random sequence libraries [38,4,39,40,6]. But across all of biology, very few individual cis-regulatory sequences have been characterized to the level that our modeling strategy aims to elucidate.…”
mentioning
confidence: 99%