2020
DOI: 10.1038/s41467-020-17222-4
|View full text |Cite
|
Sign up to set email alerts
|

Large-scale DNA-based phenotypic recording and deep learning enable highly accurate sequence-function mapping

Abstract: Predicting effects of gene regulatory elements (GREs) is a longstanding challenge in biology. Machine learning may address this, but requires large datasets linking GREs to their quantitative function. However, experimental methods to generate such datasets are either application-specific or technically complex and error-prone. Here, we introduce DNA-based phenotypic recording as a widely applicable, practicable approach to generate large-scale sequence-function datasets. We use a site-specific recombinase to … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
55
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 42 publications
(56 citation statements)
references
References 60 publications
1
55
0
Order By: Relevance
“…(Lou et al ., 2012 ), B. ii. (Qi et al ., 2012 ), C. (Valeri et al ., 2020 ), D. (Höllerer et al ., 2020 ).…”
Section: Context Dependency Challengementioning
confidence: 99%
“…(Lou et al ., 2012 ), B. ii. (Qi et al ., 2012 ), C. (Valeri et al ., 2020 ), D. (Höllerer et al ., 2020 ).…”
Section: Context Dependency Challengementioning
confidence: 99%
“…Strong selection in S. cerevisiae leads to plasmid copy variation [298,299] Role in regulating protein folding [300] Review of codon usage tables [301] Ribosome binding sites (RBS) RBS calculator [302] Machine learning in E. coli [303] Multiprotein RBS optimisation in various bacteria [304] Review of RBS calculator [305] Phenotypic recording with deep learning, using more than 2.7 M sequence-function pairs [306] Translation initiation optimisation Reviews [228,307,308] Significant increase in serine overproduction…”
Section: Codon Usagementioning
confidence: 99%
“…Another important area of optimisation is in microbial biotechnology, whether in finding the best growth medium [186], subsets of genes to manipulate to increase productivity [187,188], or optimal sequences for generating host [189] or protein properties [190,191]. Each of these represents a combinatorial search problem [8,9].…”
Section: Optimisationmentioning
confidence: 99%