2020
DOI: 10.1371/journal.pone.0227621
|View full text |Cite
|
Sign up to set email alerts
|

Deep2Full: Evaluating strategies for selecting the minimal mutational experiments for optimal computational predictions of deep mutational scan outcomes

Abstract: Performing a complete deep mutational scan with all single point mutations may not be practical, and may not even be required, especially if predictive computational models can be developed. Computational models are however naive to cellular response in the myriads of assay-conditions. In a realistic paradigm of assay context-aware predictive hybrid models that combine minimal experimental data from deep mutational scans with structure, sequence information and computational models, we define and evaluate diff… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(19 citation statements)
references
References 47 publications
(81 reference statements)
0
19
0
Order By: Relevance
“…In contrast to previous machine learning approaches utilising DMS data (e.g. [36, 37, 38]), here we introduce DNA mutational signatures as a feature to be evaluated for its predictive utility, and explicitly test the contribution of different feature combinations in classifying variant impact (Figure 5A). We note that only a small number ( n = 29 proteins from 83 experiments) of experimental DMS datasets are publicly available (as of August 2021), with even fewer proteins (9 proteins from 10 experiments) displaying a bimodal distribution of DMS scores amenable for a binary clasification task.…”
Section: Resultsmentioning
confidence: 99%
“…In contrast to previous machine learning approaches utilising DMS data (e.g. [36, 37, 38]), here we introduce DNA mutational signatures as a feature to be evaluated for its predictive utility, and explicitly test the contribution of different feature combinations in classifying variant impact (Figure 5A). We note that only a small number ( n = 29 proteins from 83 experiments) of experimental DMS datasets are publicly available (as of August 2021), with even fewer proteins (9 proteins from 10 experiments) displaying a bimodal distribution of DMS scores amenable for a binary clasification task.…”
Section: Resultsmentioning
confidence: 99%
“…Computational saturation mutagenesis guides experimental approaches to study the impacts or help rationalise the consequences of known or emerging mutations [59] . Such approaches have been applied to other proteins like artificial (βα) 8 ‐barrel protein [19] or in deep mutation scans [60] . Experimental validation of mutation impacts in M. leprae are time and labour-intensive processes owing to the inability of bacillus to grow on an artificial culture media.…”
Section: Discussionmentioning
confidence: 99%
“…[37,38] Instead of complex metrics derived from protein structure prediction software, more intuitively explanatory descriptors such as the number of contacts to other amino acid residues are excellent predictors. [39][40][41] Evolutionary descriptors including conservation score and PSSM weight are proven to be also extremely useful in predicting mutational effect [40,42,43] and site-wise tolerance to mutations. The use of multiple sequence alignments (MSA) to generate useful embeddings from transformer language models is under active research.…”
Section: A Brief Summary Of Machine Learning-assisted Protein Engineeringmentioning
confidence: 99%
“…Thus far, massively parallel assays are almost built-in with ML predictions on phenotypic values. [11,15,39,49,56,[85][86][87][88] While methods. The computational selected variants harboring correct mutational combinations (variants in solid lines) constitute a portion of the entire library; combinatorial variants that were not selected for experimental validation are also present (variants in dash lines), since it is difficult to control precisely which oligos are cloned into each fragment.…”
Section: Considerations On Integrating Massively Parallel Screening Platforms With Machine Learningmentioning
confidence: 99%
See 1 more Smart Citation