Computational Design of Stable and Soluble Biocatalysts

Musil, Miloš; Konegger, Hannes; Hon, Jiří; Bednář, David; Damborský, Jiřı́

doi:10.1021/acscatal.8b03613

Cited by 109 publications

(117 citation statements)

References 226 publications

Supporting

Mentioning

109

Contrasting

Order By: Relevance

“…Over the last 20 years, in silico design based on energy calculations has taken a long way from fairly simple to more accurate and versatile methods [42][43] , particularly with a positive impact in the area of protein stabilization. However, the accuracies based on energy functions are still suboptimal because of several factors, including the insufficient conformational sampling of the static structure, imbalances in the force fields, and the intrinsic problems with existing data sets 24 . Although the drawback can be mitigated by using hybrid methods that incorporate complementary statistical-based approaches such as ABACUS, most stability strategies are focusing on single-pointmutation or simple stepwise combination process, resulting in higher prediction errors upon application to multiple-point mutants.…”

Section: Discussionmentioning

confidence: 99%

“…There is usually a pathway whereby some new functions could be acquired by individually beneficial mutations, however, when the desired function is beyond what a single mutation or double mutations can accomplish, possible paths grow exponentially as the mutations accumulate and most paths result in downhill or even unfolded proteins 23 . Since a majority of protein engineering studies involves simple uphill walks, the main demand lies in identifying an efficient path of accumulated mutations to achieve the desired protein performance 24 .…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Computational redesign of a PETase for plastic biodegradation by the GRAPE strategy

Chen

Liu

et al. 2019

Preprint

View full text Add to dashboard Cite

The excessive use of plastics has been accompanied by severe ecologically damaging effects. The recent discovery of a PETase from Ideonella sakaiensis that decomposes poly(ethylene terephthalate) (PET) under mild conditions provides an attractive avenue for the biodegradation of plastics. However, the inherent instability of the enzyme limits its practical 20 15 and the Biological Resources Program (KFJ-BRP-009) of the Chinese Academy of Sciences.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Computational redesign of a PETase for plastic biodegradation by the GRAPE strategy

Chen

Liu

et al. 2019

Preprint

View full text Add to dashboard Cite

show abstract

“…We have compared the performance of 11 sequence based solubility prediction methods on plant UGT proteins. We have only included the tools that predict the solubility of different proteins expressed in E. coli [12,13]. The list of the tools and their description can be found in the Methods section.…”

Section: Resultsmentioning

confidence: 99%

“…Solubility prediction software tools can have a significant impact on recombinant protein production by excluding insoluble proteins from expression trials and thereby preventing extra costs and dead-end experiments. Overall, solubility prediction tools can be grouped into 3 classes based on their applications [12]: 1) methods that predict the overall solubility of proteins upon expression (usually in E. coli), 2) approaches for predicting the aggregation propensity of different regions in a protein sequence, and 3) tools that predict the impact of mutations on solubility of proteins. Among these groups, the former is studied here.…”

Section: Introductionmentioning

confidence: 99%

A benchmark of protein solubility prediction methods on UDP-dependent glycosyltransferases

Ghomi

Kittila

Welner³

2020

Preprint

View full text Add to dashboard Cite

UDP-dependent glycosyltransferases (UGTs) are enzymes that glycosylate a wide variety of natural products, thereby modifying their physico-chemical properties, i.e. solubility, stability, reactivity, and function. To successfully leverage the UGTs in biocatalytic processes, we need to be able to screen and characterise them in vitro, which requires efficient heterologous expression in amenable hosts, preferably Escherichia coli. However, many UGTs are insoluble when expressed in standard and attempted optimised E. coli conditions, resulting in many unproductive and costly experiments. To overcome this limitation, we have investigated the performance of 11 existing solubility predictors on a dataset of 57 UGTs expressed in E. coli.We show that SoluProt outperforms other methods in terms of both threshold-independent and threshold-dependent measures. Among the benchmarked methods, only SoluProt is significantly better than random predictors using both measures. Moreover, we show that SoluProt uses a threshold for separating soluble and insoluble proteins that is optimal for our dataset. Hence, we conclude that using SoluProt to select UGT sequences for in vitro investigation will significantly increase the success rate of soluble expression, thereby minimising cost and enabling efficient characterisation efforts for biocatalysis research.

show abstract

“…TM is defined by the temperature at which half the enzyme is found in the unfolded state over folded state12,22 and is often evaluated through denaturation assays, from which the thermodynamic measurements (ΔGunfolding) can be obtained. 22 This method is generally a lower throughput method as purified protein is required to get an accurate measurement for the structural properties for the mutant being evaluated. T50 measures the temperature of half-inactivation that leads to irreversible unfolding11,23, and it is determined by the reduction of half of the enzymatic activity due heat-challenges.12 This is a very common assay for protein engineering due to its compatibility with high throughput assays and the ability to use cell lysates to evaluate function.…”

Section: Evaluating the Relationship Between Tm And T50mentioning

confidence: 99%

Evaluating molecular modeling tools for thermal stability using an independently generated dataset

Huang

Chu

Frizzo

et al. 2019

Preprint

View full text Add to dashboard Cite

Engineering proteins to enhance thermal stability is a widely utilized approach for creating industrially relevant biocatalysts. Computational tools that guide these engineering efforts remain an active area of research with new data sets and develop algorithms. To aid in these efforts, we are reporting an expansion of our previously published data set of mutants for a -glucosidase to include both measures of TM and G, to complement the previously reported measures of T50 and kinetic constants (kcat and KM). For a set of 51 mutants, we found that T50 and TM are moderately correlated with a Pearson correlation coefficient (PCC) of 0.58, indicated the two methods capture different physical features. The performance of predicted stability using five computational tools are also evaluated on the 51 mutants dataset, none of which are found to be strong predictors of the observed changes in T50, TM, or G. Furthermore, the ability of the five algorithms to predict the production of isolatable soluble protein is examined, which revealed that Rosetta ΔΔG, ELASPIC, and DeepDDG are capable of predicting if a mutant could be produced and isolated as a soluble protein.These results further highlight the need for new algorithms for predicting modest, yet important, changes in thermal stability as well as a new utility for current algorithms for prescreening designs for the production of soluble mutants. ASSOCIATED CONTENT Supporting Information (SI)SI 1-1. SDS-PAGE images for 51 BglB mutants and WT. (PDF) SI 1-2. A distribution analysis of temperatures observed for TM and T50. SI 1-3. PPC graph between ΔTM and ΔΔG of BglB mutants. (PDF) SI 1-4. Evaluation of five computational methods on protein expression. (PDF) SI 2. Images of TM fluorescence graphs, derivative graphs, and Van't Hoff plot for 51 mutants and WT. (Zip) SI 3. Rosetta ΔΔG and FoldX PSSM correlations graphs with ΔTM and experimental ΔΔG. Excel files of all the parameters from data acquisition. Excel files of the total system energy for DeepDDG, ELASPIC, and PoPMuSiC (Zip)SI 4. Jupyter notebook for all thermal stability data acquisitions with all TM raw data files. (ipynb) SI 5. Example files for Rosetta_ddg_monomer run. (Zip)

show abstract

Computational Design of Stable and Soluble Biocatalysts

Cited by 109 publications

References 226 publications

Computational redesign of a PETase for plastic biodegradation by the GRAPE strategy

Computational redesign of a PETase for plastic biodegradation by the GRAPE strategy

A benchmark of protein solubility prediction methods on UDP-dependent glycosyltransferases

Evaluating molecular modeling tools for thermal stability using an independently generated dataset

Contact Info

Product

Resources

About