Engineering proteins to enhance thermal stability is a widely used approach for creating industrially relevant biocatalysts. Computational tools that guide these engineering efforts remain an active area of research, with new data sets and algorithms continuing to be developed. To aid these efforts, we report an expansion of our previously published data set of mutants for a β-glucosidase to include measures of both TM and ΔΔG, complementing the previously reported measures of T50 and kinetic constants (kcat and KM). For a set of 51 mutants, we found that T50 and TM are moderately correlated, with a Pearson correlation coefficient (PCC) of 0.58, indicating that the two methods capture different physical features. The performance of five computational tools in predicting stability is also evaluated on the 51-mutant data set; none is found to be a strong predictor of the observed changes in T50, TM, or ΔΔG. Furthermore, we examine the ability of the five algorithms to predict the production of isolatable soluble protein and find that Rosetta ΔΔG, ELASPIC, and DeepDDG are capable of predicting whether a mutant can be produced and isolated as a soluble protein. These results further highlight the need for new algorithms for predicting modest, yet important, changes in thermal stability, as well as a new utility for current algorithms in prescreening designs for the production of soluble mutants.
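As an illustration of the correlation measure used above, the following is a minimal sketch of how a Pearson correlation coefficient between paired T50 and TM measurements could be computed. The function name `pearson` and the numeric values are hypothetical placeholders chosen for illustration only; they are not taken from the reported data set, and the sketch is not necessarily the procedure used in the accompanying Jupyter notebook.

```python
import math


def pearson(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations (unnormalized covariance)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Square roots of the sums of squared deviations
    dev_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    dev_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if dev_x == 0 or dev_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (dev_x * dev_y)


# Hypothetical placeholder values in degrees Celsius, NOT measured data.
t50_values = [39.0, 41.5, 38.2, 40.1, 42.3]
tm_values = [44.8, 45.9, 45.1, 44.2, 47.0]

pcc = pearson(t50_values, tm_values)
print(f"PCC between T50 and TM: {pcc:.2f}")
```

A PCC near 1 or -1 would indicate that the two measurements rank the mutants almost identically, whereas a moderate value is consistent with the two assays reporting on partly different properties.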
ASSOCIATED CONTENT
Supporting Information (SI)
SI 1-1. SDS-PAGE images for 51 BglB mutants and WT. (PDF)
SI 1-2. A distribution analysis of temperatures observed for TM and T50.
SI 1-3. PCC graph between ΔTM and ΔΔG of BglB mutants. (PDF)
SI 1-4. Evaluation of five computational methods on protein expression. (PDF)
SI 2. Images of TM fluorescence graphs, derivative graphs, and van't Hoff plots for 51 mutants and WT. (Zip)
SI 3. Rosetta ΔΔG and FoldX PSSM correlation graphs with ΔTM and experimental ΔΔG. Excel files of all the parameters from data acquisition. Excel files of the total system energy for DeepDDG, ELASPIC, and PoPMuSiC. (Zip)
SI 4. Jupyter notebook for all thermal stability data acquisitions with all TM raw data files. (ipynb)
SI 5. Example files for Rosetta_ddg_monomer run. (Zip)