2011
DOI: 10.1371/journal.pone.0023947
|View full text |Cite
|
Sign up to set email alerts
|

Rational Design of Temperature-Sensitive Alleles Using Computational Structure Prediction

Abstract: Temperature-sensitive (ts) mutations are mutations that exhibit a mutant phenotype at high or low temperatures and a wild-type phenotype at normal temperature. Temperature-sensitive mutants are valuable tools for geneticists, particularly in the study of essential genes. However, finding ts mutations typically relies on generating and screening many thousands of mutations, which is an expensive and labor-intensive process. Here we describe an in silico method that uses Rosetta and machine learning techniques t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
29
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 22 publications
(29 citation statements)
references
References 23 publications
(21 reference statements)
0
29
0
Order By: Relevance
“…However, we can utilize physicochemical data on the change of amino acids at a particular site ("residue change in charge, hydrophobicity, volume, molecular weight") as a means to approximate this. To compliment these differences in physicochemical empirical data, we utilize a qualitative residue swap similarity metric that is 0 when both the unmutated and mutated amino acids belong to the same class (small nonpolar, small polar, negative charge, large nonpolar, bad behaved, positive charge, side chain amide) and 1 otherwise, as defined by Poultney et al [58]. Additionally, a static structural feature of potentially high descriptive value is "residue mean mutual information" which is the average value in bits at a particular residue in a mutual information matrix computed using MDEntropy.…”
Section: B Feature Designmentioning
confidence: 99%
“…However, we can utilize physicochemical data on the change of amino acids at a particular site ("residue change in charge, hydrophobicity, volume, molecular weight") as a means to approximate this. To compliment these differences in physicochemical empirical data, we utilize a qualitative residue swap similarity metric that is 0 when both the unmutated and mutated amino acids belong to the same class (small nonpolar, small polar, negative charge, large nonpolar, bad behaved, positive charge, side chain amide) and 1 otherwise, as defined by Poultney et al [58]. Additionally, a static structural feature of potentially high descriptive value is "residue mean mutual information" which is the average value in bits at a particular residue in a mutual information matrix computed using MDEntropy.…”
Section: B Feature Designmentioning
confidence: 99%
“…Both protocols 1) substitute the native residue for the variant amino acid, 2) refine the variant structure, including protein backbone movements, to accommodate this change, and 3) compare the output structures using the Rosetta score terms (Figure 1, Supplementary Figure S2). To generate features for each variant, we follow Poultney et al (40) and normalize structure-based features by comparing scores for a given variant to scores derived from Rosetta-relaxed ensembles of its native protein. We also include the accessible surface area at the position of variation as a feature, calculated using PROBE (52).…”
Section: Sequence-based Features From Blast Analysismentioning
confidence: 99%
“…We include additional features describing the geometric differences between the input and final structure for each trajectory (e.g., RMSD and gdtmm) to detect proteins undergoing large rearrangements, totaling 23 features. To compare the native and variant ensembles and eliminate potential differences in score magnitude across diverse protein folds, we 1) extract the distributions of each Rosetta score term for the native and variant proteins, 2) calculate the quartiles of the variant protein score distributions, and 3) calculate the cumulative density for these quantiles on the corresponding native protein score distribution (40). FastRelax and quartile analysis produce three features per score term for each variant, corresponding to the Q1, Q2, and Q3 quartiles (40), totaling 60 features.…”
Section: Structure-based Features From Rosetta Analysismentioning
confidence: 99%
See 2 more Smart Citations