Ribosomes can produce proteins in minutes and are largely constrained to proteinogenic amino acids. Here, we report highly efficient chemistry matched with an automated fast-flow instrument for the direct manufacturing of peptide chains up to 164 amino acids long over 327 consecutive reactions. The machine is rapid: Peptide chain elongation is complete in hours. We demonstrate the utility of this approach by the chemical synthesis of nine different protein chains that represent enzymes, structural units, and regulatory factors. After purification and folding, the synthetic materials display biophysical and enzymatic properties comparable to the biologically expressed proteins. High-fidelity automated flow chemistry is an alternative for producing single-domain proteins without the ribosome.
Chemical protein synthesis and racemic protein crystallization were used to determine the X-ray structure of the snow flea antifreeze protein (sfAFP). Crystal formation from a racemic solution containing equal amounts of the chemically synthesized proteins d-sfAFP and l-sfAFP occurred much more readily than for l-sfAFP alone. More facile crystal formation also occurred from a quasi-racemic mixture of d-sfAFP and l-Se-sfAFP, a chemical protein analogue that contains an additional -SeCH2- moiety at one residue and thus differs slightly from the true enantiomer. Multiple wavelength anomalous dispersion (MAD) phasing from quasi-racemate crystals was then used to determine the X-ray structure of the sfAFP protein molecule. The resulting model was used to solve by molecular replacement the X-ray structure of l-sfAFP to a resolution of 0.98 A. The l-sfAFP molecule is made up of six antiparallel left-handed PPII helixes, stacked in two sets of three, to form a compact brick-like structure with one hydrophilic face and one hydrophobic face. This is a novel experimental protein structure and closely resembles a structural model proposed for sfAFP. These results illustrate the utility of total chemical synthesis combined with racemic crystallization and X-ray crystallography for determining the unknown structure of a protein.
High-diversity genetically-encoded combinatorial libraries (10 8 −10 13 members) are a rich source of peptide-based binding molecules, identified by affinity selection. Synthetic libraries can access broader chemical space, but typically examine only~10 6 compounds by screening. Here we show that in-solution affinity selection can be interfaced with nano-liquid chromatography-tandem mass spectrometry peptide sequencing to identify binders from fully randomized synthetic libraries of 10 8 members-a 100-fold gain in diversity over standard practice. To validate this approach, we show that binders to a monoclonal antibody are identified in proportion to library diversity, as diversity is increased from 10 6-10 8. These results are then applied to the discovery of p53-like binders to MDM2, and to a family of 3-19 nM-affinity, α/β-peptide-based binders to 14-3-3. An X-ray structure of one of these binders in complex with 14-3-3σ is determined, illustrating the role of β-amino acids in facilitating a key binding contact.
The use of competitive inhibitors to disrupt protein–protein interactions (PPIs) holds great promise for the treatment of disease. However, the discovery of high-affinity inhibitors can be a challenge. Here we report a platform for improving the affinity of peptide-based PPI inhibitors using non-canonical amino acids. The platform utilizes size exclusion-based enrichment from pools of synthetic peptides (1.5–4kDa) and liquid chromatography-tandem mass spectrometry-based peptide sequencing to identify high-affinity binders to protein targets, without the need for ‘reporter’ or ‘encoding’ tags. Using this approach—which is inherently selective for high-affinity binders—we realized gains in affinity of up to ~100- or ~30-fold for binders to the oncogenic ubiquitin ligase MDM2 or HIV capsid protein C-terminal domain, which inhibit MDM2-p53 interaction or HIV capsid protein C-terminal domain dimerization, respectively. Subsequent macrocyclization of select MDM2 inhibitors rendered them cell permeable and cytotoxic toward cancer cells, demonstrating the utility of the identified compounds as functional PPI inhibitors.
The burial of hydrophobic side chains in a protein core generally is thought to be the major ingredient for stable, cooperative folding. Here, we show that, for the snow flea antifreeze protein (sfAFP), stability and cooperativity can occur without a hydrophobic core, and without α-helices or β-sheets. sfAFP has low sequence complexity with 46% glycine and an interior filled only with backbone H-bonds between six polyproline 2 (PP2) helices. However, the protein folds in a kinetically two-state manner and is moderately stable at room temperature. We believe that a major part of the stability arises from the unusual match between residue-level PP2 dihedral angle bias in the unfolded state and PP2 helical structure in the native state. Additional stabilizing factors that compensate for the dearth of hydrophobic burial include shorter and stronger H-bonds, and increased entropy in the folded state. These results extend our understanding of the origins of cooperativity and stability in protein folding, including the balance between solvent and polypeptide chain entropies.protein folding | cooperativity | kinetics | PP2 | hydrogen bonding T he basis of protein-folding stability and cooperativity remains a topic of great interest (1, 2). Most studies have focused on proteins with hydrophobic cores containing α-helices and β-sheets. Here, we study the folding of snow flea antifreeze protein (sfAFP), a globular protein that lacks these features. The 81-residue protein has a novel fold that is distinct from other proteins (3-6), containing only polyproline 2 (PP2) helices and turns with a core filled with H-bonds and no hydrophobic groups (Fig. 1).The H-bond network between the PP2 helices (6), PP2 1-6 , requires close packing, which would be precluded by the presence of a C β atom. Thus, the PP2 helices are defined by glycine (Gly) repeats (GXX or GGX, where X is any other amino acid), which enable the hydrogen-bond network. As a consequence, the sfAFP molecule has a low-complexity glycine-rich sequence [37/81 Gly residues, or 46%; typical is 6, or 7% (7)]. The sfAFP core is well packed with essentially no interior void volumes, even when probed using a 0.5-Å radius sphere (8). sfAFP also contains two disulfide bonds, C1-C28 and C13-C43.Notably, no side chains are buried in sfAFP's core (2). All 12 hydrophobic side chains (V 4 , K 2 , and P 6 ) are surface exposed. Upon folding, sfAFP buries about 20 Å 2 ·res −1 of hydrophobic surface area compared with an average of 50 Å 2 ·res −1 for a set of 34 proteins of similar size (Fig. 2A). The difference equates to a decrease in stability of ∼1 or 1.4 kcal·mol −1 ·res −1 , assuming a surface tension coefficient of γ ∼ 34 or 47 cal·mol −1 ·Å −2 based on classical (9) or Flory-Huggins theory (10), respectively. Even the lower bound of 1 kcal·mol −1 ·res −1 represents a significant stability loss, and it is not obvious which energetic terms could compensate for the reduction in hydrophobic burial.Because hydrophobic burial is generally regarded as the basis of protein stability a...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.