Proteins fold into unique native structures stabilized by thousands of weak interactions that collectively overcome the entropic cost of folding. Although these forces are "encoded" in the thousands of known protein structures, "decoding" them is challenging because of the complexity of natural proteins that have evolved for function, not stability. We combined computational protein design, next-generation gene synthesis, and a high-throughput protease susceptibility assay to measure folding and stability for more than 15,000 de novo designed miniproteins, 1000 natural proteins, 10,000 point mutants, and 30,000 negative control sequences. This analysis identified more than 2500 stable designed proteins in four basic folds-a number sufficient to enable us to systematically examine how sequence determines folding and stability in uncharted protein space. Iteration between design and experiment increased the design success rate from 6% to 47%, produced stable proteins unlike those found in nature for topologies where design was initially unsuccessful, and revealed subtle contributions to stability as designs became increasingly optimized. Our approach achieves the long-standing goal of a tight feedback cycle between computation and experiment and has the potential to transform computational protein design into a data-driven science.
Targeting the interaction between the SARS-CoV-2 Spike protein and the human ACE2 receptor is a promising therapeutic strategy. We designed inhibitors using two de novo design approaches. Computer generated scaffolds were either built around an ACE2 helix that interacts with the Spike receptor binding domain (RBD), or docked against the RBD to identify new binding modes, and their amino acid sequences designed to optimize target binding, folding and stability. Ten designs bound the RBD with affinities ranging from 100pM to 10nM, and blocked ARS-CoV-2 infection of Vero E6 cells with IC 50 values between 24 pM and 35 nM; The most potent, with new binding modes, are 56 and 64 residue proteins (IC 50 ~ 0.16 ng/ml). Cryo-electron microscopy structures of these minibinders in complex with the SARS-CoV-2 spike ectodomain trimer with all three RBDs bound are nearly identical to the computational models. These hyperstable minibinders provide starting points for SARS-CoV-2 therapeutics.
De novo protein design holds promise for creating small stable proteins with shapes customized to bind therapeutic targets. We describe a massively parallel approach for designing, manufacturing and screening mini-protein binders, integrating large-scale computational design, oligonucleotide synthesis, yeast display screening and next-generation sequencing. We designed and tested 22,660 mini-proteins of 37–43 residues that target influenza haemagglutinin and botulinum neurotoxin B, along with 6,286 control sequences to probe contributions to folding and binding, and identified 2,618 high-affinity binders. Comparison of the binding and non-binding design sets, which are two orders of magnitude larger than any previously investigated, enabled the evaluation and improvement of the computational model. Biophysical characterization of a subset of the binder designs showed that they are extremely stable and, unlike antibodies, do not lose activity after exposure to high temperatures. The designs elicit little or no immune response and provide potent prophylactic and therapeutic protection against influenza, even after extensive repeated dosing.
The design of proteins that bind to a specific site on the surface of a target protein using no information other than the three-dimensional structure of the target remains a challenge1–5. Here we describe a general solution to this problem that starts with a broad exploration of the vast space of possible binding modes to a selected region of a protein surface, and then intensifies the search in the vicinity of the most promising binding modes. We demonstrate the broad applicability of this approach through the de novo design of binding proteins to 12 diverse protein targets with different shapes and surface properties. Biophysical characterization shows that the binders, which are all smaller than 65 amino acids, are hyperstable and, following experimental optimization, bind their targets with nanomolar to picomolar affinities. We succeeded in solving crystal structures of five of the binder–target complexes, and all five closely match the corresponding computational design models. Experimental data on nearly half a million computational designs and hundreds of thousands of point mutants provide detailed feedback on the strengths and limitations of the method and of our current understanding of protein–protein interactions, and should guide improvements of both. Our approach enables the targeted design of binders to sites of interest on a wide variety of proteins for therapeutic and diagnostic applications.
We used two approaches to design proteins with shape and chemical complementarity to the receptor binding domain (RBD) of SARS-CoV-2 Spike protein near the binding site for the human ACE2 receptor. Scaffolds were built around an ACE2 helix that interacts with the RBD, or de novo designed scaffolds were docked against the RBD to identify new binding modes. In both cases, designed sequences were optimized first in silico and then experimentally for target binding, folding and stability. Nine designs bound the RBD with affinities ranging from 100pM to 10nM, and blocked bona fide SARS-CoV-2 infection of Vero E6 cells with IC50 values ranging from 35 pM to 35 nM; the most potent of these - 56 and 64 residue hyperstable proteins made using the second approach - are roughly six times more potent on a per mass basis (IC50 ~ 0.23 ng/ml) than the best monoclonal antibodies reported thus far. Cryo-electron microscopy structures of the SARS-CoV-2 spike ectodomain trimer in complex with the two most potent minibinders show that the structures of the designs and their binding interactions with the RBD are nearly identical to the computational models, and that all three RBDs in a single Spike protein can be engaged simultaneously. These hyperstable minibinders provide promising starting points for new SARS-CoV-2 therapeutics, and illustrate the power of computational protein design for rapidly generating potential therapeutic candidates against pandemic threats.
New variants of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) continue to arise and prolong the coronavirus disease 2019 (COVID-19) pandemic. Here we used a cell-free expression workflow to rapidly screen and optimize constructs containing multiple computationally designed miniprotein inhibitors of SARS-CoV-2. We found the broadest efficacy with a homo-trimeric version of the 75-residue angiotensin converting enzyme 2 (ACE2) mimic AHB2 (TRI2-2) designed to geometrically match the trimeric spike architecture. In the cryo-electron microscopy structure, TRI2 formed a tripod on top of the spike protein which engaged all three receptor binding domains (RBDs) simultaneously as in the design model. TRI2-2 neutralized Omicron (B.1.1.529), Delta (B.1.617.2), and all other variants tested with greater potency than that of monoclonal antibodies used clinically for the treatment of COVID-19. TRI2-2 also conferred prophylactic and therapeutic protection against SARS-CoV-2 challenge when administered intranasally in mice. Designed miniprotein receptor mimics geometrically arrayed to match pathogen receptor binding sites could be a widely applicable antiviral therapeutic strategy with advantages over antibodies and native receptor traps. By comparison, the designed proteins have resistance to viral escape and antigenic drift by construction, precisely tuned avidity, and greatly reduced chance of autoimmune responses.
To create new enzymes and biosensors from scratch, precise control over the structure of small-molecule binding sites is of paramount importance, but systematically designing arbitrary protein pocket shapes and sizes remains an outstanding challenge. Using the NTF2-like structural superfamily as a model system, we developed an enumerative algorithm for creating a virtually unlimited number of de novo proteins supporting diverse pocket structures. The enumerative algorithm was tested and refined through feedback from two rounds of large-scale experimental testing, involving in total the assembly of synthetic genes encoding 7,896 designs and assessment of their stability on yeast cell surface, detailed biophysical characterization of 64 designs, and crystal structures of 5 designs. The refined algorithm generates proteins that remain folded at high temperatures and exhibit more pocket diversity than naturally occurring NTF2-like proteins. We expect this approach to transform the design of small-molecule sensors and enzymes by enabling the creation of binding and active site geometries much more optimal for specific design challenges than is accessible by repurposing the limited number of naturally occurring NTF2-like proteins.
Highlights d SARS-CoV-2 RBD-specific miniproteins confer 100% survival against lethal challenge d Miniproteins prevent SARS-CoV-2-induced lung inflammation and pathology d One miniprotein dose protects when given between D-5 and D+3 relative to infection d Miniproteins inhibit historical, circulating, and emerging SARS-CoV-2 strains Authors James
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.