The Rosetta software suite for macromolecular modeling, docking, and design is widely used in pharmaceutical, industrial, academic, non-profit, and government laboratories. Despite its broad modeling capabilities, Rosetta remains consistently among leading software suites when compared to other methods created for highly specialized protein modeling and design tasks. Developed for over two decades by a global community of over 60 laboratories, Rosetta has undergone multiple refactorings, and now comprises over three million lines of code. Here we discuss methods developed in the last five years in Rosetta, involving the latest protocols for structure prediction; protein-protein and protein-small molecule docking; protein structure and interface design; loop modeling; the incorporation of various types of experimental data; modeling of peptides, antibodies and proteins in the immune system, nucleic acids, non-standard chemistries, carbohydrates, and membrane proteins. We briefly discuss improvements to the energy function, user interfaces, and usability of the software. Rosetta is available at www.rosettacommons.org.
Circulating antibodies (Ab) that specifically bind polyethylene glycol (PEG), a biocompatible polymer routinely used in protein and nanoparticle therapeutics, have been associated with reduced efficacy of and/or adverse reactions to therapeutics modified with or containing PEG. Unlike most antidrug antibodies that are induced following initial drug dosing, anti-PEG Ab can be found in treatment-naïve individuals (i.e., individuals who have never undergone treatment with PEGylated drugs but most likely have been exposed to PEG through other means). Unfortunately, the true prevalence, quantitative levels, and Ab isotype of pre-existing anti-PEG Ab remain poorly understood. Here, using rigorously validated competitive ELISAs with engineered chimeric anti-PEG monoclonal Ab standards, we quantified the levels of anti-PEG IgM and different subclasses of anti-PEG IgG (IgG1-4) in both contemporary and historical human samples. We unexpectedly found, with 90% confidence, detectable levels of anti-PEG Ab in ∼72% of the contemporary specimens (18% IgG, 25% IgM, 30% both IgG and IgM). The vast majority of these samples contained low levels of anti-PEG Ab, with only ∼7% and ∼1% of all specimens possessing anti-PEG IgG and IgM in excess of 500 ng/mL, respectively. IgG2 was the predominant anti-PEG IgG subclass. Anti-PEG Ab's were also observed in ∼56% of serum samples collected during 1970-1999 (20% IgG, 19% IgM, and 16% both IgG and IgM), suggesting that the presence of PEG-specific antibodies may be a longstanding phenomenon. Anti-PEG IgG levels demonstrated correlation with patient age, but not with gender or race. The widespread prevalence of pre-existing anti-PEG Ab, coupled with high Ab levels in a subset of the population, underscores the potential importance of screening patients for anti-PEG Ab levels prior to administration of therapeutics containing PEG.
Natural recombination combines pieces of pre-existing proteins to create new tertiary structures and functions. We describe a computational protocol, called SEWING, which is inspired by this process and builds new proteins from connected or disconnected pieces of existing structures. Helical proteins designed with SEWING contain structural features absent from other de novo designed proteins and in some cases remain folded to over 100 °C. High resolution structures of the designed proteins CA01 and DA05R1 were solved by X-ray crystallography (2.2 Å resolution) and NMR respectively, and there was excellent agreement with the design models. This method provides a new strategy to rapidly create large numbers of diverse and designable protein scaffolds.
KEAP1 is a substrate adaptor protein for a CUL3-based E3 ubiquitin ligase. Ubiquitylation and degradation of the antioxidant transcription factor NRF2 is considered the primary function of KEAP1; however, few other KEAP1 substrates have been identified. Because KEAP1 is altered in a number of human pathologies and has been proposed as a potential therapeutic target therein, we sought to better understand KEAP1 through systematic identification of its substrates. Toward this goal, we combined parallel affinity capture proteomics and candidate-based approaches. Substrate-trapping proteomics yielded NRF2 and the related transcription factor NRF1 as KEAP1 substrates. Our targeted investigation of KEAP1-interacting proteins revealed MCM3, an essential subunit of the replicative DNA helicase, as a new substrate. We show that MCM3 is ubiquitylated by the KEAP1-CUL3-RBX1 complex in cells and in vitro Using ubiquitin remnant profiling, we identify the sites of KEAP1-dependent ubiquitylation in MCM3, and these sites are on predicted exposed surfaces of the MCM2-7 complex. Unexpectedly, we determined that KEAP1 does not regulate total MCM3 protein stability or subcellular localization. Our analysis of a KEAP1 targeting motif in MCM3 suggests that MCM3 is a point of direct contact between KEAP1 and the MCM hexamer. Moreover, KEAP1 associates with chromatin in a cell cycle-dependent fashion with kinetics similar to the MCM2-7 complex. KEAP1 is thus poised to affect MCM2-7 dynamics or function rather than MCM3 abundance. Together, these data establish new functions for KEAP1 within the nucleus and identify MCM3 as a novel substrate of the KEAP1-CUL3-RBX1 E3 ligase.
Degenerate codon (DC) libraries efficiently address the experimental library-size limitations of directed evolution by focusing diversity toward the positions and toward the amino acids (AAs) that are most likely to generate hits; however, manually constructing DC libraries is challenging, error prone and time consuming. This paper provides a dynamic programming solution to the task of finding the best DCs while keeping the size of the library beneath some given limit, improving on the existing integer-linear programming formulation. It then extends the algorithm to consider multiple DCs at each position, a heretofore unsolved problem, while adhering to a constraint on the number of primers needed to synthesize the library. In the two library-design problems examined here, the use of multiple DCs produces libraries that very nearly cover the set of desired AAs while still staying within the experimental size limits. Surprisingly, the algorithm is able to find near-perfect libraries where the ratio of amino-acid sequences to nucleic-acid sequences approaches 1; it effectively side-steps the degeneracy of the genetic code. Our algorithm is freely available through our web server and solves most design problems in about a second.
Computational protein design has found great success in engineering proteins for thermodynamic stability, binding specificity, or enzymatic activity in a ‘single state’ design (SSD) paradigm. Multi-specificity design (MSD), on the other hand, involves considering the stability of multiple protein states simultaneously. We have developed a novel MSD algorithm, which we refer to as REstrained CONvergence in multi-specificity design (RECON). The algorithm allows each state to adopt its own sequence throughout the design process rather than enforcing a single sequence on all states. Convergence to a single sequence is encouraged through an incrementally increasing convergence restraint for corresponding positions. Compared to MSD algorithms that enforce (constrain) an identical sequence on all states the energy landscape is simplified, which accelerates the search drastically. As a result, RECON can readily be used in simulations with a flexible protein backbone. We have benchmarked RECON on two design tasks. First, we designed antibodies derived from a common germline gene against their diverse targets to assess recovery of the germline, polyspecific sequence. Second, we design “promiscuous”, polyspecific proteins against all binding partners and measure recovery of the native sequence. We show that RECON is able to efficiently recover native-like, biologically relevant sequences in this diverse set of protein complexes.
Polyethylene glycol (PEG) is a polymer routinely used to modify biologics and nanoparticles to prolong blood circulation and reduce immunogenicity of the underlying therapeutic. However, several PEGylated therapeutics induce the development of anti-PEG antibodies (APA), leading to reduced efficacy and increased adverse events. Given the highly flexible structure of PEG, how APA specifically bind PEG remains poorly understood. Here, we report a crystal structure illustrating the structural properties and conformation of the APA 6-3 Fab bound to the backbone of PEG. The structure reveals an open ring-like sub-structure in the Fab paratope, whereby PEG backbone is captured and then stabilized via Van der Waals interactions along the interior and exterior of the ring paratope surface. Our finding illustrates a strategy by which antibodies can bind highly flexible repeated structures that lack fixed conformations, such as polymers. This also substantially advances our understanding of the humoral immune response generated against PEG.
The Bacillus subtilis trpEDCFBA operon is regulated by a transcription attenuation mechanism controlled by the trp RNA-binding attenuation protein (TRAP). TRAP binds to 11 (G/U)AG repeats in the trp leader transcript and prevents formation of an antiterminator, which allows formation of an intrinsic terminator (attenuator). Previously, formation of the attenuator RNA structure was believed to be solely responsible for signaling RNA polymerase (RNAP) to halt transcription. However, base substitutions that prevent formation of the antiterminator, and thus allow the attenuator structure to form constitutively, do not result in efficient transcription termination. The observation that the attenuator requires the presence of TRAP bound to the nascent RNA to cause efficient transcription termination suggests TRAP has an additional role in causing termination at the attenuator. We show that the trp attenuator is a weak intrinsic terminator due to low GC content of the hairpin stem and interruptions in the U-stretch following the hairpin. We also provide evidence that termination at the trp attenuator requires forward translocation of RNA polymerase and that TRAP binding to the nascent transcript can induce this activity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.