The Rosetta software suite for macromolecular modeling, docking, and design is widely used in pharmaceutical, industrial, academic, non-profit, and government laboratories. Despite its broad modeling capabilities, Rosetta remains consistently among leading software suites when compared to other methods created for highly specialized protein modeling and design tasks. Developed for over two decades by a global community of over 60 laboratories, Rosetta has undergone multiple refactorings, and now comprises over three million lines of code. Here we discuss methods developed in the last five years in Rosetta, involving the latest protocols for structure prediction; protein-protein and protein-small molecule docking; protein structure and interface design; loop modeling; the incorporation of various types of experimental data; modeling of peptides, antibodies and proteins in the immune system, nucleic acids, non-standard chemistries, carbohydrates, and membrane proteins. We briefly discuss improvements to the energy function, user interfaces, and usability of the software. Rosetta is available at www.rosettacommons.org.
Skeletal muscle is a heterogeneous tissue comprised of muscle fiber and mononuclear cell types that, in addition to movement, influences immunity, metabolism and cognition. We investigated the gene expression patterns of skeletal muscle cells using RNA-seq of subtype-pooled single human muscle fibers and single cell RNA-seq of mononuclear cells from human vastus lateralis, mouse quadriceps, and mouse diaphragm. We identified 11 human skeletal muscle mononuclear cell types, including two fibro-adipogenic progenitor (FAP) cell subtypes. The human FBN1+ FAP cell subtype is novel and a corresponding FBN1+ FAP cell type was also found in single cell RNA-seq analysis in mouse. Transcriptome exercise studies using bulk tissue analysis do not resolve changes in individual cell-type proportion or gene expression. The cell-type gene signatures provide the means to use computational methods to identify cell-type level changes in bulk studies. As an example, we analyzed public transcriptome data from an exercise training study and revealed significant changes in specific mononuclear cell-type proportions related to age, sex, acute exercise and training. Our single-cell expression map of skeletal muscle cell types will further the understanding of the diverse effects of exercise and the pathophysiology of muscle disease.Skeletal muscle is a complex heterogeneous tissue consisting of multinucleated muscle fibers, immune cells, endothelial cells, muscle stem cells (satellite cells), non-myogenic mesenchymal progenitors (e.g., fibro-adipogenic progenitors, or FAPs), and other mononuclear cells 1 . To improve the understanding of skeletal muscle cell types and their transcriptional signatures, we studied human and mouse skeletal muscle mononuclear cells by single-cell RNA-sequencing and single human muscle fiber subtypes by RNA-seq.The majority of skeletal muscle is composed of the multinucleated fibers that facilitate movement. These muscle fibers include several fiber types of differing metabolic and functional properties 2-4 . While slow-twitch (or Type I) muscle fibers possess high oxidative capacity, fast-twitch (or Type II) muscle fibers have a high glycolytic capacity and are capable of supplying more power than Type I fibers 2-4 . Fiber-type composition differs across individuals and can change by as much as 10-30% during exercise training regimens 5-7 . Furthermore, the transcriptomic response to physical activity is different in each fiber-type as each fiber-type responds differently to different modes of exercise 8,9 . Crucially, muscle fibers secrete myokines, which both act locally within muscle tissue as well as influence other organs and tissues via hormone-like signaling 10 . Myokines may be responsible for the immune-, metabolism-, and cognition-related benefits of physical activity, as well as the chronic diseases that are caused by lack of physical activity (insulin resistance, cardiovascular disease, etc.) 10 .Besides multinucleated fibers, skeletal muscle contains many mononuclear cells, such as immune cells,...
Exercise provides a robust physiological stimulus that evokes cross-talk among multiple tissues that when repeated regularly (i.e., training) improves physiological capacity, benefits numerous organ systems, and decreases the risk for premature mortality. However, a gap remains in identifying the detailed molecular signals induced by exercise that benefits health and prevents disease. The Molecular Transducers of Physical Activity Consortium (MoTrPAC) was established to address this gap and generate a molecular map of exercise. Preclinical and clinical studies will examine the systemic effects of endurance and resistance exercise across a range of ages and fitness levels by molecular probing of multiple tissues before and after acute and chronic exercise. From this multi-omic and bioinformatic analysis, a molecular map of exercise will be established. Altogether, MoTrPAC will provide a public database that is expected to enhance our understanding of the health benefits of exercise and to provide insight into how physical activity mitigates disease.
Biophysical interactions between proteins and peptides are key determinants of molecular recognition specificity landscapes. However, an understanding of how molecular structure and residue-level energetics at protein−peptide interfaces shape these landscapes remains elusive. We combine information from yeast-based library screening, next-generation sequencing, and structure-based modeling in a supervised machine learning approach to report the comprehensive sequence−energetics−function mapping of the specificity landscape of the hepatitis C virus (HCV) NS3/4A protease, whose function—site-specific cleavages of the viral polyprotein—is a key determinant of viral fitness. We screened a library of substrates in which five residue positions were randomized and measured cleavability of ∼30,000 substrates (∼1% of the library) using yeast display and fluorescence-activated cell sorting followed by deep sequencing. Structure-based models of a subset of experimentally derived sequences were used in a supervised learning procedure to train a support vector machine to predict the cleavability of 3.2 million substrate variants by the HCV protease. The resulting landscape allows identification of previously unidentified HCV protease substrates, and graph-theoretic analyses reveal extensive clustering of cleavable and uncleavable motifs in sequence space. Specificity landscapes of known drug-resistant variants are similarly clustered. The described approach should enable the elucidation and redesign of specificity landscapes of a wide variety of proteases, including human-origin enzymes. Our results also suggest a possible role for residue-level energetics in shaping plateau-like functional landscapes predicted from viral quasispecies theory.
Regular exercise promotes whole-body health and prevents disease, yet the underlying molecular mechanisms throughout a whole organism are incompletely understood. Here, the Molecular Transducers of Physical Activity Consortium (MoTrPAC) profiled the temporal transcriptome, proteome, metabolome, lipidome, phosphoproteome, acetylproteome, ubiquitylproteome, epigenome, and immunome in whole blood, plasma, and 18 solid tissues in Rattus norvegicus over 8 weeks of endurance exercise training. The resulting data compendium encompasses 9466 assays across 19 tissues, 25 molecular platforms, and 4 training time points in young adult male and female rats. We identified thousands of shared and tissue- and sex- specific molecular alterations. Temporal multi-omic and multi-tissue analyses demonstrated distinct patterns of tissue remodeling, with widespread regulation of immune, metabolism, heat shock stress response, and mitochondrial pathways. These patterns provide biological insights into the adaptive responses to endurance training over time. For example, exercise training induced heart remodeling via altered activity of the Mef2 family of transcription factors and tyrosine kinases. Translational analyses revealed changes that are consistent with human endurance training data and negatively correlated with disease, including increased phospholipids and decreased triacylglycerols in the liver. Sex differences in training adaptation were widespread, including those in the brain, adrenal gland, lung, and adipose tissue. Integrative analyses generated novel hypotheses of disease relevance, including candidate mechanisms that link training adaptation to non-alcoholic fatty liver disease, inflammatory bowel disease, cardiovascular health, and tissue injury and recovery. The data and analysis results presented in this study will serve as valuable resources for the broader community and will be provided in an easily accessible public repository (https://motrpac-data.org/).
Characterizing the substrate specificity of protease enzymes is critical for illuminating the molecular basis of their diverse and complex roles in a wide array of biological processes. Rapid and accurate prediction of their extended substrate specificity would also aid in the design of custom proteases capable of selectively and controllably cleaving biotechnologically or therapeutically relevant targets. However, current in silico approaches for protease specificity prediction, rely on, and are therefore limited by, machine learning of sequence patterns in known experimental data. Here, we describe a general approach for predicting peptidase substrates de novo using protein structure modeling and biophysical evaluation of enzyme-substrate complexes. We construct atomic resolution models of thousands of candidate substrate-enzyme complexes for each of five model proteases belonging to the four major protease mechanistic classes-serine, cysteine, aspartyl, and metallo-proteases-and develop a discriminatory scoring function using enzyme design modules from Rosetta and AMBER's MMPBSA. We rank putative substrates based on calculated interaction energy with a modeled near-attack conformation of the enzyme active site. We show that the energetic patterns obtained from these simulations can be used to robustly rank and classify known cleaved and uncleaved peptides and that these structural-energetic patterns have greater discriminatory power compared to purely sequence-based statistical inference. Combining sequence and energetic patterns using machine-learning algorithms further improves classification performance, and analysis of structural models provides physical insight into the structural basis for the observed specificities. We further tested the predictive capability of the model by designing and experimentally characterizing the cleavage of four novel substrate motifs for the hepatitis C virus NS3/4 protease using an in vivo assay. The presented structure-based approach is generalizable to other protease enzymes with known or modeled structures, and complements existing experimental methods for specificity determination.
Multispecificity–the ability of a single receptor protein molecule to interact with multiple substrates–is a hallmark of molecular recognition at protein-protein and protein-peptide interfaces, including enzyme-substrate complexes. The ability to perform structure-based prediction of multispecificity would aid in the identification of novel enzyme substrates, protein interaction partners, and enable design of novel enzymes targeted towards alternative substrates. The relatively slow speed of current biophysical, structure-based methods limits their use for prediction and, especially, design of multispecificity. Here, we develop a rapid, flexible-backbone self-consistent mean field theory-based technique, MFPred, for multispecificity modeling at protein-peptide interfaces. We benchmark our method by predicting experimentally determined peptide specificity profiles for a range of receptors: protease and kinase enzymes, and protein recognition modules including SH2, SH3, MHC Class I and PDZ domains. We observe robust recapitulation of known specificities for all receptor-peptide complexes, and comparison with other methods shows that MFPred results in equivalent or better prediction accuracy with a ~10-1000-fold decrease in computational expense. We find that modeling bound peptide backbone flexibility is key to the observed accuracy of the method. We used MFPred for predicting with high accuracy the impact of receptor-side mutations on experimentally determined multispecificity of a protease enzyme. Our approach should enable the design of a wide range of altered receptor proteins with programmed multispecificities.
The mRNA-1273 vaccine was determined to be effective against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) from interim Phase 3 results and subsequently received an emergency use authorization by the FDA. Human studies, however, cannot provide the controlled response to infection and complex immunological insight that are only possible with preclinical studies. Hamsters are the only model that reliably exhibit severe SARS-CoV-2 disease similar to hospitalized patients, making them pertinent for vaccine evaluation. We demonstrate that prime or prime-boost administration of mRNA-1273 in hamsters elicited robust neutralizing antibodies, ameliorated weight loss, suppressed SARS-CoV-2 replication in the airways, and better protected against disease at the highest prime-boost dose. Unlike in mice and non-human primates, low level virus replication in mRNA-1273 vaccinated hamsters coincided with an anamnestic response. Single-cell RNA sequencing of lung tissue permitted high resolution analysis which is not possible in vaccinated humans. mRNA-1273 prevented inflammatory cell infiltration and the reduction of lymphocyte proportions, but enabled antiviral responses conducive to lung homeostasis. Surprisingly, infection triggered transcriptome programs in some types of immune cells from vaccinated hamsters that were shared, albeit attenuated, with mockvaccinated hamsters. Our results support the use of mRNA-1273 in a two-dose schedule and provide insight into the potential responses within the lungs of vaccinated humans who are exposed to SARS-CoV-2.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.