RNA-Puzzles is a collective endeavor dedicated to the advancement and improvement of RNA 3D structure prediction. With agreement from crystallographers, the RNA structures are predicted by various groups before the publication of the crystal structures. We now report the prediction of 3D structures for six RNA sequences: four nucleolytic ribozymes and two riboswitches. Systematic protocols for comparing models and crystal structures are described and analyzed. In these six puzzles, we discuss (i) the comparison between the automated web servers and human experts; (ii) the prediction of coaxial stacking; (iii) the prediction of structural details and ligand binding; (iv) the development of novel prediction methods; and (v) the potential improvements to be made. We show that correct prediction of coaxial stacking and tertiary contacts is essential for the prediction of RNA architecture, while ligand binding modes can only be predicted with low resolution and simultaneous prediction of RNA structure with accurate ligand binding still remains out of reach. All the predicted models are available for the future development of force field parameters and the improvement of comparison and assessment tools.
The SARS-CoV-2 Spike protein needs to be in an open-state conformation to interact with ACE2 to initiate viral entry. We utilise coarse-grained normal mode analysis to model the dynamics of Spike and calculate transition probabilities between states for 17081 variants including experimentally observed variants. Our results correctly model an increase in open-state occupancy for the more infectious D614G via an increase in flexibility of the closed-state and decrease of flexibility of the open-state. We predict the same effect for several mutations on glycine residues (404, 416, 504, 252) as well as residues K417, D467 and N501, including the N501Y mutation recently observed within the B.1.1.7, 501.V2 and P1 strains. This is, to our knowledge, the first use of normal mode analysis to model conformational state transitions and the effect of mutations on such transitions. The specific mutations of Spike identified here may guide future studies to increase our understanding of SARS-CoV-2 infection mechanisms and guide public health in their surveillance efforts.
The SARS-CoV-2 Spike protein needs to be in an open-state conformation to interact with ACE2 as part of the viral entry mechanism. We utilise coarse-grained normal-mode analyses to model the dynamics of Spike and calculate transition probabilities between states for 17081 Spike variants. Our results correctly model an increase in open-state occupancy for the more infectious D614G via an increase in flexibility of the closed-state and decrease of flexibility of the open-state. We predict the same effect for several mutations on Glycine residues (404, 416, 504, 252) as well as residues K417, D467 and N501, including the N501Y mutation, explaining the higher infectivity of the B.1.1.7 and 501.V2 strains. This is, to our knowledge, the first use of normal-mode analysis to model conformational state transitions and the effect of mutations thereon. The specific mutations of Spike identified here may guide future studies to increase our understanding of SARS-CoV-2 infection mechanisms and guide public health in their surveillance efforts.
Summary The Najmanovich Research Group Toolkit for Elastic Networks (NRGTEN) is a Python toolkit that implements four different NMA models in addition to popular and novel metrics to benchmark and measure properties from these models. Furthermore, the toolkit is available as a public Python package and is easily extensible for the development or implementation of additional NMA models. The inclusion of the ENCoM model (Elastic Network Contact Model) developed in our group within NRGTEN is noteworthy, owing to its account for the specific chemical nature of atomic interactions. Availability and implementation https://github.com/gregorpatof/nrgten_package/
The Elastic Network Contact Model (ENCoM) is a coarse-grained normal mode analysis (NMA) model unique in its all-atom sensitivity to the sequence of the studied macromolecule and thus to the effect of mutations. We adapted ENCoM to simulate the dynamics of ribonucleic acid (RNA) molecules, benchmarked its performance against other popular NMA models and used it to study the 3D structural dynamics of human microRNA miR-125a, leveraging high-throughput experimental maturation efficiency data of over 26 000 sequence variants. We also introduce a novel way of using dynamical information from NMA to train multivariate linear regression models, with the purpose of highlighting the most salient contributions of dynamics to function. ENCoM has a similar performance profile on RNA than on proteins when compared to the Anisotropic Network Model (ANM), the most widely used coarse-grained NMA model; it has the advantage on predicting large-scale motions while ANM performs better on B-factors prediction. A stringent benchmark from the miR-125a maturation dataset, in which the training set contains no sequence information in common with the testing set, reveals that ENCoM is the only tested model able to capture signal beyond the sequence. This ability translates to better predictive power on a second benchmark in which sequence features are shared between the train and test sets. When training the linear regression model using all available data, the dynamical features identified as necessary for miR-125a maturation point to known patterns but also offer new insights into the biogenesis of microRNAs. Our novel approach combining NMA with multivariate linear regression is generalizable to any macromolecule for which relatively high-throughput mutational data is available.
L-type Ca1.2 channels are essential for the excitation-contraction coupling in cardiomyocytes and are hetero-oligomers of a pore-forming Caα1C assembled with Caβ and Caα2δ1 subunits. A direct interaction between Caα2δ1 and Asp-181 in the first extracellular loop of Caα1 reproduces the native properties of the channel. A 3D model of the von Willebrand factor type A (VWA) domain of Caα2δ1 complexed with the voltage sensor domain of Caα1C suggests that Ser-261 and Ser-263 residues in the metal ion-dependent adhesion site (MIDAS) motif are determinant in this interaction, but this hypothesis is untested. Here, coimmunoprecipitation assays and patch-clamp experiments of single-substitution variants revealed that Caα2δ1 Asp-259 and Ser-261 are the two most important residues in regard to protein interactions and modulation of Ca1.2 currents. In contrast, mutating the side chains of Caα2δ1 Ser-263, Thr-331, and Asp-363 with alanine did not completely prevent channel function. Molecular dynamics simulations indicated that the carboxylate side chain of Caα2δ1 Asp-259 coordinates the divalent cation that is further stabilized by the oxygen atoms from the hydroxyl side chain of Caα2δ1 Ser-261 and the carboxylate group of Caα1C Asp-181. In return, the hydrogen atoms contributed by the side chain of Ser-261 and the main chain of Ser-263 bonded the oxygen atoms of Ca1.2 Asp-181. We propose that Caα2δ1 Asp-259 promotes Ca binding necessary to produce the conformation of the VWA domain that locks Caα2δ1 Ser-261 and Ser-263 within atomic distance of Caα1C Asp-181. This three-way network appears to account for the Caα2δ1-induced modulation of Ca1.2 currents.
Motivation:The use of Normal Mode Analysis (NMA) methods to study both protein and nucleic acid dynamics is well established. However, the most widely used coarse-grained methods are based on backbone geometry alone and do not take into account the chemical nature of the residues. Elastic Network Contact Model (ENCoM) is a coarse-grained NMA method that includes a pairwise atom-type non-bonded interaction term, which makes it sensitive to the sequence of the studied molecule. We adapted ENCoM to simulate the dynamics of ribonucleic acid (RNA) molecules.
The DynaSig-ML (“Dynamical Signatures—Machine Learning”) Python package allows the efficient, user-friendly exploration of 3D dynamics-function relationships in biomolecules, using datasets of experimental measures from large numbers of sequence variants. It does so by predicting 3D structural dynamics for every variant using the Elastic Network Contact Model (ENCoM), a sequence-sensitive coarse-grained normal mode analysis model. Dynamical Signatures represent the fluctuation at every position in the biomolecule and are used as features fed into machine learning models of the user's choice. Once trained, these models can be used to predict experimental outcomes for theoretical variants. The whole pipeline can be run with just a few lines of Python and modest computational resources. The compute-intensive steps are easily parallelized in the case of either large biomolecules or vast amounts of sequence variants. As an example application, we use the DynaSig-ML package to predict the maturation efficiency of human microRNA miR-125a variants from high-throughput enzymatic assays. Availability DynaSig-ML is open-source software available at https://github.com/gregorpatof/dynasigml_package Supplementary information Supplementary data are available at Bioinformatics online.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.