Software packages NUPARM and NUCGEN, are described, which can be used to understand sequence directed structural variations in nucleic acids, by analysis and generation of non-uniform structures. A set of local inter basepair parameters (viz. tilt, roll, twist, shift, slide and rise) have been defined, which use geometry and coordinates of two successive basepairs only and can be used to generate polymeric structures with varying geometries for each of the 16 possible dinucleotide steps. Intra basepair parameters, propeller, buckle, opening and the C6...C8 distance can also be varied, if required, while the sugar phosphate backbone atoms are fixed in some standard conformation in each of the nucleotides. NUPARM can be used to analyse both DNA and RNA structures, with single as well as double stranded helices. The NUCGEN software generates double helical models with the backbone fixed in B-form DNA, but with appropriate modifications in the input data, it can also generate A-form DNA and RNA duplex structures.
Non-canonical base pairs, mostly present in the RNA, often play a prominent role towards maintaining their structural diversity. Higher order structures like base triples are also important in defining and stabilizing the tertiary folded structure of RNA. We have developed a new program BPFIND to analyze different types of canonical and non-canonical base pairs and base triples involving at least two direct hydrogen bonds formed between polar atoms of the bases or sugar O2' only. We considered 104 possible types of base pairs, out of which examples of 87 base pair types are found to occur in the available RNA crystal structures. Analysis indicates that approximately 32.7% base pairs in the functional RNA structures are non-canonical, which include different types of GA and GU Wobble base pairs apart from a wide range of base pair possibilities. We further noticed that more than 10.4% of these base pairs are involved in triplet formation, most of which play important role in maintaining long-range tertiary contacts in the three-dimensional folded structure of RNA. Apart from detection, the program also gives a quantitative estimate of the conformational deformation of detected base pairs in comparison to an ideal planar base pair. This helps us to gain insight into the extent of their structural variations and thus assists in understanding their specific role towards structural and functional diversity.
Complementarity, in terms of both shape and electrostatic potential, has been quantitatively estimated at proteinprotein interfaces and used extensively to predict the specific geometry of association between interacting proteins. In this work, we attempted to place both binding and folding on a common conceptual platform based on complementarity. To that end, we estimated (for the first time to our knowledge) electrostatic complementarity (E m) for residues buried within proteins. E m measures the correlation of surface electrostatic potential at protein interiors. The results show fairly uniform and significant values for all amino acids. Interestingly, hydrophobic side chains also attain appreciable complementarity primarily due to the trajectory of the main chain. Previous work from our laboratory characterized the surface (or shape) complementarity (S m) of interior residues, and both of these measures have now been combined to derive two scoring functions to identify the native fold amid a set of decoys. These scoring functions are somewhat similar to functions that discriminate among multiple solutions in a protein-protein docking exercise. The performances of both of these functions on state-of-the-art databases were comparable if not better than most currently available scoring functions. Thus, analogously to interfacial residues of protein chains associated (docked) with specific geometry, amino acids found in the native interior have to satisfy fairly stringent constraints in terms of both S m and E m. The functions were also found to be useful for correctly identifying the same fold for two sequences with low sequence identity. Finally, inspired by the Ramachandran plot, we developed a plot of S m versus E m (referred to as the complementarity plot) that identifies residues with suboptimal packing and electrostatics which appear to be correlated to coordinate errors.
Graphene−nucleobase interaction is gaining importance due to its possible therapeutic applications. The dispersion interaction plays a major role in stacked aromatic compounds, such as graphene with amino acids or nucleobases. We have carried out detailed quantum chemical calculations of complexes of nanographene sheets and the nucleobases of DNA and RNA using dispersion corrected density functional theory. Binding energies show a trend as observed earlier by different theoretical and experimental measurements. However, in our present investigation, the optimized structures of the complex as well as isolated graphene show significant curvature, similar features are also observed by our Atomic Force Microscopic studies. Analysis of Natural Bond Orbital charges indicates the possibility of weak hydrogen bond-like interactions involving pyramidal amino groups of the nucleobases and π-center of the nanographene.
The trans Watson-Crick/Watson-Crick family of base pairs represent a geometric class that play important structural and possible functional roles in the ribosome, tRNA, and other functional RNA molecules. They nucleate base triplets and quartets, participate as loop closing terminal base pairs in hair pin motifs and are also responsible for several tertiary interactions that enable sequentially distant regions to interact with each other in RNA molecules. Eleven representative examples spanning nine systems belonging to this geometric family of RNA base pairs, having widely different occurrence statistics in the PDB database, were studied at the HF/6-31G (d, p) level using Morokuma decomposition, Atoms in Molecules as well as Natural Bond Orbital methods in the optimized gas phase geometries and in their crystal structure geometries, respectively. The BSSE and deformation energy corrected interaction energy values for the optimized geometries are compared with the corresponding values in the crystal geometries of the base pairs. For non protonated base pairs in their optimized geometry, these values ranged from -8.19 kcal/mol to -21.84 kcal/mol and compared favorably with those of canonical base pairs. The interaction energies of these base pairs, in their respective crystal geometries, were, however, lesser to varying extents and in one case, that of A:A W:W trans, it was actually found to be positive. The variation in RMSD between the two geometries was also large and ranged from 0.32-2.19 A. Our analysis shows that the hydrogen bonding characteristics and interaction energies obtained, correlated with the nature and type of hydrogen bonds between base pairs; but the occurrence frequencies, interaction energies, and geometric variabilities were conspicuous by the absence of any apparent correlation. Instead, the nature of local interaction energy hyperspace of different base pairs as inferred from the degree of their respective geometric variability could be correlated with the identities of free and bound hydrogen bond donor/acceptor groups present in interacting bases in conjunction with their tertiary and neighboring group interaction potentials in the global context. It also suggests that the concept of isostericity alone may not always determine covariation potentials for base pairs, particularly for those which may be important for RNA dynamics. These considerations are more important than the absolute values of the interaction energies in their respective optimized geometries in rationalizing their occurrences in functional RNAs. They highlight the importance of revising some of the existing DNA based structure analysis approaches and may have significant implications for RNA structure and dynamics, especially in the context of structure prediction algorithms.
The importance of non-Watson-Crick base pairs in the three-dimensional structure of RNA is now well established. The structure and stability of these noncanonical base pairs are, however, poorly understood. We have attempted to understand structural features of 33 frequently occurring base pairs using density functional theory. These are of three types, namely (i) those stabilized by two or more polar hydrogen bonds between the bases, (ii) those having one polar and another C-H...O/N type interactions, and (iii) those having one H-bond between the bases and another involving one of the sugars linked to the bases. We found that the base pairs having two polar H-bonds are very stable as compared to those having one C-H...O/N interaction. Our quantitatively analysis of structures of these optimized base pairs indicates that they possess a different amount of nonplanarity with large propeller or buckle values as also observed in the crystal structures. We further found that geometry optimization does not modify the hydrogen-bonding pattern, as values of shear and open angle of the base pairs remain conserved. The structures of initial crystal geometry and final optimized geometry of some base pairs having only one polar H-bond and a C-H...O/N interaction, however, are significantly different, indicating the weak nature of the nonpolar interaction. The base pair flexibility, as measured from normal-mode analysis, in terms of the intrinsic standard deviations of the base pair structural parameters are in conformity with those calculated from RNA crystal structures. We also noticed that deformation of a base pair along the stretch direction is impossible for all of the base pairs, and movements of the base pairs along shear and open are also quite restricted. The base pair opening mode through alteration of propeller or buckle is considerably less restricted for most of the base pairs.
Non-canonical base pairs contribute immensely to the structural and functional variability of RNA, which calls for a detailed characterization of their spatial conformation. Intra-base pair parameters, namely propeller, buckle, open-angle, stagger, shear and stretch describe structure of base pairs indicating planarity and proximity of association between the two bases. In order to study the conformational specificities of non-canonical base pairs occurring in RNA crystal structures, we have upgraded NUPARM software to calculate these intra-base pair parameters using a new base pairing edge specific axis system. Analysis of base pairs and base triples with the new edge specific axis system indicate the presence of specific structural signatures for different classes of non-canonical pairs and triples. Differentiating features could be identified for pairs in cis or trans orientation, as well as those involving sugar edges or C-H-mediated hydrogen bonds. It was seen that propeller for all types of base pairs in cis orientation are generally negative, while those for trans base pairs do not have any preference. Formation of a base triple is seen to reduce propeller of the associated base pair along with reduction of overall flexibility of the pairs. We noticed that base pairs involving sugar edge are generally more non-planar, with large propeller or buckle values, presumably to avoid steric clash between the bulky sugar moieties. These specific conformational signatures often provide an insight into their role in the structural and functional context of RNA.
Base pairs involving protonated nucleobases play important roles in mediating global macromolecular conformational changes and in facilitation of catalysis in a variety of functional RNA molecules. Here we present our attempts at understanding the role of such base pairs by detecting possible protonated base pairs in the available RNA crystal structures using BPFind software, in their specific structural contexts, and by the characterization of their geometries, interaction energies, and stabilities using advanced quantum chemical computations. We report occurrences of 18 distinct protonated base pair combinations from a representative data set of RNA crystal structures and propose a theoretical model for one putative base pair combination. Optimization of base pair geometries was carried out at the B3LYP/cc-pVTZ level, and the BSSE corrected interaction energies were calculated at the MP2/aug-cc-pVDZ level of theory. The geometries for each of the base pairs were characterized in terms of H-bonding patterns observed, rmsd values observed on optimization, and base pair geometrical parameters. In addition, the intermolecular interaction in these complexes was also analyzed using Morokuma energy decomposition. The gas phase interaction energies of the base pairs range from -24 to -49 kcal/mol and reveal the dominance of Hartree-Fock component of interaction energy constituting 73% to 98% of the total interaction energy values. On the basis of our combined bioinformatics and quantum chemical analysis of different protonated base pairs, we suggest resolution of structural ambiguities and correlate their geometric and energetic features with their structural and functional roles. In addition, we also examine the suitability of specific base pairs as key elements in molecular switches and as nucleators for higher order structures such as base triplets and quartets.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.