The Protein Data Bank [PDB; Berman, Westbrook et al. (2000), Nucleic Acids Res. 28, 235-242; http://www.pdb.org/] is the single worldwide archive of primary structural data of biological macromolecules. Many secondary sources of information are derived from PDB data. It is the starting point for studies in structural bioinformatics. This article describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information and plans for the future development of the resource. The reader should come away with an understanding of the scope of the PDB and what is provided by the resource.
A consensus classification and nomenclature are defined for RNA backbone structure using all of the backbone torsion angles. By a consensus of several independent analysis methods, 46 discrete conformers are identified as suitably clustered in a qualityfiltered, multidimensional dihedral angle distribution. Most of these conformers represent identifiable features or roles within RNA structures. The conformers are given two-character names that reflect the seven-angle dezabgd combinations empirically found favorable for the sugar-to-sugar ''suite'' unit within which the angle correlations are strongest (e.g., 1a for A-form, 5z for the start of S-motifs). Since the half-nucleotides are specified by a number for dez and a lowercase letter for abgd, this modular system can also be parsed to describe traditional nucleotide units (e.g., a1) or the dinucleotides (e.g., a1a1) that are especially useful at the level of crystallographic map fitting. This nomenclature can also be written as a string with two-character suite names between the uppercase letters of the base sequence (N1aG1gN1aR1aA1cN1a for a GNRA tetraloop), facilitating bioinformatic comparisons. Cluster means, standard deviations, coordinates, and examples are made available, as well as the Suitename software that assigns suite conformer names and conformer match quality (suiteness) from atomic coordinates. The RNA Ontology Consortium will combine this new backbone system with others that define base pairs, base-stacking, and hydrogen-bond relationships to provide a full description of RNA structural motifs.
A statistical survey of the torsion angles, bond angles, and bond lengths in the sugar and phosphate groups of well-refined mononucleoside, mononucleotide, dinucleoside monophosphate, and trinucleoside diphosphate crystal structures contained in the Cambridge Structural Database and the Nucleic Acid Database is reported. The mean values of the geometrical parameters in these structures and their estimated standard deviations are separated according to their chemistry and conformation. These new parameters serve as a basis for a dictionary of standard nucleic acid geometry.
The geometry of the phosphodiester backbone was analyzed for 7739 dinucleotides from 447 selected crystal structures of naked and complexed DNA. Ten torsion angles of a near-dinucleotide unit have been studied by combining Fourier averaging and clustering. Besides the known variants of the A-, B- and Z-DNA forms, we have also identified combined A + B backbone-deformed conformers, e.g. with α/γ switches, and a few conformers with a syn orientation of bases occurring e.g. in G-quadruplex structures. A plethora of A- and B-like conformers show a close relationship between the A- and B-form double helices. A comparison of the populations of the conformers occurring in naked and complexed DNA has revealed a significant broadening of the DNA conformational space in the complexes, but the conformers still remain within the limits defined by the A- and B- forms. Possible sequence preferences, important for sequence-dependent recognition, have been assessed for the main A and B conformers by means of statistical goodness-of-fit tests. The structural properties of the backbone in quadruplexes, junctions and histone-core particles are discussed in further detail.
Ordered hydration sites were determined for the nucleotide bases in B-type conformations using the crystal structure data on 14 B-DNA decamer structures. A method of density representation was extended so that positions, occupancies, and distributions of the hydration sites were predicted around a B-DNA double helix by a method analogous to crystallographic refinement. The predicted hydration sites correctly reproduce the main features of hydration around the B-DNA dodecamer. In contrast to the previous observations, the newly available crystal data show the same extent of hydration of guanine and adenine, and of cytosine and thymine.
The hepatitis delta virus (HDV) ribozyme is an RNA enzyme from the human pathogenic HDV. Cations play a crucial role in self-cleavage of the HDV ribozyme, by promoting both folding and chemistry. Experimental studies have revealed limited but intriguing details on the location and structural and catalytic functions of metal ions. Here, we analyze a total of approximately 200 ns of explicit-solvent molecular dynamics simulations to provide a complementary atomistic view of the binding of monovalent and divalent cations as well as water molecules to reaction precursor and product forms of the HDV ribozyme. Our simulations find that an Mg2+ cation binds stably, by both inner- and outer-sphere contacts, to the electronegative catalytic pocket of the reaction precursor, in a position to potentially support chemistry. In contrast, protonation of the catalytically involved C75 in the precursor or artificial placement of this Mg2+ into the product structure result in its swift expulsion from the active site. These findings are consistent with a concerted reaction mechanism in which C75 and hydrated Mg2+ act as general base and acid, respectively. Monovalent cations bind to the active site and elsewhere assisted by structurally bridging long-residency water molecules, but are generally delocalized.
Water distributions around phosphate groups in 59 B-, A-, and Z-DNA crystal structures were analyzed. It is shown that the waters are concentrated in six hydration sites per phosphate and that the positions and occupancies of these sites are dependent on the conformation and type of nucleotide. The patterns of hydration that are characteristic of the backbone of the three DNA helical types can be attributed in part to the interactions of these hydration sites.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.