REFMAC5 and related tools for the refinement of atomic models into cryo-EM reconstructions in CCP-EM are reviewed. An upper bound on the correlation between observed and calculated Fourier coefficients is identified, and the practical utility of map blurring/sharpening is discussed. The Divide and Conquer pipeline for refining large complexes in parallel, and ProSHADE for the identification of symmetries in a given map or coordinate set, are presented.
A detailed description of the dnatco.datmos.org web server implementing the universal structural alphabet of nucleic acids is presented. It is capable of processing any mmCIF- or PDB-formatted files containing DNA or RNA molecules; these can either be uploaded by the user or supplied as the wwPDB or PDB-REDO structural database access code. The web server performs an assignment of the nucleic acid conformations and presents the results for the intuitive annotation, validation, modeling and refinement of nucleic acids.
The BALBES database [1] contains over 13, 000 protein domain structures derived from the PDB. Since this database was built specifically for use in a molecular replacement pipeline, the focus of the domain classification was on structural similarity. This feature makes the database amenable for use in such shape-based applications as finding molecular replacement candidates using Patterson map matching (instead of sequence matching) or automatic fitting of domains into cryo-EM maps.Both applications are computationally expensive due to their requirement for multiple searches against the whole database and for this reason are not used routinely. Here we present two translationally and rotationally invariant fast shape descriptors based on spherical harmonic map decomposition. The advantage of these shape descriptors is twofold: 1) they can be used for reclassification of the BALBES domain database, thus making its use more efficient; and 2) these descriptors can be used as fast pre-filters for finding matches between a query map (be it determined by X-ray crystallography or cryo-EM) and the BALBES database.The shape descriptors presented here require transforming the query data into a Patterson-like map, which is subsequently interpolated onto a set of concentric spheres (shells) serving to retain the radial information, while the angular information is represented using the spherical harmonic decomposition of each shell. One of the shape descriptors then uses the correlation between spherical harmonic coefficients from different shells to describe the shape, while the other descriptor borrows from the rotation function [2] optimisation and uses the singular value decomposition to obtain shape descriptors. Both methods result in rotation invariant shape descriptors, while the use of Patterson-like maps ensures translation invariance. Combination of these features removes the requirement for a sixdimensional search, replacing it with matrix comparisons, thus making these methods computationally efficient.It should be noted that the calculation of the presented descriptors is irreversible, meaning that there is a loss of information. This feature is a trade-off between computational speed and accuracy. The optimisation of parameters for descriptor calculation was performed using over 100 protein domain groups from the BALBES database; each group contains different, but very similarly shaped domains. The resulting Area Under the Receiver Operator Characteristic curve (AUROC -which is closely related to Mann-Whitney U test, but has more interpretable value [3]) values were found to be 0.998 and 0.997 in our tests, respectively. While these results show the ability of both descriptors to distinguish between similar and different protein domain shapes, the information loss means that they should be used as pre-filters for oneagainst-many domain structure comparisons, rather than as a replacement for existing methods.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.