In structure-based virtual screening, ranking compounds by a consensus of scores from several docking programs or scoring functions, rather than by scores from a single program, provides better predictive performance and reduces performance variability across targets. Here we compare traditional consensus scoring methods with a novel, unsupervised gradient boosting approach. We also observed increased score variation among active ligands and developed a statistical mixture model consensus score that combines score means and variances. To evaluate performance, we used the common performance metrics ROCAUC and EF1 on 21 benchmark targets from DUD-E. Traditional consensus methods, such as taking the mean of quantile-normalized docking scores, outperformed individual docking methods and were more robust to target variation. The mixture model and gradient boosting provided further improvements over the traditional consensus methods. These methods are readily applicable to new targets in academic research and overcome the potentially poor performance of using a single docking method on a new target.
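As an illustration of the traditional consensus baseline mentioned above (the mean of quantile-normalized docking scores), the following is a minimal sketch and not the authors' implementation. It assumes a score matrix with one row per compound and one column per docking program, that lower scores are better in every column (scores from programs where higher is better would need their sign flipped first), and that there are no missing values. The function names `quantile_normalize`, `mean_consensus`, and `enrichment_factor` are hypothetical, and the enrichment factor is computed here as the active rate in the top-ranked fraction divided by the overall active rate.

```python
import numpy as np

def quantile_normalize(scores):
    """Map every column (one docking program) onto a shared reference distribution."""
    scores = np.asarray(scores, dtype=float)
    order = np.argsort(scores, axis=0)       # row indices that sort each column
    ranks = np.argsort(order, axis=0)        # rank of each entry within its column
    # Reference value for rank k: mean of the k-th smallest score across programs.
    reference = np.sort(scores, axis=0).mean(axis=1)
    return reference[ranks]

def mean_consensus(scores):
    """Consensus score per compound: mean of its quantile-normalized scores."""
    return quantile_normalize(scores).mean(axis=1)

def enrichment_factor(consensus, is_active, fraction=0.01):
    """Active rate in the best-scoring fraction divided by the overall active rate.

    Assumes lower consensus scores are better.
    """
    consensus = np.asarray(consensus, dtype=float)
    is_active = np.asarray(is_active, dtype=bool)
    n_top = max(1, int(round(fraction * len(consensus))))
    top = np.argsort(consensus)[:n_top]
    return is_active[top].mean() / is_active.mean()

# Toy input: 4 compounds scored by 3 hypothetical programs on different scales.
toy_scores = np.array([
    [-9.0, -50.0, -7.5],
    [-6.0, -30.0, -8.0],
    [-7.0, -40.0, -6.0],
    [-5.0, -20.0, -5.0],
])
toy_consensus = mean_consensus(toy_scores)
```

Quantile normalization puts programs with very different score scales onto a common distribution before averaging, so no single program dominates the mean. Tied scores within a column are ranked in arbitrary order in this sketch; a production version would likely average the reference values over ties.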
NMR and SAXS/WAXS are highly complementary approaches for the analysis of RNA structure in solution. Here we describe an efficient NMR-SAXS/WAXS approach for structural investigation of multi-helical RNAs. We illustrate this approach by determining the overall fold of a 92-nucleotide 3-helix junction from the U4/U6 di-snRNA. The U4/U6 di-snRNA is conserved in eukaryotes and is part of the U4/U6.U5 tri-snRNP, a large ribonucleoprotein complex that forms a major subunit of the assembled spliceosome. Helical orientations can be determined from X-ray scattering data alone, but the addition of NMR RDC restraints improves the structure models. RDCs were measured in two different external alignment media and also by magnetic susceptibility anisotropy. The resulting alignment tensors are collinear, which is a previously noted problem for nucleic acids. Including WAXS data in the calculations produces models with significantly better fits to the scattering data. In solution, the U4/U6 di-snRNA forms a 3-helix junction with a planar Y-shaped structure and has no detectable tertiary interactions. Single-molecule FRET data support the observed topology. A comparison with the recently determined cryo-EM structure of the U4/U6.U5 tri-snRNP illustrates how proteins scaffold the RNA and dramatically alter the geometry of the U4/U6 3-helix junction.