The Chemscore function was implemented as a scoring function for the protein-ligand docking program GOLD, and its performance compared to the original Goldscore function and two consensus docking protocols, "Goldscore-CS" and "Chemscore-GS," in terms of docking accuracy, prediction of binding affinities, and speed. In the "Goldscore-CS" protocol, dockings produced with the Goldscore function are scored and ranked with the Chemscore function; in the "Chemscore-GS" protocol, dockings produced with the Chemscore function are scored and ranked with the Goldscore function. Comparisons were made for a "clean" set of 224 protein-ligand complexes, and for two subsets of this set, one for which the ligands are "drug-like," the other for which they are "fragment-like." For "drug-like" and "fragment-like" ligands, the docking accuracies obtained with Chemscore and Goldscore functions are similar. For larger ligands, Goldscore gives superior results. Docking with the Chemscore function is up to three times faster than docking with the Goldscore function. Both combined docking protocols give significant improvements in docking accuracy over the use of the Goldscore or Chemscore function alone. "Goldscore-CS" gives success rates of up to 81% (top-ranked GOLD solution within 2.0 A of the experimental binding mode) for the "clean list," but at the cost of long search times. For most virtual screening applications, "Chemscore-GS" seems optimal; search settings that give docking speeds of around 0.25-1.3 min/compound have success rates of about 78% for "drug-like" compounds and 85% for "fragment-like" compounds. In terms of producing binding energy estimates, the Goldscore function appears to perform better than the Chemscore function and the two consensus protocols, particularly for faster search settings. Even at docking speeds of around 1-2 min/compound, the Goldscore function predicts binding energies with a standard deviation of approximately 10.5 kJ/mol.
This paper describes the development of a simple empirical scoring function designed to estimate the free energy of binding for a protein-ligand complex when the 3D structure of the complex is known or can be approximated. The function uses simple contact terms to estimate lipophilic and metal-ligand binding contributions, a simple explicit form for hydrogen bonds and a term which penalises flexibility. The coefficients of each term are obtained using a regression based on 82 ligand-receptor complexes for which the binding affinity is known. The function reproduces the binding affinity of the complexes with a cross-validated error of 8.68 kJ/mol. Tests on internal consistency indicate that the coefficients obtained are stable to changes in the composition of the training set. The function is also tested on two test sets containing a further 20 and 10 complexes, respectively. The deficiencies of this type of function are discussed and it is compared to approaches by other workers.
A procedure for analyzing and classifying publicly available crystal structures has been developed. It has been used to identify high-resolution protein-ligand complexes that can be assessed by reconstructing the electron density for the ligand using the deposited structure factors. The complexes have been clustered according to the protein sequences, and clusters have been discarded if they do not represent proteins thought to be of direct interest to the pharmaceutical or agrochemical industry. Rules have been used to exclude complexes containing non-drug-like ligands. One complex from each cluster has been selected where a structure of sufficient quality was available. The final Astex diverse set contains 85 diverse, relevant protein-ligand complexes, which have been prepared in a format suitable for docking and are to be made freely available to the entire research community (http://www.ccdc.cam.ac.uk). The performance of the docking program GOLD against the new set is assessed using a variety of protocols. Relatively unbiased protocols give success rates of approximately 80% for redocking into native structures, but it is possible to get success rates of over 90% with some protocols.
The search for new drugs is plagued by high attrition rates at all stages in research and development. Chemists have an opportunity to tackle this problem because attrition can be traced back, in part, to the quality of the chemical leads. Fragment-based drug discovery (FBDD) is a new approach, increasingly used in the pharmaceutical industry, for reducing attrition and providing leads for previously intractable biological targets. FBDD identifies low-molecular-weight ligands (∼150 Da) that bind to biologically important macromolecules. The three-dimensional experimental binding mode of these fragments is determined using X-ray crystallography or NMR spectroscopy, and is used to facilitate their optimization into potent molecules with drug-like properties. Compared with high-throughput-screening, the fragment approach requires fewer compounds to be screened, and, despite the lower initial potency of the screening hits, offers more efficient and fruitful optimization campaigns. Here, we review the rise of FBDD, including its application to discovering clinical candidates against targets for which other chemistry approaches have struggled.
We present a large test set of protein-ligand complexes for the purpose of validating algorithms that rely on the prediction of protein-ligand interactions. The set consists of 305 complexes with protonation states assigned by manual inspection. The following checks have been carried out to identify unsuitable entries in this set: (1) assessing the involvement of crystallographically related protein units in ligand binding; (2) identification of bad clashes between protein side chains and ligand; and (3) assessment of structural errors, and/or inconsistency of ligand placement with crystal structure electron density. In addition, the set has been pruned to assure diversity in terms of protein-ligand structures, and subsets are supplied for different protein-structure resolution ranges. A classification of the set by protein type is available. As an illustration, validation results are shown for GOLD and SuperStar. GOLD is a program that performs flexible protein-ligand docking, and SuperStar is used for the prediction of favorable interaction sites in proteins. The new CCDC/Astex test set is freely available to the scientific community (http://www.ccdc.cam.ac.uk).
Fragment screening offers an alternative to traditional screening for discovering new leads in drug discovery programs. This paper describes a fragment screening methodology based on high throughput X-ray crystallography. The method is illustrated against five proteins (p38 MAP kinase, CDK2, thrombin, ribonuclease A, and PTP1B). The fragments identified have weak potency (>100 microM) but are efficient binders relative to their size and may therefore represent suitable starting points for evolution to good quality lead compounds. The examples illustrate that a range of molecular interactions (i.e., lipophilic, charge-charge, neutral hydrogen bonds) can drive fragment binding and also that fragments can induce protein movement. We believe that the method has great potential for the discovery of novel lead compounds against a range of targets, and the companion paper illustrates how lead compounds have been identified for p38 MAP kinase starting from fragments such as those described in this paper.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.