The Site Identification by Ligand Competitive Saturation (SILCS) method identifies the location and approximate affinities of small molecular fragments on a target macromolecular surface by performing Molecular Dynamics (MD) simulations of the target in an aqueous solution of small molecules representative of different chemical functional groups. In this study, we introduce a set of small molecules to map potential interactions made by neutral hydrogen bond donors and acceptors, and charged donor and acceptor fragments in addition to nonpolar fragments. The affinity pattern is obtained in the form of discretized probability or, equivalently, free energy maps, called FragMaps, which can be visualized with the target surface. We performed SILCS simulations for four proteins for which structural and thermodynamic data is available for multiple, diverse ligands. Good overlap is shown between high affinity regions identified by the FragMaps and the crystallographic positions of ligand functional groups with similar chemical functionality, thus demonstrating the validity of the qualitative information obtained from the simulations. To test the ability of FragMaps in providing quantitative predictions, we calculate the previously introduced Ligand Grid Free Energy (LGFE) metric and observe its correspondence with experimentally measured binding affinity. LGFE is computed for different conformational ensembles and improvement in prediction is shown with increasing ligand conformational sampling. Ensemble generation includes a Monte Carlo sampling approach that uses the GFE FragMaps directly as the energy function. The results show some, but not all experimental trends are predicted, and warrant improvements in the scoring methodology. In addition, the potential utility of atom-based free energy contributions to the LGFE scores and the use of multiple ligands in SILCS to identify displaceable water molecules during ligand design are discussed.
Solute sampling of explicit bulk-phase aqueous environments in grand canonical (GC) ensemble simulations suffer from poor convergence due to low insertion probabilities of the solutes. To address this, we developed an iterative procedure involving Grand Canonical-like Monte Carlo (GCMC) and molecular dynamics (MD) simulations. Each iteration involves GCMC of both the solutes and water followed by MD, with the excess chemical potential (μex) of both the solute and the water oscillated to attain their target concentrations in the simulation system. By periodically varying the μex of the water and solutes over the GCMC-MD iterations, solute exchange probabilities and the spatial distributions of the solutes improved. The utility of the oscillating-μex GCMC-MD method is indicated by its ability to approximate the hydration free energy (HFE) of the individual solutes in aqueous solution as well as in dilute aqueous mixtures of multiple solutes. For seven organic solutes: benzene, propane, acetaldehyde, methanol, formamide, acetate, and methylammonium, the average μex of the solutes and the water converged close to their respective HFEs in both 1 M standard state and dilute aqueous mixture systems. The oscillating-μex GCMC methodology is also able to drive solute sampling in proteins in aqueous environments as shown using the occluded binding pocket of the T4 lysozyme L99A mutant as a model system. The approach was shown to satisfactorily reproduce the free energy of binding of benzene as well as sample the functional group requirements of the occluded pocket consistent with the crystal structures of known ligands bound to the L99A mutant as well as their relative binding affinities.
Chemical fragment cosolvent sampling techniques have become a versatile tool in ligand-protein binding prediction. Site-Identification by Ligand Competitive Saturation (SILCS) is one such method that maps the distribution of chemical fragments on a protein as free energy fields called FragMaps. Ligands are then simulated via Monte Carlo techniques in the field of the FragMaps (SILCS-MC) to predict their binding conformations and relative affinities for the target protein. Application of SILCS-MC using a number of different scoring schemes and MC sampling protocols against multiple protein targets was undertaken to evaluate and optimize the predictive capability of the method. Seven protein targets and 551 ligands with broad chemical variability were used to evaluate and optimize the model to maximize Pearson's correlation coefficient, Pearlman's Predictive Index, correct relative binding affinity and root mean square error versus the absolute experimental binding affinities. Across the protein-ligand sets, the relative affinities of the ligands were predicted correctly an average of 69 % of the time for the highest overall SILCS protocol. Training the FragMap weighting factors using a Bayesian machine learning (ML) algorithm led to an increase to an average 75 % relative correct affinity predictions. Furthermore, once the optimal protocol is identified for a specific protein-ligand system average predictabilities of 76 % are achieved. The ML algorithm is successful with small training sets of data (30 or more compounds) due to the use of physically correct FragMap weights as priors. Notably, the 76 % correct relative prediction rate is similar to or better than free energy perturbation methods that are significantly computationally more expensive than SILCS. The results further support the utility of SILCS as a powerful and computationally accessible tool to support lead optimization and development in drug discovery.
Database screening using receptor-based pharmacophores is a computer-aided drug design technique that uses the structure of the target molecule (i.e. protein) to identify novel ligands that may bind to the target. Typically receptor-based pharmacophore modeling methods only consider a single or limited number of receptor conformations and map out the favorable binding patterns in vacuum or with a limited representation of the aqueous solvent environment, such that they may suffer from neglect of protein flexibility and desolvation effects. Site-Identification by Ligand Competitive Saturation (SILCS) is an approach that takes into account these, as well as other, properties to determine 3-dimensional maps of the functional group-binding patterns on a target receptor (i.e. FragMaps). In this study, a method to use the FragMaps to automatically generate receptor-based pharmacophore models is presented. It converts the FragMaps into SILCS pharmacophore features including aromatic, aliphatic, hydrogen-bond donor and acceptor chemical functionalities. The method generates multiple pharmacophore hypotheses that are then quantitatively ranked using SILCS grid free energies. The pharmacophore model generation protocol is validated using three different protein targets, including using the resulting models in virtual screening. Improved performance and efficiency of the SILCS derived pharmacophore models as compared to published docking studies, as well as a recently developed receptor-based pharmacophore modeling method is shown, indicating the potential utility of the approach in rational drug design.
Receptor-based pharmacophore modeling is an efficient computer-aided drug design technique that uses the structure of the target protein to identify novel leads. However, most methods consider protein flexibility and desolvation effects in a very approximate way, which may limit their use in practice. The Site-Identification by Ligand Competitive Saturation (SILCS) assisted pharmacophore modeling protocol (SILCS-Pharm) was introduced recently to address these issues as SILCS naturally takes both protein flexibility and desolvation effects into account by using full MD simulations to determine 3D maps of the functional group-affinity patterns on a target receptor. In the present work, the SILCS-Pharm protocol is extended to use a wider range of probe molecules including benzene, propane, methanol, formamide, acetaldehyde, methylammonium, acetate and water. This approach removes the previous ambiguity brought by using water as both the hydrogen-bond donor and acceptor probe molecule. The new SILCS-Pharm protocol is shown to yield improved screening results as compared to the previous approach based on three target proteins. Further validation of the new protocol using five additional protein targets showed improved screening compared to those using common docking methods, further indicating improvements brought by the explicit inclusion of additional feature types associated with the wider collection of probe molecules in the SILCS simulations. The advantage of using complementary features and volume constraints, based on exclusion maps of the protein defined from the SILCS simulations, is presented. In addition, re-ranking using SILCS-based ligand grid free energies is shown to enhance the diversity of identified ligands for the majority of targets. These results suggest that the SILCS-Pharm protocol will be of utility in rational drug design.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.