2018
DOI: 10.3389/fphar.2018.00011

Decoys Selection in Benchmarking Datasets: Overview and Perspectives

Abstract: Virtual Screening (VS) is designed to prospectively help identify potential hits, i.e., compounds capable of interacting with a given target and potentially modulating its activity, out of large compound collections. Among the variety of methodologies, it is crucial to select the protocol that is the most adapted to the query/target system under study and that yields the most reliable output. To this aim, the performance of VS methods is commonly evaluated and compared by computing their ability to retrieve a…
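The truncated sentence above refers to the standard retrospective evaluation of VS protocols: rank a set of known actives seeded among decoys and measure how early the actives are retrieved. The following minimal sketch is not taken from the paper; the toy ranking and function names are illustrative. It computes the two metrics most commonly used for this purpose, the ROC AUC and an early enrichment factor.

```python
# Minimal sketch (illustrative, not from the paper): retrieval metrics for a
# retrospective VS run, given actives (1) and decoys (0) sorted best-score first.
from typing import List

def roc_auc(ranked_labels: List[int]) -> float:
    """AUC = probability that an active is ranked above a decoy."""
    n_act = sum(ranked_labels)
    n_dec = len(ranked_labels) - n_act
    if n_act == 0 or n_dec == 0:
        raise ValueError("need at least one active and one decoy")
    decoys_seen, pairs_won = 0, 0
    for label in ranked_labels:
        if label == 0:
            decoys_seen += 1
        else:
            pairs_won += n_dec - decoys_seen   # decoys still ranked below this active
    return pairs_won / (n_act * n_dec)

def enrichment_factor(ranked_labels: List[int], fraction: float = 0.01) -> float:
    """EF@fraction: hit rate in the top fraction relative to random expectation."""
    n_total = len(ranked_labels)
    n_top = max(1, int(round(fraction * n_total)))
    hits_top = sum(ranked_labels[:n_top])
    hit_rate_total = sum(ranked_labels) / n_total
    return (hits_top / n_top) / hit_rate_total

if __name__ == "__main__":
    # Toy ranking already sorted by predicted score (1 = active, 0 = decoy).
    ranking = [1, 1, 0, 1, 0, 0, 1, 0, 0, 0]
    print("AUC  :", round(roc_auc(ranking), 3))
    print("EF10%:", round(enrichment_factor(ranking, 0.10), 2))
```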

Cited by 86 publications (80 citation statements)
References 95 publications
“…Ref. [] for review). The use of polar surface area as an alternative charge‐related physical property was also evaluated, but the use of the formal charge descriptor was preferred, mainly due to practical reasons.…”
Section: Methodological Developments in Docking‐based Drug Lead Discovery (mentioning)
confidence: 79%
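The statement above weighs two charge-related descriptors, polar surface area and formal charge, for characterizing decoys and actives. As a hedged illustration only (RDKit is assumed here; the cited work does not prescribe any particular toolkit or code), both descriptors can be computed as follows:

```python
# Hedged sketch: computing the two charge-related descriptors discussed above
# (topological polar surface area vs. formal charge) with RDKit. RDKit is an
# assumption here; the example SMILES are illustrative.
from rdkit import Chem
from rdkit.Chem import Descriptors

for smiles in ["CC(=O)Oc1ccccc1C(=O)O",    # aspirin (neutral)
               "C[N+](C)(C)CCO"]:           # choline (cationic)
    mol = Chem.MolFromSmiles(smiles)
    print(smiles,
          "formal charge:", Chem.GetFormalCharge(mol),
          "TPSA:", round(Descriptors.TPSA(mol), 1))
```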
“…Chemical diversity should be sought to maximize the probability of finding novel ligands. In the case of retrospective molecular docking, the availability of ligand binding information and an unbiased database of decoys (small molecules which are assumed to be non-binders) would help to obtain reliable results from HTD. The docking strategy: in HTD, two different stages can be identified for each molecule: (1) the docking stage, where an accurate pose (or low-energy poses) of the molecule within the binding site is sought by optimizing protein–molecule interactions; (2) the scoring stage, where the low-energy poses (or the lowest one) are assigned a score intended as a measure of the probability that the molecule actually binds to the target.…”
Section: Methodological Developments in Docking‐based Drug Lead Discovery (mentioning)
confidence: 99%
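The passage above splits high-throughput docking (HTD) into a pose-search stage and a scoring stage. The toy sketch below reproduces only that control flow; the pose generator and scoring function are random-number stand-ins, and every name is hypothetical rather than a real docking API.

```python
# Toy control-flow sketch of the two HTD stages described above. "Energy" and
# "score" are random stand-ins for a real pose optimizer and scoring function;
# all function names are hypothetical.
import random

def generate_poses(molecule: str, n_poses: int = 50):
    """Stage 1 stand-in: pretend to sample poses, returning (pose_id, energy)."""
    return [(f"{molecule}_pose{i}", random.uniform(-12.0, 0.0)) for i in range(n_poses)]

def score_pose(pose_id: str, energy: float) -> float:
    """Stage 2 stand-in: map a low-energy pose to a binding-likelihood score."""
    return -energy + random.gauss(0.0, 0.5)   # lower energy -> higher score

def dock_library(library, keep_poses: int = 5):
    results = []
    for mol in library:
        # Stage 1: pose search, keep only the lowest-energy poses.
        poses = sorted(generate_poses(mol), key=lambda p: p[1])[:keep_poses]
        # Stage 2: scoring, keep the best score per molecule for ranking.
        best = max(score_pose(pid, e) for pid, e in poses)
        results.append((mol, best))
    return sorted(results, key=lambda r: r[1], reverse=True)

if __name__ == "__main__":
    random.seed(0)
    for mol, score in dock_library(["mol_A", "mol_B", "mol_C"]):
        print(f"{mol}: {score:.2f}")
```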
“…pointed out that the assignment of different weights to the chemical features (represented by different colours), thereby changing the combo score, could improve the performance in virtual screening. Additionally, a model's enrichment can be either over- or underestimated by inappropriate decoy selection. Specifically, DUD‐E's decoy set was shown to overestimate AUC values when used to validate machine learning‐based models.…”
Section: Results (mentioning)
confidence: 99%
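The statement above notes that enrichment can be over- or underestimated by inappropriate decoy selection. Below is a synthetic illustration of the overestimation case (all numbers invented): a "model" that scores compounds by a single physicochemical property, molecular weight, looks highly predictive against property-biased decoys and near-random against property-matched ones.

```python
# Synthetic illustration of artificial enrichment: if decoys differ
# systematically from actives in a trivial property, a score based on that
# property alone looks highly "predictive". All values are made up.
import random

def auc(pos, neg):
    """Probability that a random positive scores above a random negative."""
    wins = sum(1 for p in pos for n in neg if p > n) + \
           0.5 * sum(1 for p in pos for n in neg if p == n)
    return wins / (len(pos) * len(neg))

random.seed(1)
actives        = [random.gauss(400, 30) for _ in range(100)]   # MW of actives
biased_decoys  = [random.gauss(310, 30) for _ in range(500)]   # lighter decoys
matched_decoys = [random.gauss(400, 30) for _ in range(500)]   # MW-matched decoys

# "Score" = molecular weight itself: no real binding information at all.
print("AUC vs. biased decoys :", round(auc(actives, biased_decoys), 3))   # ~0.98
print("AUC vs. matched decoys:", round(auc(actives, matched_decoys), 3))  # ~0.5
```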
“…As CSMs calculate the similarity between a compound and a query defined by the molecular volume and some chemical features distributed at specific positions in space, we assumed that dissimilar compounds in the decoy set may not interfere with the model's evaluation. Accordingly, a lack of chemical diversity in the active compound set is a major limiting step for model training and can become a source of bias for the model. Therefore, for this work, all analysed subsets (active, inactive and decoy compounds) were similar in terms of physicochemical properties to avoid bias.…”
Section: Results (mentioning)
confidence: 99%
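The statement above argues that actives, inactives and decoys should be matched on physicochemical properties to avoid bias. A minimal sketch of such a property-matching filter, in the spirit of DUD/DUD-E style decoy selection, is given below; the property dictionaries and tolerance values are purely illustrative.

```python
# Minimal sketch of property-matched decoy selection: keep candidate decoys
# whose simple physicochemical properties fall within tolerances of a given
# active. Property values and tolerances are illustrative, not prescribed.
TOLERANCES = {"mw": 25.0, "logp": 1.0, "hbd": 1, "hba": 2, "rot_bonds": 2}

def is_property_matched(active: dict, candidate: dict) -> bool:
    return all(abs(active[k] - candidate[k]) <= tol for k, tol in TOLERANCES.items())

def select_decoys(active: dict, candidates: list, n_decoys: int = 50) -> list:
    matched = [c for c in candidates if is_property_matched(active, c)]
    return matched[:n_decoys]

if __name__ == "__main__":
    active = {"id": "act_1", "mw": 342.4, "logp": 2.8, "hbd": 2, "hba": 5, "rot_bonds": 4}
    pool = [
        {"id": "dec_1", "mw": 350.1, "logp": 3.1, "hbd": 2, "hba": 6, "rot_bonds": 5},
        {"id": "dec_2", "mw": 210.2, "logp": 0.4, "hbd": 0, "hba": 2, "rot_bonds": 1},
    ]
    print([d["id"] for d in select_decoys(active, pool)])   # only dec_1 matches
```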
“…Since the release of the first benchmarking sets, constant efforts have been made to improve the quality of benchmarking sets by reducing three main types of bias, i.e., artificial enrichment, analogue bias and false negative bias. Among the currently available benchmarking sets, MUV, DEKOIS, DUD‐E, NRLiSt BDB and MUBD‐HDACs were selected as the state of the art as of 2017. It is worth mentioning that MUBD‐HDACs, whose full name is Maximal Unbiased Benchmarking Data sets for HDACs, was made possible by our own research effort.…”
Section: Introduction (mentioning)
confidence: 99%
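Of the three biases listed above, analogue bias arises when the active set is dominated by close structural analogues. A hedged sketch of how one might flag it (RDKit is assumed; the SMILES and the cutoff mentioned in the comment are illustrative) computes the mean pairwise Tanimoto similarity over Morgan fingerprints of the actives.

```python
# Hedged sketch of an analogue-bias check in the spirit of the passage above:
# mean pairwise Tanimoto similarity among the actives' Morgan fingerprints.
# RDKit is assumed; SMILES and the cutoff mentioned below are illustrative.
from itertools import combinations
from rdkit import Chem, DataStructs
from rdkit.Chem import AllChem

actives_smiles = [
    "CC(=O)Oc1ccccc1C(=O)O",        # aspirin
    "CC(=O)Nc1ccc(O)cc1",           # paracetamol
    "CC(C)Cc1ccc(cc1)C(C)C(=O)O",   # ibuprofen
]

fps = [AllChem.GetMorganFingerprintAsBitVect(Chem.MolFromSmiles(s), 2, nBits=2048)
       for s in actives_smiles]

sims = [DataStructs.TanimotoSimilarity(a, b) for a, b in combinations(fps, 2)]
print(f"mean pairwise Tanimoto among actives: {sum(sims) / len(sims):.2f}")
# A high mean (e.g. > 0.7, an arbitrary illustrative cutoff) would suggest the
# actives are close analogues and the benchmark may carry analogue bias.
```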