Prediction of Binding Free Energy of Protein–Ligand Complexes with a Hybrid Molecular Mechanics/Generalized Born Surface Area and Machine Learning Method

Dong, Liang; Qu, Xiaoyang; Zhao, Yuan; Wang, Binju

doi:10.1021/acsomega.1c04996

Cited by 27 publications

(20 citation statements)

References 77 publications

(107 reference statements)

“…Unfortunately, the problematic parameter choices of these physical models have overshadowed the valuable information extracted from the molecular surface. There are some recent efforts to directly incorporate the surface area descriptors to capture the protein–ligand potency. However, conventional surface area models do not portray crucial physical and chemical interactions such as noncovalent bonds, hydrogen bonds, van der Waals interactions, etc., which lead to discouraging results and limited capacity to handle diverse biomolecular data sets. These issues call for robustness and scalable surface area representations for biomolecular structures.…”

Section: Discussion (mentioning)

confidence: 99%

“…57 However, the role of the surface area in capturing the crucial physical and chemical interactions in the biomolecular structures is not fully explored. Despite the recent efforts to integrate the surface area information into predictive models such as Cyscore 9 and GLXE 26 for protein–ligand binding affinity prediction, those surface area-based models are far from the competitive level with their counterparts.…”

Section: Introduction (mentioning)

confidence: 99%

EISA-Score: Element Interactive Surface Area Score for Protein–Ligand Binding Affinity Prediction

Rana, Nguyen. J. Chem. Inf. Model. 2022.

Molecular surface representations have been advertised as a great tool to study protein structure and functions, including protein–ligand binding affinity modeling. However, the conventional surface-area-based methods fail to deliver a competitive performance on energy scoring tasks. The main reason is that the generated molecular surfaces do not encode crucial physical and chemical interactions. We present novel molecular surface representations embedded in different scales of the element interactive manifolds, featuring dramatic dimensionality reduction and accurate encoding of physical and biological properties. Those low-dimensional surface-based descriptors can be paired with any advanced machine learning algorithm to explore the essential structure–activity relationships, giving rise to the element interactive surface area-based scoring function (EISA-score). The newly developed EISA-score has outperformed many state-of-the-art models, including various well-established surface-related representations, in standard PDBbind benchmarks.

“…Thanks to the recent developments in ML force fields (Unke et al, 2021), accurate alchemical free energy calculations based on such force fields are starting to appear (Rufa et al, 2020; Wieder et al, 2021). ML-based corrections to conventional free energy calculations will also play an important role in reaching good prediction accuracy of protein-ligand binding free energies (Dong et al, 2021). While such methods are outside the scope of this review, we believe the exploration and development of ML and DL methods in the field of free energy calculations will provide very interesting outcomes in the coming years, by getting the methodology closer to chemical accuracy while significantly reducing computational costs.…”

Section: Other Methods (mentioning)

confidence: 99%

Scoring Functions for Protein-Ligand Binding Affinity Prediction Using Structure-based Deep Learning: A Review

2022

The rapid and accurate in silico prediction of protein-ligand binding free energies or binding affinities has the potential to transform drug discovery. In recent years, there has been a rapid growth of interest in deep learning methods for the prediction of protein-ligand binding affinities based on the structural information of protein-ligand complexes. These structure-based scoring functions often obtain better results than classical scoring functions when applied within their applicability domain. Here we review structure-based scoring functions for binding affinity prediction based on deep learning, focussing on different types of architectures, featurization strategies, data sets, methods for training and evaluation, and the role of explainable artificial intelligence in building useful models for real drug-discovery applications.

“…The performances of ML-based SFs are highly dependent on the input features. Generally speaking, input features can be specific energy features, protein–ligand atom pairwise counts or potentials, interaction fingerprints, mathematical features, grid-based features, graph-based features, etc. Unlike other machine learning algorithms, many deep learning-based SFs can automatically extract features and use them for training. Currently, extensive efforts still rely on the use of traditional ML to improve the scoring power of SFs.…”

Section: Introduction (mentioning)

confidence: 99%

Systematic Improvement of the Performance of Machine Learning Scoring Functions by Incorporating Features of Protein-Bound Water Molecules

Dong, Zhang, et al. J. Chem. Inf. Model. 2022. (Self cite)

Water molecules at the ligand–protein interfaces play crucial roles in the binding of the ligands, but the behavior of protein-bound water is largely ignored in many currently used machine learning (ML)-based scoring functions (SFs). In an attempt to improve the prediction performance of existing ML-based SFs, we estimated the water distribution with a HydraMap (HM) method and then incorporated the features extracted from protein-bound waters obtained in this way into three ML-based SFs: RF-Score, ECIF, and PLEC. It was found that a combination of HM-based features can consistently improve the performance of all three SFs, including their scoring, ranking, and docking power. HydraMap-based features show consistently good performance with both crystal structures and docked structures, demonstrating their robustness for SFs. Overall, HM-based features, which are a statistical representation of hydration sites at protein–ligand interfaces, are expected to improve the prediction performance for diverse SFs.
