2022
DOI: 10.3390/biom12070919
|View full text |Cite
|
Sign up to set email alerts
|

A Physics-Guided Neural Network for Predicting Protein–Ligand Binding Free Energy: From Host–Guest Systems to the PDBbind Database

Abstract: Calculation of protein–ligand binding affinity is a cornerstone of drug discovery. Classic implicit solvent models, which have been widely used to accomplish this task, lack accuracy compared to experimental references. Emerging data-driven models, on the other hand, are often accurate yet not fully interpretable and also likely to be overfitted. In this research, we explore the application of Theory-Guided Data Science in studying protein–ligand binding. A hybrid model is introduced by integrating Graph Convo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1
1

Relationship

2
4

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 62 publications
0
2
0
Order By: Relevance
“…The downside to acquiring data from Sabio-RK or BRENDA is it will have to be cleaned extensively as this data is of quite low quality due to misannotations, human error in curation, and high experimental variance [53,51,52]. Another possible approach for improving model performance is to integrate physical principles of molecule-protein interaction into enzymesubstrate prediction models, which could lead to further improvements in model performance [63,64]. The addition of 3D protein information could also be explored as this is critical for determining substrate preferences in many enzyme-catalyzed reactions [65][66][67][68].…”
Section: Discussionmentioning
confidence: 99%
“…The downside to acquiring data from Sabio-RK or BRENDA is it will have to be cleaned extensively as this data is of quite low quality due to misannotations, human error in curation, and high experimental variance [53,51,52]. Another possible approach for improving model performance is to integrate physical principles of molecule-protein interaction into enzymesubstrate prediction models, which could lead to further improvements in model performance [63,64]. The addition of 3D protein information could also be explored as this is critical for determining substrate preferences in many enzyme-catalyzed reactions [65][66][67][68].…”
Section: Discussionmentioning
confidence: 99%
“…It is then not surprising that the recent explosive development of machine learning (ML) techniques, including deep neural networks (DNNs), is already making a noticeable impact on this field, including works aimed directly at improving the accuracy of the description of complex solvation effects. It should be noted that the majority of these recent works combine quantum mechanics (QM)-based methodology with ML, while our interest here is purely classical approaches. Among recent purely ML-based approaches, a featurization algorithm, based on functional class fingerprints and implemented within the DeepChem ML framework, was used in ref to predict the hydration-free energies (HFEs) of a diverse set of 642 neutral small molecules available in FreeSolvarguably the largest public database of experimentally measured HFEs.…”
Section: Introductionmentioning
confidence: 99%
“…Emerging data-driven models, on the other hand, are often accurate, yet not fully interpretable, and are likely to be overfitted. In [ 3 ], Cain et al explored the application of theory-guided data science in studying protein–ligand binding. A hybrid model is introduced by integrating a graph convolutional network (data-driven model) with the GBNSR6 implicit solvent (physics-based model).…”
mentioning
confidence: 99%