2022
DOI: 10.1016/j.drudis.2022.103372
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning methods for pKa prediction of small molecules: Advances and challenges

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
17
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 25 publications
(27 citation statements)
references
References 59 publications
0
17
0
Order By: Relevance
“…where δ i is 1 for acids and −1 for bases. However, accurate prediction of pK a is still quite difficult due to the scarcity of data and inherent complexity of the property, 17 making direct application of this formula a great challenge. 18 In recent years, with the development of machine learning (ML) algorithms, many handcrafted descriptor-based ML models have been developed for log D 7.4 prediction, such as random forest (RF), support vector machine (SVM), and extreme gradient boosting (XGBoost).…”
Section: ■ Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…where δ i is 1 for acids and −1 for bases. However, accurate prediction of pK a is still quite difficult due to the scarcity of data and inherent complexity of the property, 17 making direct application of this formula a great challenge. 18 In recent years, with the development of machine learning (ML) algorithms, many handcrafted descriptor-based ML models have been developed for log D 7.4 prediction, such as random forest (RF), support vector machine (SVM), and extreme gradient boosting (XGBoost).…”
Section: ■ Introductionmentioning
confidence: 99%
“…Experimental approaches such as the shake flask method, filter probe method, “slow stirring” method, chromatography method, and potentiometric titration method are costly and time-consuming. On the other hand, log D 7.4 can be derived from log P and p K a as follows log nobreak0em.25em⁡ D false( pH false) = log nobreak0em.25em⁡ P log ( 1 + 10 false( pH normalp K normala false) δ i ) where δ i is 1 for acids and −1 for bases. However, accurate prediction of p K a is still quite difficult due to the scarcity of data and inherent complexity of the property, making direct application of this formula a great challenge …”
Section: Introductionmentioning
confidence: 99%
“…It prevents time- and cost-consuming experiments, suggesting easily screenable sets of data. Computational calculation of acid–base dissociation constants (pKa) is among the most embraced methods [ 8 , 9 ].…”
Section: Introductionmentioning
confidence: 99%
“…QSAR based on machine learning can exploit different approaches ranging from the descriptor models (i. e. atomic descriptor, rooted fingerprints, and hybrid features) to the graph-based models, organized in kernels and neural networks. [3] The former cover simple linear regression complex to neural network, while the latter still require a remarkable number of steps in the identification of compound resides, the overall optimization of structures, and algorithms. [3] Among physics-based models, the ab initio bond length high correlation subsets (AIBLHiCoS or AIBL) method gave promising results for the pKa prediction of a wide range of complex organic molecules.…”
Section: Introductionmentioning
confidence: 99%