2022
DOI: 10.1021/acsomega.2c03885
|View full text |Cite
|
Sign up to set email alerts
|

Prediction of the Aqueous Solubility of Compounds Based on Light Gradient Boosting Machines with Molecular Fingerprints and the Cuckoo Search Algorithm

Abstract: Aqueous solubility is one of the most important physicochemical properties in drug discovery. At present, the prediction of aqueous solubility of compounds is still a challenging problem. Machine learning has shown great potential in solubility prediction. Most machine learning models largely rely on the setting of hyperparameters, and their performance can be improved by setting the hyperparameters in a better way. In this paper, we used MACCS fingerprints to represent the structural features and optimized th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(9 citation statements)
references
References 38 publications
0
7
0
Order By: Relevance
“…The schematic representation of the GBM model is shown in Figure . Due to the desirable properties of GBM and its excellent prediction capabilities, GBM is being increasingly used in different fields of science and engineering. Recently Li et al employed different versions of GBM for prediction of aqueous solubility of compounds.…”
Section: Brief Descriptions Of Gradient Boosting Machines (Gbm) Suppo...mentioning
confidence: 99%
“…The schematic representation of the GBM model is shown in Figure . Due to the desirable properties of GBM and its excellent prediction capabilities, GBM is being increasingly used in different fields of science and engineering. Recently Li et al employed different versions of GBM for prediction of aqueous solubility of compounds.…”
Section: Brief Descriptions Of Gradient Boosting Machines (Gbm) Suppo...mentioning
confidence: 99%
“…Researchers have used different types of artificial intelligence algorithms to build predictive models of chemicals, including artificial neural network (ANN), light gradient boosting (LightGBM), deep neural networks (DNN), random forest (RF), extra trees (ET), multiple linear regression (MLR), partial least squared (PLS), k-nearest neighbors (k-NN), support vector machine (SVM), and Ridge regression. For example, Ling et al used the XGBoost algorithm combined with SHapley Additive exPlanation (SHAP) to predict the viscosities of deep eutectic solvents (DESs) at different temperatures . Shen et al used different algorithms to predict properties such as the flash point temperature of substances and obtained accurate results. However, using chemical simulation software to calculate the viscosities of polymers under different conditions poses some difficulties and challenges.…”
Section: Introductionmentioning
confidence: 99%
“…Chinta and Rengaswamy proposed a machine learning-based QSPR-based approach to predict drug solubility in binary solvent systems using structural features, such as molar refractivity, McGowan volume, topological surface area, etc. The CS-LightGBM model was established by Li to predict the aqueous solubility of 2446 organic compounds. This model had a better prediction performance than the other six different models and can be used to solve solubility prediction problems in other fields, such as solvent selection and drug screening.…”
Section: Introductionmentioning
confidence: 99%