2022
DOI: 10.1016/j.ces.2021.117219
|View full text |Cite
|
Sign up to set email alerts
|

Insights into ensemble learning-based data-driven model for safety-related property of chemical substances

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
15
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 23 publications
(15 citation statements)
references
References 40 publications
(47 reference statements)
0
15
0
Order By: Relevance
“…Before training the framework, the dataset needs to be divided into two disjoint subsets: a training set for model training with multilayer stack ensembling and n ‐repeated k ‐fold bagging, and a test set for evaluating the external predictive performance of the developed model. In this work, identical to Wang et al, 3 the training and test set account for 90% and 10% of the entire FPT dataset, respectively.…”
Section: Case Studiesmentioning
confidence: 99%
See 2 more Smart Citations
“…Before training the framework, the dataset needs to be divided into two disjoint subsets: a training set for model training with multilayer stack ensembling and n ‐repeated k ‐fold bagging, and a test set for evaluating the external predictive performance of the developed model. In this work, identical to Wang et al, 3 the training and test set account for 90% and 10% of the entire FPT dataset, respectively.…”
Section: Case Studiesmentioning
confidence: 99%
“…Toward this goal, recent studies have shown increasingly remarkable interest in applying machine learning (ML) for modeling a wide variety of molecular properties, including melting points (MPs), flash points, logP (the logarithm of the octanol-water partition coefficient), toxicity, and so forth. [2][3][4][5][6][7][8][9] These applications have demonstrated high accuracies comparable to traditional modeling methods and experimental measurements at a fraction of the computational, time, and money cost. 9,10 As shown in Figure 1, when using ML for QSPR model development, every aspect of ML applications such as feature engineering, ML model selection, and model optimization, needs to be carefully configured.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Wang et al 118 conducted a comprehensive study on the safety-related properties of organic compounds based on Stacking. Their work considers several representative ML models, including multiple linear regression, extreme learning machine, feedforward neural network, and support vector machine.…”
Section: Ensemble Learning Enhances Model Prediction Performancementioning
confidence: 99%
“…Researchers have used different types of artificial intelligence algorithms to build predictive models of chemicals, including artificial neural network (ANN), light gradient boosting (LightGBM), deep neural networks (DNN), random forest (RF), extra trees (ET), multiple linear regression (MLR), partial least squared (PLS), k-nearest neighbors (k-NN), support vector machine (SVM), and Ridge regression. For example, Ling et al used the XGBoost algorithm combined with SHapley Additive exPlanation (SHAP) to predict the viscosities of deep eutectic solvents (DESs) at different temperatures . Shen et al used different algorithms to predict properties such as the flash point temperature of substances and obtained accurate results. However, using chemical simulation software to calculate the viscosities of polymers under different conditions poses some difficulties and challenges. The variety and variability of polymer systems hinder generalization and standardization.…”
Section: Introductionmentioning
confidence: 99%