Automatic Machine Learning Combined with High-Throughput Computational Screening of Hydrophobic Metal–Organic Frameworks for Capture of Methanol and Ethanol from the Air

Zhang, Lulu; Huang, Qiuyuan; Li, Lifeng; Yan, Yaling; Yuan, Xueying; Liang, Hong; Li, Shuhua; Wang, Bangfen; Qiao, Zhiwei

doi:10.1021/acsestengg.2c00424

Cited by 2 publications

(4 citation statements)

References 61 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The prediction accuracy of the AutoML‐DE model for various output parameters is evaluated by the root mean square error (RMSE) and determination coefficient ( R 2 ), which are defined as shown in Equations () and () 33 . R 2 is an indicator of the similarity between estimated and measured values, while RMSE serves as the standard deviation between predicted and true values and is widely used to measure the degree of difference between the two 20

RMSE = \sqrt{\frac{1}{n} {false\sum}_{i = 1}^{n} {((), Y_{i} goodbreak- {\overset{truê}{Y}}_{i})}^{2}},

R^{2} = 1 - \frac{{false\sum}_{i = 1}^{n} {((), {\overset{truê}{Y}}_{i} goodbreak- \overset{Y}{true})}^{2}}{{false\sum}_{i = 1}^{n} {((), Y_{i} goodbreak- \overset{Y}{true})}^{2}},

where n is the number of samples;

Y_{i}

and

{\overset{Y}{true}}_{i}

represent the true and predicted values of the i th data; and

\overset{true¯}{Y}

denotes the average of the dataset.…”

Section: Methodsmentioning

confidence: 99%

“…Professor Teng Zhou's team 17–19 have employed this approach to analyze the operational efficacy of covalent organic frameworks in methane utilization, highlighting that AutoML can streamline the modeling process, enable automatic configuration of ML model parameters, and overcome the constraint of expert involvement. Similarly, Zhang et al 20 studied an AutoML algorithm to improve the prediction accuracy of the adsorption performance of metal‐organic frameworks. They found that the AutoML algorithm can not only enhance the reliability of prediction results, but also avoid overfitting in the model.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

An auto‐configurable machine learning framework to optimize and predict catalysts for CO₂ to light olefins process

Yang,

Fan,

Rong

et al. 2024

AIChE Journal

View full text Add to dashboard Cite

This study proposed an auto‐configurable machine learning framework based on the differential evolution algorithm (AutoML‐DE) driven by hybrid data for the screening and discovery of promising CO2 to light olefins (CO2TLO) catalysts candidates. The hybrid dataset comprises 532 experimental data from the literature and 296 simulation data. Results show that the AutoML‐DE model with extreme gradient boosting algorithms demonstrated superior performance for predicting the conversion ratio of CO2 and selectivity of light olefins (average R2 > 0.86). After identifying the input feature with the most significant impact on the output feature, the optimal AutoML‐DE model integrated with the genetic algorithm is applied to multiobjective optimization, sensitivity analysis, and prediction of new CO2TLO catalysts. The optimized Cu‐Zn‐Al/SAPO‐34 catalyst has the highest catalytic performance among the reported CO2TLO catalysts. Moreover, five new CO2TLO catalysts with higher yields are successfully predicted. However, the performance of these catalysts should be further verified by experiment.

show abstract

RMSE = \sqrt{\frac{1}{n} {false\sum}_{i = 1}^{n} {((), Y_{i} goodbreak- {\overset{truê}{Y}}_{i})}^{2}},

R^{2} = 1 - \frac{{false\sum}_{i = 1}^{n} {((), {\overset{truê}{Y}}_{i} goodbreak- \overset{Y}{true})}^{2}}{{false\sum}_{i = 1}^{n} {((), Y_{i} goodbreak- \overset{Y}{true})}^{2}},

where n is the number of samples;

Y_{i}

and

{\overset{Y}{true}}_{i}

represent the true and predicted values of the i th data; and

\overset{true¯}{Y}

denotes the average of the dataset.…”

Section: Methodsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

An auto‐configurable machine learning framework to optimize and predict catalysts for CO₂ to light olefins process

Yang,

Fan,

Rong

et al. 2024

AIChE Journal

View full text Add to dashboard Cite

show abstract

“…Since the range of data varies widely, the comparison will be inconsistent without normalization. Hence, this work used a row scale normalization from 1 to 10 approach in data processing, which has also been used in previous work . The unified units facilitated meaningful comparisons and analyses, ensuring the data set’s quality and integrity.…”

Section: Methodsmentioning

confidence: 99%

“…The AutoML can automatically select the suitable model to predict methane production and determine the related hyperparameters without artificial intervention. Meanwhile, AutoML can improve the reliability of prediction results by avoiding overfitting and other issues . However, the limited availability of data often restricts the application of data-driven models in anaerobic bioprocess research, especially in the context of novel bioprocess testing and initial experimental endeavors …”

Section: Introductionmentioning

confidence: 99%

Predicting and Evaluating Different Pretreatment Methods on Methane Production from Sludge Anaerobic Digestion via Automated Machine Learning with Ensembled Semisupervised Learning

Cheng,

Xu,

et al. 2023

ACS EST Eng.

View full text Add to dashboard Cite

Accurate prediction of methane production in anaerobic digestion with various pretreatment strategies is of the utmost importance for efficient sludge treatment and resource recovery. Traditional machine learning (ML) algorithms have shown limited prediction accuracy due to challenges in optimizing complex parameters and the scarcity of data. This work proposed a novel integrated system that employed an ensemble semisupervised learning (SSL)-automated ML (AutoML) model with limited variable inputs to reveal the effects of different pretreatments on methane production during sludge digestion with explainable analysis. Considering the direct correlations of the pretreatment type and digestion substrates, the pretreatment type is considered as a hidden variable. Results demonstrated that the AutoML model outperformed the conventional ML models (i.e., support vector regression (SVR), extreme gradient boosting (XGB), etc.), as evidenced by its higher R 2 value. Moreover, the integration of SSL further enhanced the prediction accuracy by effectively leveraging unlabeled data, leading to a reduction in the mean squared error from 11.3 to 9.7. Explainable analysis results revealed the significance of different variables and the utmost importance of operating time, followed by proteins, carbohydrates, chemical oxygen demand, and volatile fatty acids. Furthermore, principal component and correlation analysis unveiled the interconnected relationships among substrate concentration, microbial communities, and metabolic functions for methane production and found that the increasing substrate concentration promoted the enrichment of functional microbial and metabolic functions. These insights shed light on the advantages of SSL-AutoML in predicting methane production in anaerobic digestion systems and elucidate the dependence relationships with key variables, offering valuable guidance for effective sludge pretreatment with enhanced resource recovery capabilities.

show abstract

Automatic Machine Learning Combined with High-Throughput Computational Screening of Hydrophobic Metal–Organic Frameworks for Capture of Methanol and Ethanol from the Air

Cited by 2 publications

References 61 publications

An auto‐configurable machine learning framework to optimize and predict catalysts for CO₂ to light olefins process

An auto‐configurable machine learning framework to optimize and predict catalysts for CO₂ to light olefins process

Predicting and Evaluating Different Pretreatment Methods on Methane Production from Sludge Anaerobic Digestion via Automated Machine Learning with Ensembled Semisupervised Learning

Contact Info

Product

Resources

About

Automatic Machine Learning Combined with High-Throughput Computational Screening of Hydrophobic Metal–Organic Frameworks for Capture of Methanol and Ethanol from the Air

Cited by 2 publications

References 61 publications

An auto‐configurable machine learning framework to optimize and predict catalysts for CO2 to light olefins process

An auto‐configurable machine learning framework to optimize and predict catalysts for CO2 to light olefins process

Predicting and Evaluating Different Pretreatment Methods on Methane Production from Sludge Anaerobic Digestion via Automated Machine Learning with Ensembled Semisupervised Learning

Contact Info

Product

Resources

About

An auto‐configurable machine learning framework to optimize and predict catalysts for CO₂ to light olefins process

An auto‐configurable machine learning framework to optimize and predict catalysts for CO₂ to light olefins process