2021
DOI: 10.1038/s41598-021-88027-8
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning with persistent homology and chemical word embeddings improves prediction accuracy and interpretability in metal-organic frameworks

Abstract: Machine learning has emerged as a powerful approach in materials discovery. Its major challenge is selecting features that create interpretable representations of materials, useful across multiple prediction tasks. We introduce an end-to-end machine learning model that automatically generates descriptors that capture a complex representation of a material’s structure and chemistry. This approach builds on computational topology techniques (namely, persistent homology) and word embeddings from natural language … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
40
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 45 publications
(41 citation statements)
references
References 37 publications
(12 reference statements)
1
40
0
Order By: Relevance
“…If only MOFs that are potentially suitable for separation and storage applications (i.e., k H,CH4 > 10 –7 [mol/kg/Pa]) are considered, the RMSE value is even lower to be 0.18. Such a small RMSE value suggests the suitability of the CNN model in a high-throughput screening and it is also among the best machine learning models that have been reported to date for methane adsorption under dilute conditions to the best of our knowledge …”
Section: Resultsmentioning
confidence: 92%
See 1 more Smart Citation
“…If only MOFs that are potentially suitable for separation and storage applications (i.e., k H,CH4 > 10 –7 [mol/kg/Pa]) are considered, the RMSE value is even lower to be 0.18. Such a small RMSE value suggests the suitability of the CNN model in a high-throughput screening and it is also among the best machine learning models that have been reported to date for methane adsorption under dilute conditions to the best of our knowledge …”
Section: Resultsmentioning
confidence: 92%
“…While classical molecular simulations represent an important means toward the search of optimal materials, conducting an exhaustive computational screening in a brute-force manner is still resource-demanding. To this end, machine learning (ML) has recently drawn considerable attention for its great potential in discoveries of high-throughput materials. For example, Cai’s group employed support vector machine (SVM) and tree-based regressions with 37 tailor-made features to study 130 397 hypothetical MOF structures and shed light on the optimal combinations of structural features (e.g., void fraction and surface area) for methane storage . Woo and co-workers also implemented SVM-based machine learning to predict MOFs’ methane deliverable capacity and elucidated the quantitative structure–property relationship (QSPR) .…”
Section: Introductionmentioning
confidence: 99%
“…To this end, we envision that PLD, particularly PLD t‑r proposed in this study, can still serve as a very useful means of preliminary screening of materials. Machine learning (ML) may also be a promising approach to improve the prediction accuracy, as it has been successfully employed for predicting the adsorption properties of various gases in MOFs. Besides PLD t‑r , a variety of other easy-to-compute features such as LCD, void fraction, and open-metal-site angle can be used as input features to train an ML model for the prediction of gaseous transport properties in porous materials.…”
Section: Resultsmentioning
confidence: 99%
“…To address this challenge, the use of ML in conjunction with MOFs has also grown significantly in the past 5 years, from gas adsorption prediction to partial atomic charges, band gap, and other mechanical and chemical property predictions. 2 , 3 , 4 , 5 , 6 , 7 Due to their porous nature, fast kinetics, reversibility, and high gravimetric densities, MOFs have been widely studied for many gas-storage problems, including natural gas and hydrogen. Hydrogen storage is a key enabling technology as hydrogen is considered both a future automotive fuel and a medium for energy storage; however, its application has been limited by hydrogen’s low volumetric density at ambient conditions.…”
Section: Main Textmentioning
confidence: 99%