2023
DOI: 10.1021/acs.chemmater.2c02485
|View full text |Cite
|
Sign up to set email alerts
|

ARC–MOF: A Diverse Database of Metal-Organic Frameworks with DFT-Derived Partial Atomic Charges and Descriptors for Machine Learning

Abstract: Metal–organic frameworks (MOFs) are a class of crystalline materials composed of metal nodes or clusters connected via semi-rigid organic linkers. Owing to their high-surface area, porosity, and tunability, MOFs have received significant attention for numerous applications such as gas separation and storage. Atomistic simulations and data-driven methods [e.g., machine learning (ML)] have been successfully employed to screen large databases and successfully develop new experimentally synthesized and validated M… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
60
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 48 publications
(60 citation statements)
references
References 109 publications
0
60
0
Order By: Relevance
“…Machine learning studies typically need to go through multiple, often iterative stages, all supported by our software library. Collecting a data set. For machine learning efforts to be comparable, consistent data sets, along with measures that mitigate data leakage, are needed. In , we provide a consistent interface to multiple commonly used data sets , as well as a completely new data set of adsorption properties, complementing the QMOF database. , Additionally, we implement measures to mitigate the effects of data leakage. Featurizing a material. Most machine learning models only accept inputs of fixed shape.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Machine learning studies typically need to go through multiple, often iterative stages, all supported by our software library. Collecting a data set. For machine learning efforts to be comparable, consistent data sets, along with measures that mitigate data leakage, are needed. In , we provide a consistent interface to multiple commonly used data sets , as well as a completely new data set of adsorption properties, complementing the QMOF database. , Additionally, we implement measures to mitigate the effects of data leakage. Featurizing a material. Most machine learning models only accept inputs of fixed shape.…”
Section: Resultsmentioning
confidence: 99%
“…In , we provide a consistent interface to multiple commonly used data sets , as well as a completely new data set of adsorption properties, complementing the QMOF database. , Additionally, we implement measures to mitigate the effects of data leakage.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…drug delivery [6], conductivity [7], and so on. To date, there are more than 100,000 experimentally reported MOF structures [8][9][10] and millions of MOF structures that have been predicted in silico [2,[11][12][13][14][15][16][17][18].…”
Section: Introductionmentioning
confidence: 99%
“…However, as we are interested in a small subset of top-performing materials, most of the effort in these brute-force methods is spent on computing properties of those materials that are not interesting. As the number of structures in these databases has grown significantly, several groups have started taking a different approach in searching more efficiently within this infinite chemical space of MOFs, using methods such as diversity-driven searches [16,17,20], and active learning based searches [19,21].…”
Section: Introductionmentioning
confidence: 99%