2023
DOI: 10.1021/acsomega.3c02203
|View full text |Cite
|
Sign up to set email alerts
|

Clusterome: A Comprehensive Data Set of Atmospheric Molecular Clusters for Machine Learning Applications

Abstract: Formation and growth of atmospheric molecular clusters into aerosol particles impact the global climate and contribute to the high uncertainty in modern climate models. Cluster formation is usually studied using quantum chemical methods, which quickly becomes computationally expensive when system sizes grow. In this work, we present a large database of ∼250k atmospheric relevant cluster structures, which can be applied for developing machine learning (ML) models. The database is used to train the ML model kern… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1
1

Relationship

3
4

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 54 publications
2
1
0
Order By: Relevance
“…Hence, based on these findings, if a semiempirical method should be used in the funneling approach, GFN1-xTB is a better choice than GFN2-xTB. This is consistent with recent benchmark studies 22 , 23 , 51 , 52 and shows that the trend follows here. The empirically corrected DFT methods, PBEh-3c, B97-3c, and r 2 SCAN-3c, perform well with RMSDs of 0.18, 0.11, and 0.09 Å, respectively.…”
Section: Resultssupporting
confidence: 94%
“…Hence, based on these findings, if a semiempirical method should be used in the funneling approach, GFN1-xTB is a better choice than GFN2-xTB. This is consistent with recent benchmark studies 22 , 23 , 51 , 52 and shows that the trend follows here. The empirically corrected DFT methods, PBEh-3c, B97-3c, and r 2 SCAN-3c, perform well with RMSDs of 0.18, 0.11, and 0.09 Å, respectively.…”
Section: Resultssupporting
confidence: 94%
“…Hence, based on these findings, if a semi-empirical method should be used in the funneling approach, GFN1-xTB is a better choice than GFN2-xTB. This is consistent with recent benchmark studies 22,23,49,50 and shows that the trend follows here. The empirically corrected DFT methods PBEh-3c, B97-3c, and r 2 SCAN-3c perform well with RMSDs of 0.18, 0.11, and 0.09 Å, respectively.…”
Section: Cluster Geometriessupporting
confidence: 93%
“…In our recent Clusterome paper, 123 we presented a large (∼250k), multiacid–multibase, atmospherically relevant, molecular cluster database (available in the ACDB 2.0 repository, see the Supporting Information ). Here, we extract only ∼32k structures of the (H 2 SO 4 –SA) 0–2 (bases) 0–2 clusters (termed Clusteromics I), where bases correspond to ammonia (AM), methylamine (MA), dimethylamine (DMA), trimethylamine (TMA), and ethylenediamine (EDA).…”
Section: Application and Discussionmentioning
confidence: 99%