2020
DOI: 10.1038/s41597-020-0385-y
|View full text |Cite
|
Sign up to set email alerts
|

Atomic structures and orbital energies of 61,489 crystal-forming organic molecules

Abstract: Data science and machine learning in materials science require large datasets of technologically relevant molecules or materials. Currently, publicly available molecular datasets with realistic molecular geometries and spectral properties are rare. We here supply a diverse benchmark spectroscopy dataset of 61,489 molecules extracted from organic crystals in the Cambridge Structural Database (CSD), denoted OE62. Molecular equilibrium geometries are reported at the Perdew-Burke-Ernzerhof (PBE) level of density f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
97
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
8

Relationship

4
4

Authors

Journals

citations
Cited by 72 publications
(98 citation statements)
references
References 72 publications
1
97
0
Order By: Relevance
“… 149 , 150 , 211 We performed PADF- G 0 W 0 @PBE and PADF- G 0 W 0 @PBE0 calculations for all molecules in the GW100 database 82 as well as PADF- G 0 W 0 @PBE0 212 , 213 calculations for selected molecules from the GW5000 database. 214 For GW100, we used the structures as published in the original work, 82 except for vinyl bromide and phenol for which we used the updated structures. 215 To preclude potential confusion, we emphasize that all results from the other codes we refer to herein have been taken from the literature and have not been calculated by us.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“… 149 , 150 , 211 We performed PADF- G 0 W 0 @PBE and PADF- G 0 W 0 @PBE0 calculations for all molecules in the GW100 database 82 as well as PADF- G 0 W 0 @PBE0 212 , 213 calculations for selected molecules from the GW5000 database. 214 For GW100, we used the structures as published in the original work, 82 except for vinyl bromide and phenol for which we used the updated structures. 215 To preclude potential confusion, we emphasize that all results from the other codes we refer to herein have been taken from the literature and have not been calculated by us.…”
Section: Resultsmentioning
confidence: 99%
“… 82 Using localized basis functions, even on the QZ level, BSEs for HOMO and especially LUMO QP energies can exceed several hundred meV, necessitating a CBS limit extrapolation to obtain very accurate reference values. 46 , 82 , 214 Using localized AOs, one needs to rely on heuristics since the expansion of MOs in this basis does not converge uniformly, unlike expansions in terms of PW 224 or finite elements in RS. 225 HOMO QP energies obtained with these basis set types are generally in good agreement with the original ones by van Setten et al., 82 while differences for unbound LUMO energies often exceed 1 eV.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“… 46 , 47 As molecular properties, both ϵ HOMO and λ h can be determined by efficient first-principles calculations as detailed in Supplementary Note 2 , where the density-functional theory (DFT) B3LYP 48 50 level of theory constitutes a well established accuracy standard 27 , 31 , 39 , 40 , 51 , matching experimental data 44 , 52 . We emphasize though that using the lowest-energy gas-phase conformer for the descriptor calculation disregards packing-effects in the molecular crystal 53 55 and we further discuss the influence of conformers on descriptor values in Supplementary Note 3 .…”
Section: Resultsmentioning
confidence: 99%
“…State-of-the-art methods actually surpass chemical accuracy (ca. 0.05 eV) when applied to standard benchmarks like the QM9 database 34 38 . Similarly, ML models can be applied to conformational space (e.g., when trained on ab initio molecular dynamics trajectories) or even interpolate across chemical and conformational space at the same time 39 41 .…”
Section: Introductionmentioning
confidence: 99%