2020
DOI: 10.1021/acs.jcim.0c00740
|View full text |Cite
|
Sign up to set email alerts
|

PubChemQC PM6: Data Sets of 221 Million Molecules with Optimized Molecular Geometries and Electronic Properties

Abstract: We report on optimized molecular geometries and electronic properties calculated by the PM6 method for 94.0% of the 91.6 million molecules cataloged in PubChem Compounds retrieved on August 29, 2016. In addition to neutral states, we also calculated those for cationic, anionic, and spin flipped electronic states of 56.2%, 49.7%, and 41.3% of the molecules, respectively. Thus, the grand total of the PM6 calculations amounted to 221 million. We compared the resulting molecular geometries with B3LYP/6-31G* optimi… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
53
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 55 publications
(56 citation statements)
references
References 63 publications
0
53
0
Order By: Relevance
“…We obtained HOMO (Highest Occupied Molecular Orbital), LUMO (Lowest Unoccupied Molecular Orbital), formation heat, and dipole moment values for each compound via PM7. Because PM6 is as accurate as density functional theory (DFT) calculations [ 22 , 23 ], we assumed that the PM7 calculations exhibit identical acceptable errors as DFT calculations under the B3LYP/6-31G* condition.…”
Section: Methodsmentioning
confidence: 99%
“…We obtained HOMO (Highest Occupied Molecular Orbital), LUMO (Lowest Unoccupied Molecular Orbital), formation heat, and dipole moment values for each compound via PM7. Because PM6 is as accurate as density functional theory (DFT) calculations [ 22 , 23 ], we assumed that the PM7 calculations exhibit identical acceptable errors as DFT calculations under the B3LYP/6-31G* condition.…”
Section: Methodsmentioning
confidence: 99%
“…85,95,98,[102][103][104][105][106][107][108][109][110][111][112][113][114][115] Such studies are also facilitated by the availability of many big databases with these properties. 100,[116][117][118][119][120] One should be aware that orbital energy gaps are a poor approximation for excitation energies. Kohn-Sham energy gaps, for instance, correspond to a zero-order expansion of TD-DFT results.…”
Section: [H2] Reference Datamentioning
confidence: 99%
“…In cases such as calibrating classical or semi-empirical methods to DFT, the size of reference datasets can also become substantially large, making the fits potentially quite predictive. [85][86][87][88] Although the disadvantage of using theory to calibrate theory is clear, its successful applications have allowed it to readily percolate through the field.…”
Section: Computable Properties: Benchmarking and Calibrationmentioning
confidence: 99%