2020
DOI: 10.26434/chemrxiv.12111060.v3
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Machine Learning Enables Highly Accurate Predictions of Photophysical Properties of Organic Fluorescent Materials: Emission Wavelengths and Quantum Yields

Abstract: <div> <p>The development of functional organic fluorescent materials calls for fast and accurate predictions of photophysical parameters for processes such as high-throughput virtual screening, while the task is challenged by the limitations of quantum mechanical calculations. We establish a database covering >4,300 solvated organic fluorescent dyes and develop new machine learning (ML) approach aimed at efficient and accurate predictions of emission wavelength and photoluminescence quantum yie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
40
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 18 publications
(40 citation statements)
references
References 74 publications
(86 reference statements)
0
40
0
Order By: Relevance
“…We identified two transcription errors by Ju et al in the CGSD solvent descriptors (available from https://figshare.com/articles/dataset/ChemFluor/12110619/3) used to train their GBRT models. 16 • In row 22 of Solvent Descriptors.xlsx, the E T (30) value for 1-methyl-2-pyrrolidinone should be 42.2 rather than 48 according to entry no. 284 of Table 2 of Reichardt's work.…”
Section: Author Contributionsmentioning
confidence: 99%
See 3 more Smart Citations
“…We identified two transcription errors by Ju et al in the CGSD solvent descriptors (available from https://figshare.com/articles/dataset/ChemFluor/12110619/3) used to train their GBRT models. 16 • In row 22 of Solvent Descriptors.xlsx, the E T (30) value for 1-methyl-2-pyrrolidinone should be 42.2 rather than 48 according to entry no. 284 of Table 2 of Reichardt's work.…”
Section: Author Contributionsmentioning
confidence: 99%
“…Entries Dye Solvent Absorption Emission Other ChemDataExtractor 14 8,467 SMILES Name λ max , max --ChemFluor 16 4,386 SMILES Name Among the previous studies on predicting absorption peak wavelengths or excitation energies, the work of Ju et al, 16 Kang et al, 25 and Joung et al 26 is particularly noteworthy because of the size of their training datasets and the accuracies this enabled them to achieve. Although these recent works achieved impressive accuracies, their reported performance may be more representative of how they would perform in substituent-selection applications as opposed to de novo design tasks with unseen chemistries.…”
Section: Datasetmentioning
confidence: 99%
See 2 more Smart Citations
“…For our database we needed molecules that possess rich spectroscopic and photochemical properties but are notorious in theoretical studies due to SIE. We selected 3,926 such species from existing databases, including 1,941 solar cell materials from the Harvard Clean Energy Project (CEP), 101,102 904 pharmaceutically significant compounds from the DeepChem database, 103 431 fluorescence species from the ChemFluor database, 104 337 organic photovoltaic (OPV) molecules from the Harvard Organic Photovoltaic Dataset (HOPV15), 105 84 organic light-emitting diode (OLED) materials studied by Aspuru-Guzik and coworkers, 106 and 229 oligomers added by us in the present work. These compounds were randomly distributed into a training set of 1,970 and a test set of 1,956, and their structures were provided as simplified molecular-input line-entry system (SMILES) strings in the Supporting Information (SI).…”
mentioning
confidence: 99%