2019
DOI: 10.1021/acscentsci.9b00193
|View full text |Cite
|
Sign up to set email alerts
|

A Machine Learning Approach to Zeolite Synthesis Enabled by Automatic Literature Data Extraction

Abstract: Zeolites are porous, aluminosilicate materials with many industrial and “green” applications. Despite their industrial relevance, many aspects of zeolite synthesis remain poorly understood requiring costly trial and error synthesis. In this paper, we create natural language processing techniques and text markup parsing tools to automatically extract synthesis information and trends from zeolite journal articles. We further engineer a data set of germanium-containing zeolites to test the accuracy of the extract… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
167
0
2

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 202 publications
(169 citation statements)
references
References 80 publications
(119 reference statements)
0
167
0
2
Order By: Relevance
“…Furthermore, the vast majority of databases are commercial products requiring a license, and programmatic application programming interfaces (APIs) for large‐scale data access are rarely implemented. A large fraction of experimental data are only available in journal publications, though recent successes in text mining offer a potential solution to this conundrum . Finally, major efforts are underway in high‐throughput/combinatorial experiments that can generate large experimental materials database with diverse properties …”
Section: Data Collectionmentioning
confidence: 99%
“…Furthermore, the vast majority of databases are commercial products requiring a license, and programmatic application programming interfaces (APIs) for large‐scale data access are rarely implemented. A large fraction of experimental data are only available in journal publications, though recent successes in text mining offer a potential solution to this conundrum . Finally, major efforts are underway in high‐throughput/combinatorial experiments that can generate large experimental materials database with diverse properties …”
Section: Data Collectionmentioning
confidence: 99%
“…In contrast, running the advanced methods and models of modern QC requires several days in a supercomputer. ML approaches have already proven successful in different elds of chemistry, [51][52][53] with a strong focus on materials science [54][55][56][57][58][59][60][61] and drug discovery. [62][63][64][65][66][67][68] In other areas, including organic synthesis, [69][70][71][72][73] and theoretical [74][75][76][77][78][79][80][81] and inorganic 82,83 chemistry, the use of ML is rapidly growing.…”
Section: Introductionmentioning
confidence: 99%
“…Leveraging recent advances in natural language processing and text markup parsing tools, we recently developed a tool in collaboration with Olivetti and co-workers to automatically extract synthesis information and trends from zeolite journal articles (see Figure 9). 51 Specifically, our pipeline automatically located, extracted, and organized zeolite synthesis data from both the tables and main text of thousands of articles. We validated the accuracy of the extracted data using a subset of articles related to the preparation of germanium-containing zeolites for which the pipeline accurately identified the complex relationships between the synthesis parameters and resulting topology.…”
Section: 3-literature Data Extractionmentioning
confidence: 99%