2020
DOI: 10.1021/acs.chemmater.0c02553
|View full text |Cite
|
Sign up to set email alerts
|

Similarity of Precursors in Solid-State Synthesis as Text-Mined from Scientific Literature

Abstract: Collecting and analyzing the vast amount of information available in the solid-state chemistry literature may accelerate our understanding of materials synthesis. However, one major problem is the difficulty of identifying which materials from a synthesis paragraph are precursors or are target materials. In this study, we developed a two-step Chemical Named Entity Recognition (CNER) model to identify precursors and targets, based on information from the context around material entities. Using the extracted dat… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
69
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6

Relationship

2
4

Authors

Journals

citations
Cited by 60 publications
(69 citation statements)
references
References 63 publications
0
69
0
Order By: Relevance
“…This significantly modifies the meaning of the tokens and usually results in lowered accuracy of the named entity recognition (see below). Currently, this problem is solved case-by-case by creating task-specific wrappers for existing tokenizers and named entity recognition models ( Huang and Ling, 2019 ; Alperin et al., 2016 ; He et al., 2020 ). Building a robust approach for chemistry-specific sentence tokenization and data extraction requires a thorough development of standard nomenclature for complex chemical terms and materials names.…”
Section: Text Mining Of Scientific Literaturementioning
confidence: 99%
See 4 more Smart Citations
“…This significantly modifies the meaning of the tokens and usually results in lowered accuracy of the named entity recognition (see below). Currently, this problem is solved case-by-case by creating task-specific wrappers for existing tokenizers and named entity recognition models ( Huang and Ling, 2019 ; Alperin et al., 2016 ; He et al., 2020 ). Building a robust approach for chemistry-specific sentence tokenization and data extraction requires a thorough development of standard nomenclature for complex chemical terms and materials names.…”
Section: Text Mining Of Scientific Literaturementioning
confidence: 99%
“…The early applications of chemical NER were mainly focused on extraction of drugs and biochemical information to perform more effective document searches ( Corbett and Copestake, 2008 ; Jessop et al., 2011 ; Rocktäschel et al., 2012 ; García-Remesal et al, 2013 ). Recently, chemical NER has shifted toward (in)organic materials and their characteristics ( Swain and Cole, 2016 ; He et al., 2020 ; Weston et al., 2019 ; Shah et al., 2018 ), polymers ( Tchoua et al., 2019 ), nanoparticles ( Hiszpanski et al., 2020 ), synthesis actions and conditions ( Vaucher et al., 2020 ; Hawizy et al., 2011 ; Kim et al., 2017c ; Kononova et al., 2019 ). The methods used for NER vary from traditional rule-based and dictionary look-up approaches to modern methodology built around advanced ML and NLP techniques, including conditional random field (CRF) ( Lafferty et al., 2001 ), long short-term memory (LSTM) neural networks ( Hochreiter and Schmidhuber, 1997 ), and others.…”
Section: Text Mining Of Scientific Literaturementioning
confidence: 99%
See 3 more Smart Citations