2017
DOI: 10.1021/acs.chemrev.6b00851
|View full text |Cite
|
Sign up to set email alerts
|

Information Retrieval and Text Mining Technologies for Chemistry

Abstract: Efficient access to chemical information contained in scientific literature, patents, technical reports, or the web is a pressing need shared by researchers and patent attorneys from different chemical disciplines. Retrieval of important chemical information in most cases starts with finding relevant documents for a particular chemical compound or family. Targeted retrieval of chemical documents is closely connected to the automatic recognition of chemical entities in the text, which commonly involves the extr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
192
0
9

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
1
1

Relationship

2
7

Authors

Journals

citations
Cited by 245 publications
(202 citation statements)
references
References 611 publications
0
192
0
9
Order By: Relevance
“…Furthermore, the vast majority of databases are commercial products requiring a license, and programmatic application programming interfaces (APIs) for large‐scale data access are rarely implemented. A large fraction of experimental data are only available in journal publications, though recent successes in text mining offer a potential solution to this conundrum . Finally, major efforts are underway in high‐throughput/combinatorial experiments that can generate large experimental materials database with diverse properties …”
Section: Data Collectionmentioning
confidence: 99%
See 1 more Smart Citation
“…Furthermore, the vast majority of databases are commercial products requiring a license, and programmatic application programming interfaces (APIs) for large‐scale data access are rarely implemented. A large fraction of experimental data are only available in journal publications, though recent successes in text mining offer a potential solution to this conundrum . Finally, major efforts are underway in high‐throughput/combinatorial experiments that can generate large experimental materials database with diverse properties …”
Section: Data Collectionmentioning
confidence: 99%
“…A large fraction of experimental data are only available in journal publications, though recent successes in text mining offer a potential solution to this conundrum. [48][49][50][51][52][53] Finally, major efforts are underway in high-throughput/combinatorial experiments that can generate large experimental materials database with diverse properties. [31] It is therefore not surprising that many ML works have turned to computed data sources.…”
Section: Data Collectionmentioning
confidence: 99%
“…With recent advances in computer science, theoretical modelling, and experimental instrumentation, materials scientists have in many ways created a "mechanical Brahe" and marched into a new era of big data. Datasets of materials information, obtained from advanced characterization techniques, 1-3 combinatorial experiments, [4][5][6] high-throughput first-principles simulations, 7,8 literature mining, 9,10 and other techniques, are created at a faster rate every day with less and less human labor. All of this data enables new opportunities to construct novel laws of phenomenological behavior for systems that previously lacked them.…”
mentioning
confidence: 99%
“…We used the ChemProt corpus provided by the task organizer, 26 which consists of 4966 PubMed abstracts and 126,457 annotated chemical and protein entities. These relations annotated 10 chemical-protein relationships.…”
Section: Datasetmentioning
confidence: 99%