2022
DOI: 10.1186/s13321-022-00642-3
|View full text |Cite
|
Sign up to set email alerts
|

Review of techniques and models used in optical chemical structure recognition in images and scanned documents

Abstract: Extraction of chemical formulas from images was not in the top priority of Computer Vision tasks for a while. The complexity both on the input and prediction sides has made this task challenging for the conventional Artificial Intelligence and Machine Learning problems. A binary input image which might seem trivial for convolutional analysis was not easy to classify, since the provided sample was not representative of the given molecule: to describe the same formula, a variety of graphical representations whic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(13 citation statements)
references
References 42 publications
0
11
0
Order By: Relevance
“…The field of optical chemical structure recognition (OCSR) deals with the translation of images of chemical structures as they are published in the scientific literature into machine-readable representations of the underlying molecular graph [84,85]. In the past two years, a variety of deep learning-based OCSR methods [86][87][88][89] has been published, where DECIMER Image-Transformer [90], Img2Mol [91] and SwinOCSR [92] provide openly available source code and trained models.…”
Section: Extraction Of Chemical Information From the Scientific Liter...mentioning
confidence: 99%
See 1 more Smart Citation
“…The field of optical chemical structure recognition (OCSR) deals with the translation of images of chemical structures as they are published in the scientific literature into machine-readable representations of the underlying molecular graph [84,85]. In the past two years, a variety of deep learning-based OCSR methods [86][87][88][89] has been published, where DECIMER Image-Transformer [90], Img2Mol [91] and SwinOCSR [92] provide openly available source code and trained models.…”
Section: Extraction Of Chemical Information From the Scientific Liter...mentioning
confidence: 99%
“…The newest version of DECIMER was trained on more than 400 Million images using the latest Tensor Processing Units [95] available on the Google cloud platform. Currently, DECIMER performs with an accuracy rate of above 90% and is regarded as an important point of reference for future work [85]. Without open databases like PubChem, where one can download over 100 million chemical structures for free, this would not have been possible.…”
Section: Extraction Of Chemical Information From the Scientific Liter...mentioning
confidence: 99%
“…The field of optical chemical structure recognition (OCSR) deals with the translation of images of chemical structures as they are published in the scientific literature into machine-readable representations of the underlying molecular graph [69,70]. In the past two years, a variety of deep learning-based OCSR methods [71][72][73][74] has been published, where DECIMER Image-Transformer [75], Img2Mol [76] and SwinOCSR [77] provide openly available source code and trained models.…”
Section: Extraction Of Chemical Information From the Scientific Liter...mentioning
confidence: 99%
“…Currently, DECIMER performs with an accuracy rate of above 90%. As an open system, DECIMER is regarded as an important point of reference for future work [70]. Without open databases like PubChem, where one can download over 100 million chemical structures for free, this wouldn't have been possible.…”
Section: Extraction Of Chemical Information From the Scientific Liter...mentioning
confidence: 99%
“…In general, rule-based tools work better with clean images, whereas slight distortions may hinder their performance 15 . In recent years, deep-learning-based OCSR tools have been developed 16,17,18 in conjunction with remarkable advancements in computer vision and natural language processing 19,20 . While several publications have claimed to have developed tools that are capable of recognizing chemical depictions with high accuracy, most of these tools are either proprietary or entirely unavailable 16,[21][22][23] .…”
Section: Introductionmentioning
confidence: 99%