2016
DOI: 10.1186/s13321-016-0175-x
|View full text |Cite
|
Sign up to set email alerts
|

ChemEngine: harvesting 3D chemical structures of supplementary data from PDF files

Abstract: Digital access to chemical journals resulted in a vast array of molecular information that is now available in the supplementary material files in PDF format. However, extracting this molecular information, generally from a PDF document format is a daunting task. Here we present an approach to harvest 3D molecular data from the supporting information of scientific research articles that are normally available from publisher’s resources. In order to demonstrate the feasibility of extracting truly computable mol… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 25 publications
(23 reference statements)
0
2
0
Order By: Relevance
“…Tools like ChemEngine have been implemented to automatically extract 3D molecular XYZ coordinates and atom information from articles with the aim to directly generate computable molecular structures. 434 This system used pattern recognition and regular expressions to detect molecular coordinates and distinguish it from surround-ing nonmolecular free text. After generating the atom coordinate matrix data from the previously detected molecular coordinates, tools like ChemEngine build molecules using the bond matrix and the atom connectivity.…”
Section: Linking Documents To Structuresmentioning
confidence: 99%
See 1 more Smart Citation
“…Tools like ChemEngine have been implemented to automatically extract 3D molecular XYZ coordinates and atom information from articles with the aim to directly generate computable molecular structures. 434 This system used pattern recognition and regular expressions to detect molecular coordinates and distinguish it from surround-ing nonmolecular free text. After generating the atom coordinate matrix data from the previously detected molecular coordinates, tools like ChemEngine build molecules using the bond matrix and the atom connectivity.…”
Section: Linking Documents To Structuresmentioning
confidence: 99%
“…Authors can also present chemical structural information in documents, especially in case of supporting/Supporting Information of scientific articles, in the form of plain text 3D X, Y, Z atom coordinate values. Tools like ChemEngine have been implemented to automatically extract 3D molecular XYZ coordinates and atom information from articles with the aim to directly generate computable molecular structures . This system used pattern recognition and regular expressions to detect molecular coordinates and distinguish it from surrounding nonmolecular free text.…”
Section: Linking Documents To Structuresmentioning
confidence: 99%