Proceedings of the 2017 ACM Symposium on Document Engineering 2017
DOI: 10.1145/3103010.3121041

Detecting In-line Mathematical Expressions in Scientific Documents

Citations: cited by 21 publications (26 citation statements)
References: 4 publications
“…After the word extraction process, word features and conditional random field (CRF) are used for inline expression detection. The achieved accuracy in detection is 88.95% on PDF files from the ACL Anthology dataset [31] but there are still many errors in the detection of variables reported in the research.…”
Section: Mathematical Expression Detection in Native PDF Documents (mentioning)
confidence: 95%
“…In recent years, several researches [21], [31], [32] have focused on the detection of mathematical expressions in PDF documents. For PDF documents, metadata information of textual words such as font, size, styles can be extracted precisely.…”
Section: Mathematical Expression Detection in Native PDF Documents (mentioning)
confidence: 99%
“…Document image processing is an interesting topic among the computer vision research community. Significant progress has been made in this domain, including heuristic-based, convolutional neural network (CNN) based, statistics-based-like conditional random fields (CRFs) and graph trees, and/or a combination of these methods [7,8,14-16]. Heuristics include color-based features, shape-based features, geometric features, and keypoint descriptors.…”
Section: Related Work (mentioning)
confidence: 99%
“…Iwatsuki et al [14] presented a CRF based method to extract formulas and mathematical zones from PDF documents. Their method uses layout features like font, style, and linguistic features such as n-gram context to build their CRF model.…”
Section: Related Work (mentioning)
confidence: 99%
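
The citation statements above describe the cited method as a CRF over words that combines layout features (font, style) with linguistic context features. As a rough illustration only, the sketch below shows how such per-word feature dictionaries might be assembled before being passed to a CRF toolkit. The token fields (`text`, `font`, `italic`), the feature names, and the one-word context window are all assumptions made for this example; they are not taken from the cited paper.

```python
from typing import Dict, List

# Hypothetical token record: the keys "text", "font", and "italic" are
# assumptions for illustration, not fields defined by the cited paper.
Token = Dict[str, object]


def word_features(tokens: List[Token], i: int) -> Dict[str, object]:
    """Build a feature dict for the i-th word from layout and context cues."""
    tok = tokens[i]
    text = str(tok["text"])
    feats: Dict[str, object] = {
        # Surface features of the word itself.
        "word.lower": text.lower(),
        "word.is_single_char": len(text) == 1,
        "word.has_digit": any(c.isdigit() for c in text),
        # Layout features read from the PDF metadata.
        "layout.font": tok["font"],
        "layout.italic": bool(tok["italic"]),
    }
    # Neighbouring words supply the context (here a window of one word).
    if i > 0:
        prev = tokens[i - 1]
        feats["prev.lower"] = str(prev["text"]).lower()
        feats["prev.font_differs"] = prev["font"] != tok["font"]
    else:
        feats["BOS"] = True  # beginning of sequence
    if i < len(tokens) - 1:
        nxt = tokens[i + 1]
        feats["next.lower"] = str(nxt["text"]).lower()
        feats["next.font_differs"] = nxt["font"] != tok["font"]
    else:
        feats["EOS"] = True  # end of sequence
    return feats


def sentence_features(tokens: List[Token]) -> List[Dict[str, object]]:
    """Feature dicts for every word in one sentence, in order."""
    return [word_features(tokens, i) for i in range(len(tokens))]
```

For a sentence such as "where x is", with "x" set in an italic math font, the middle word would carry `layout.italic` and `prev.font_differs` set to true, which is the kind of signal a sequence labeller could use to tag it as part of an in-line expression.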