Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries 2018
DOI: 10.1145/3197026.3197058

Improving the Representation and Conversion of Mathematical Formulae by Considering their Textual Context

Abstract: Mathematical formulae represent complex semantic information in a concise form. Especially in Science, Technology, Engineering, and Mathematics, mathematical formulae are crucial to communicate information, e.g., in scientific papers, and to perform computations using computer algebra systems. Enabling computers to access the information encoded in mathematical formulae requires machine-readable formats that can represent both the presentation and content, i.e., the semantics, of formulae. Exchanging such info…

Citations: cited by 30 publications (48 citation statements)
References: 18 publications
“…The challenge of Formula Concept retrieval [18] (a method for MathEL) can roughly be split into the discovery (defining concepts by exploring some instances) of Formula Concepts and their recognition (matching new instances to prior defined concepts represented by name). A Wikidata Entity Linking markup for LaTeX and MathML was introduced and discussed in [20] and [14]. The proposed markup should be used by authors of documents in the STEM disciplines to semantically annotate mathematical content in documents.…”
Section: Document Annotation Recommendation (mentioning)
confidence: 99%
“…There have been efforts to automatically retrieve the semantics of identifiers from the surrounding text [23]. A benchmark, MathMLben [20], was created containing formulae from Wikipedia, the arXiv, and the DLMF, which were augmented by Wikidata markup [14]. Greiner-Petter and Schubotz [5] examine distributions of mathematical notation on two large corpora from the arXiv and zbMATH repositories.…”
Section: Introduction (mentioning)
confidence: 99%
“…NLP techniques often play a critical role in bridging the gap between presentation and semantic representations of math formulae. Recent studies on this topic include variable typing (Stathopoulos et al 2018), using the textual context for transformation from a presentation level to semantic level (Schubotz et al 2018), and identifying declarations of mathematical objects (Lin et al 2019).…”
Section: Math Information Retrieval (mentioning)
confidence: 99%
“…Due to the challenges raised by PDF documents, many approaches use the original source data, such as the original Microsoft Word document or LaTeX code and resources, in order to implement their use cases. For instance, Scharpf et al [12][13][14] focus their research on discovering and recognizing mathematical formulas in scientific publications in order to disambiguate formula identifiers so that mathematical publications become interpretable by computers. Groza et al [15,16], on the other hand, deal with the insertion of metadata at the phase of writing in order to improve the retrievability of scientific publications.…”
Section: Related Workmentioning
confidence: 99%