2011 International Conference on Document Analysis and Recognition 2011
DOI: 10.1109/icdar.2011.97
|View full text |Cite
|
Sign up to set email alerts
|

HAMEX - A Handwritten and Audio Dataset of Mathematical Expressions

Abstract: In this paper, we present HAMEX, a new public dataset that contains mathematical expressions available in their on-line handwritten form and in their audio spoken form. We have designed this dataset so that, given a mathematical expression, its handwritten signal and its audio signal can be used jointly to design multimodal recognition systems. Here, we describe the different steps that allowed us to acquire this dataset, from the creation of the mathematical expression corpora (including expressions from Wiki… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0
2

Year Published

2013
2013
2023
2023

Publication Types

Select...
3
3
2

Relationship

3
5

Authors

Journals

citations
Cited by 25 publications
(15 citation statements)
references
References 6 publications
0
13
0
2
Order By: Relevance
“…Some of the combination techniques proposed in our research are adaptations of well known methods such as weighted combination, Borda count [54], and other decision combination techniques [18]. In the context of handwriting and speech recognition, classifier combination techniques have also been used to improve the recognition accuracy of handwriting recognizers [14,41,48,59] as well as speech recognizers [12].…”
Section: Classifier Combinationmentioning
confidence: 99%
See 1 more Smart Citation
“…Some of the combination techniques proposed in our research are adaptations of well known methods such as weighted combination, Borda count [54], and other decision combination techniques [18]. In the context of handwriting and speech recognition, classifier combination techniques have also been used to improve the recognition accuracy of handwriting recognizers [14,41,48,59] as well as speech recognizers [12].…”
Section: Classifier Combinationmentioning
confidence: 99%
“…However, issues such as ambiguity detection and A/V synchronization are not considered in the aforementioned research. Another relevant research effort, closely tied to [33], relates to the creation of a data set with handwritten and spoken mathematical content [41]. Unfortunately, this data set consists of static image segments containing handwritten content, with the corresponding audio stored in separate files.…”
Section: Audio-video Based Content Recognitionmentioning
confidence: 99%
“…Training and test data for CROHME 2012 along with related tools were available from the International Association of Pattern Recognition (IAPR) 4 . For Part 4, the training data includes thousands of expressions from existing handwritten expression datasets, including (i) MathBrush (University of Waterloo) [18], (ii) HAMEX (University of Nantes) [19], (iii) MfrDB (Czech Technical University) [5], (iv) ExpressMatch (University of Sao Paulo) [20] and (v) the KAIST dataset. Due to differences in legal symbols and layouts, not all expressions in these data sets were consistent with the Part 4 grammar.…”
Section: A Datasets and Expression Encodingsmentioning
confidence: 99%
“…The presence of several accessible corpora for the recognition enable this domain and it is useful for many fields, such as the field of Latin mathematical formula recognition. This field presents a datasets that facilitates the progress of this domain like the HAMEX [4]. The HAMEX is a public dataset that contains mathematical expressions in their handwritten form and in their audio spoken form.…”
Section: Introductionmentioning
confidence: 99%