2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6639275
|View full text |Cite
|
Sign up to set email alerts
|

Open vocabulary handwriting recognition using combined word-level and character-level language models

Abstract: In this paper, we present a unified search strategy for open vocabulary handwriting recognition using weighted finite state transducers. Additionally to a standard word-level language model we introduce a separate n-gram character-level language model for out-of-vocabulary word detection and recognition. The probabilities assigned by those two models are combined into one Bayes decision rule. We evaluate the proposed method on the IAM database of English handwriting. An improvement from 22.2% word error rate t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 27 publications
(11 citation statements)
references
References 21 publications
0
11
0
Order By: Relevance
“…Another approach for coping with OOV words consists of modeling text at a sub-word level, as a sequence of characters, syllables or multi-grams [15]. Hybrid approaches [16,17] consist of using word-based language models for the most frequent words and character-based models for the less frequent ones. In sub-word approaches, the dictionary is considerably reduced to the number of lexical units, as well as the computational complexity.…”
Section: Introductionmentioning
confidence: 99%
“…Another approach for coping with OOV words consists of modeling text at a sub-word level, as a sequence of characters, syllables or multi-grams [15]. Hybrid approaches [16,17] consist of using word-based language models for the most frequent words and character-based models for the less frequent ones. In sub-word approaches, the dictionary is considerably reduced to the number of lexical units, as well as the computational complexity.…”
Section: Introductionmentioning
confidence: 99%
“…The state-of-the-art methods in the field of OFHR often applied on unconstrained open vocabulary databases [23]. Kozielski [23] used IAM database for English handwriting, creating language model for the character level and combine it into the word level language model.…”
Section: Comparative Resultsmentioning
confidence: 99%
“…Kozielski [23] used IAM database for English handwriting, creating language model for the character level and combine it into the word level language model. The major drawback for using such open database is that the The first attempt to develop a Malay word was by Wahap [12].…”
Section: Comparative Resultsmentioning
confidence: 99%
“…A further contribution of this paper is the presentation of a working recognition system that can cope with very large vocabularies of several hundred thousand words, which is much more than existing system [7,9], to the knowledge of the authors.…”
Section: Introductionmentioning
confidence: 99%