2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR) 2017
DOI: 10.1109/asar.2017.8067772
|View full text |Cite
|
Sign up to set email alerts
|

Combining deep learning and language modeling for segmentation-free OCR from raw pixels

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 12 publications
(6 citation statements)
references
References 16 publications
0
6
0
Order By: Relevance
“…Another such Arabic OCR model was trained on DARPA corpus using stacked BLSTM which is connected with CTC loss function for predicting Arabic text sequences. Also, a language model was added to the technique at the time of prediction to enhance the output of the actual trained OCR [66].…”
Section: B Deep Learning For Ocrmentioning
confidence: 99%
See 2 more Smart Citations
“…Another such Arabic OCR model was trained on DARPA corpus using stacked BLSTM which is connected with CTC loss function for predicting Arabic text sequences. Also, a language model was added to the technique at the time of prediction to enhance the output of the actual trained OCR [66].…”
Section: B Deep Learning For Ocrmentioning
confidence: 99%
“…A lower value of these measures represents a higher effectiveness of a technique and vice versa. The reason for the choice of these measures stems from the fact that these measures have been widely used to evaluate the effectiveness of OCR and speech recognition systems [61], [66], [59]. These measures are based on the Levenshtein distance [75] which measures similarity between two strings.…”
Section: A Performance Evaluation Measuresmentioning
confidence: 99%
See 1 more Smart Citation
“…While most methods focus on a single language, the multilingual setting was addressed in [18] via a new gated convolutional feature extractor. Note that, while convolutional extractors are the most common ones, fully-connected layers can also be employed, as demonstrated in [19]. In any event, the effectiveness of these techniques has been demonstrated on academic databases only, and these databases consist of standard text lines and paragraphs.…”
Section: A Contributionsmentioning
confidence: 99%
“…For a reasonably well performing OCR model, we choose one that is very similar in structure to [24] but not finetuned like it. The model is segmentation free and the LSTM output does not require processing except for decoding.…”
Section: Modelmentioning
confidence: 99%