2016
DOI: 10.1007/978-3-319-46604-0_32

Downtown Osaka Scene Text Dataset

Abstract: This article introduces publicly available datasets in scene text detection and recognition. The information is as of 2017.

Cited by 25 publications (16 citation statements)
References 35 publications
“…2 https://catalist-2021.github.io/ ReCTS-25k, CTW, and RRC-LSVT from ICDAR'19 Robust Reading Competition (RRC) [23,33,31,24]. Korean and Japanese scene-text recognition datasets involve KAIST and DOST [9,7]. Different English datasets are listed in the last row of Table 1 [30,28,20,13,16,10,17,27,3,15,14].…”
Section: Related Work (mentioning)
confidence: 99%
“…Arabic datasets like ARASTEC (260 images of signboards, hoardings, and advertisements) and ALIF (7k text images from TV Broadcast) also exist in the scene-text recognition community [29,32]. Korean and Japanese scene-text recognition datasets include KAIST (2,385 images from signboards, book covers, and English and Korean characters) and DOST (32k sequential images) [7,5]. The MLT dataset available from the ICDAR'17 RRC contains 18k scene images (around 1-2k images per language) in Arabic, Bangla, Chinese, English, French, German, Italian, Japanese, and Korean [15].…”
Section: Related Work (mentioning)
confidence: 99%
“…3 2 for the first five languages we discussed in the previous section (we notice that the last two languages also follow the similar trend). On the left, we show the frequency distribution of top-5 n-grams, (n ∈ [1,5]). On the right, we show the frequency distribution of all n-grams with n ∈ [1,5].…”
Section: Datasets and Motivation (mentioning)
confidence: 99%
“…The Street View Text (SVT) dataset [19] was harvested from "Google Street View" images. The Downtown Osaka Scene Text dataset consists of sequential images captured in shopping streets with an omnidirectional camera [20]. Finally, the Synthetic Word Dataset [21] [22] contains 9 million images covering English words and supports tasks in text recognition and segmentation.…”
Section: Previous Work (mentioning)
confidence: 99%