Proceedings of the 1st International Workshop on coMics ANalysis, Processing and Understanding 2016
DOI: 10.1145/3011549.3011551

Manga109 dataset and creation of metadata

Abstract: We have created Manga109, a dataset of a variety of 109 Japanese comic books publicly available for use for academic purposes. This dataset provides numerous comic images but lacks the annotations of elements in the comics that are necessary for use by machine learning algorithms or evaluation of methods. In this paper, we present our ongoing project to build metadata for Manga109. We first define the metadata in terms of frames, texts and characters. We then present our web-based software for efficiently crea…

Cited by 113 publications (66 citation statements)
References: 3 publications

“…The network is trained on 800 training images and 5 validation images in the process. For testing we have used five standard benchmark datasets: Set5 [30], Set14 [31], B100 [32] [33], Urban100 [34] and Manga109 [35]. Set5 and Set14 has random images from animals to human faces.…”
Section: Prototyping (mentioning)
confidence: 99%
“…The images were originally acquired for a project on generic shape matching and recognition. 18 Manga109 [82] A publicly available dataset of 109 Japanese comic books with numerous comic sketches.…”
Section: Live [81] (mentioning)
confidence: 99%
“…For example, given an album A, we use all the annotated images of this album from the eBDtheque dataset as testing set and collect other unannotated images of this album from other sources to build the training set. We selected the eBDtheque dataset because it provides text transcription for all images and from diverse writing styles (more representative than Manga109 dataset [3]). This dataset is composed by one hundred images containing 4691 annotated text lines.…”
Section: Dataset (mentioning)
confidence: 99%