Proceedings of the 26th ACM International Conference on Multimedia 2018
DOI: 10.1145/3240508.3240652
|View full text |Cite
|
Sign up to set email alerts
|

ModaNet

Abstract: Understanding clothes from a single image would have huge commercial and cultural impacts on modern societies. However, this task remains a challenging computer vision problem due to wide variations in the appearance, style, brand and layering of clothing items. We present a new database called "ModaNet", a large-scale collection of images based on Paperdoll dataset [40]. Our dataset provides 55, 176 street images, fully annotated with polygons on top of the 1 million weakly annotated street images in Paperdol… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 91 publications
(10 citation statements)
references
References 43 publications
(63 reference statements)
0
10
0
Order By: Relevance
“…Deep learning methods can currently solve various problems in the domain of fashion and style such as human pose estimation, body parts segmentation [2], clothing items detection [3], [4] and semantic segmentation [5].…”
Section: Related Workmentioning
confidence: 99%
“…Deep learning methods can currently solve various problems in the domain of fashion and style such as human pose estimation, body parts segmentation [2], clothing items detection [3], [4] and semantic segmentation [5].…”
Section: Related Workmentioning
confidence: 99%
“…These defects also lead to the low accuracy of image retrieval and limit the application scope of global descriptor algorithm. Just at this time, image retrieval algorithm based on local features brings the dawn to solve this problem [7]. The figure 3…”
Section: Content-based Image Retrievalmentioning
confidence: 99%
“…We use ModaNet dataset 3 in our experiments, which comprises 52,377 fully annotated images for training as well as 13 meta categories including footwear, sunglasses, pants, etc. In detail, we use the first 4,000 images for the testing and the remaining part for training.…”
Section: Experiments 41 Dataset and Evaluation Metricsmentioning
confidence: 99%