2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2022
DOI: 10.1109/wacv51458.2022.00059
|View full text |Cite
|
Sign up to set email alerts
|

MovingFashion: a Benchmark for the Video-to-Shop Challenge

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(6 citation statements)
references
References 10 publications
0
6
0
Order By: Relevance
“…This makes still image animation ambiguous, reducing the informativeness of the video. An alternative Dataset Instance # image/video # class Stanford Online Products [99] Furniture 59,551 11,318 DeepFashion [83] Clothing 239,557 33,881 DeepFashion2 [38] Clothing 224,114 45,417 Street2Shop [42] Clothing >420,357 204,795 MovingFashion [39] Clothing ∼ 15,045 15,045 Table 1. Publicly Available Products Datasets for Evaluating VM Representations solution that can enable the post-production step to enhance the central route of persuasion is to extract selling points [41] from the VMs or descriptions by summarization or image/video captioning techniques, then explicitly present this information as voice-over or captions in the generated videos.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…This makes still image animation ambiguous, reducing the informativeness of the video. An alternative Dataset Instance # image/video # class Stanford Online Products [99] Furniture 59,551 11,318 DeepFashion [83] Clothing 239,557 33,881 DeepFashion2 [38] Clothing 224,114 45,417 Street2Shop [42] Clothing >420,357 204,795 MovingFashion [39] Clothing ∼ 15,045 15,045 Table 1. Publicly Available Products Datasets for Evaluating VM Representations solution that can enable the post-production step to enhance the central route of persuasion is to extract selling points [41] from the VMs or descriptions by summarization or image/video captioning techniques, then explicitly present this information as voice-over or captions in the generated videos.…”
Section: Discussionmentioning
confidence: 99%
“…All of them are E-commerce platforms selling clothing. MovingFashion [39] is a publicly available video dataset for clothing items. Each video is associated with a distinct image of a shop, and is retrieved from Net-A-Porter 7 , Instagram or Tik Tok.…”
Section: Evaluating Vm Representationsmentioning
confidence: 99%
“…When a subject approaches the totem, a small video sequence is captured by the system and the crop of the face is used to identify the person through the reidentification module. We then use the pretrained model introduced in MovingFashion [20] to perform the detection of the top and bottom part of the outfit. The crops of the clothes are fed to a simple convolutional network that extracts a collection of attributes, derived from the DeepFashion dataset [27], which we then use in a naive manner to infer the style of the person's clothing.…”
Section: Body Clothing Analysismentioning
confidence: 99%
“…Instance # image/video # class Stanford Online Products [69] Furniture 59,551 11,318 DeepFashion [151] Clothing 239,557 33,881 DeepFashion2 [152] Clothing 224,114 45,417 Street2Shop [153] Clothing >420,357 204,795 MovingFashion [154] Clothing ⇠ 15,045 15,045…”
Section: Datasetmentioning
confidence: 99%
“…MovingFashion [154] is a publicly available video dataset for clothing items. Each video is associated with a distinct image of a shop, and is retrieved from Net-A-Porter 7 , Instagram or Tik Tok.…”
Section: Datasetsmentioning
confidence: 99%