2019
DOI: 10.3390/info10100308
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal Sequential Fashion Attribute Prediction

Abstract: We address multimodal product attribute prediction of fashion items based on product images and titles. The product attributes, such as type, sub-type, cut or fit, are in a chain format, with previous attribute values constraining the values of the next attributes. We propose to address this task with a sequential prediction model that can learn to capture the dependencies between the different attribute values in the chain. Our experiments on three product datasets show that the sequential model outperforms t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 30 publications
0
6
0
Order By: Relevance
“…Visual Attributes. Visual attribute prediction (VAP) is a multi-label classification task that has been widely studied [42,4,2,22,38,23,32,30]. There are two main types of VAP tasks, as illustrated in Figure 1.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Visual Attributes. Visual attribute prediction (VAP) is a multi-label classification task that has been widely studied [42,4,2,22,38,23,32,30]. There are two main types of VAP tasks, as illustrated in Figure 1.…”
Section: Related Workmentioning
confidence: 99%
“…For example, they may only be related to low-level characteristics, such as color, which makes the task more interesting. In the fashion domain, some works [42,23,22,4] have used landmarks for clothing items, pose detection, or textual item descriptions to improve overall accuracy. However, such strong requirements regarding auxiliary information limit such methods to domain-specific solutions.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…However, these traditional machine learning methods cannot take raw features from large, annotated datasets and use them to identify the patterns buried inside them. Deep learning algorithms is a powerful approach for learning complex patterns and has led to multiple performance breakthroughs in many research fields, including computer vision [ 32 , 33 , 34 , 35 , 36 , 37 ] and natural language processing [ 38 , 39 , 40 ]. However, very few prediction models have implemented the concept of deep learning into the sgRNA off-target propensity prediction problem.…”
Section: Introductionmentioning
confidence: 99%