2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2022
DOI: 10.1109/cvprw56347.2022.00246
|View full text |Cite
|
Sign up to set email alerts
|

Dual-Branch Collaborative Transformer for Virtual Try-On

Abstract: Image-based virtual try-on has recently gained a lot of attention in both the scientific and fashion industry communities due to its challenging setting and practical real-world applications. While pure convolutional approaches have been explored to solve the task, Transformer-based architectures have not received significant attention yet. Following the intuition that self-and cross-attention operators can deal with long-range dependencies and hence improve the generation, in this paper we extend a Transforme… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 15 publications
0
3
0
Order By: Relevance
“…A neural network was also used to learn the transformation parameters of TPS in CP-VITON [81]. Later, Fenocchi et al [82] introduced self-and cross-attention operations to the warping module. They aligned the refined representation of a person and an in-shop garment using two-branch cross-modal attention blocks.…”
Section: Virtual Try-onmentioning
confidence: 99%
“…A neural network was also used to learn the transformation parameters of TPS in CP-VITON [81]. Later, Fenocchi et al [82] introduced self-and cross-attention operations to the warping module. They aligned the refined representation of a person and an in-shop garment using two-branch cross-modal attention blocks.…”
Section: Virtual Try-onmentioning
confidence: 99%
“…For instance, context-driven virtual try-on network (C-VTON) ( 38 ) employs discriminators specific to different types of contextual information, allowing enhanced clothing synthesis quality. The dual-branch collaborative transformer (DBCT) ( 39 ) utilizes a transformer-based architecture and cross-modal information to improve the virtual try-on process. To address the alignment of target garments to corresponding body parts, a novel global appearance flow estimation model has been proposed, which warps clothing spatially ( 40 ).…”
Section: Related Workmentioning
confidence: 99%
“…Virtual Try-on. Virtual try-on approaches can be categorized into 3D-based [7,21,30,38] and image-based methods [9,17,22,23,28,29,32,40,41,51]. Imagebased methods are more promising because of their lightweight nature and the ability to generate reasonable results using large-scale try-on datasets.…”
Section: Related Workmentioning
confidence: 99%