Davide Morelli scite author profile

Research related to fashion and e-commerce domains is gaining attention in computer vision and multimedia communities. Following this trend, this article tackles the task of generating fine-grained and accurate natural language descriptions of fashion items, a recently-proposed and under-explored challenge that is still far from being solved. To overcome the limitations of previous approaches, a transformer-based captioning model was designed with the integration of external textual memory that could be accessed through k-nearest neighbor (kNN) searches. From an architectural point of view, the proposed transformer model can read and retrieve items from the external memory through cross-attention operations, and tune the flow of information coming from the external memory thanks to a novel fully attentive gate. Experimental analyses were carried out on the fashion captioning dataset (FACAD) for fashion image captioning, which contains more than 130k fine-grained descriptions, validating the effectiveness of the proposed approach and the proposed architectural strategies in comparison with carefully designed baselines and state-of-the-art approaches. The presented method constantly outperforms all compared approaches, demonstrating its effectiveness for fashion image captioning.

show abstract

Dual-Branch Collaborative Transformer for Virtual Try-On

Fenocchi

Morelli

Cornia

et al. 2022

View full text Add to dashboard Cite

Image-based virtual try-on has recently gained a lot of attention in both the scientific and fashion industry communities due to its challenging setting and practical real-world applications. While pure convolutional approaches have been explored to solve the task, Transformer-based architectures have not received significant attention yet. Following the intuition that self-and cross-attention operators can deal with long-range dependencies and hence improve the generation, in this paper we extend a Transformer-based virtual try-on model by adding a dual-branch collaborative module that can exploit cross-modal information at generation time. We perform experiments on the VITON dataset, which is the standard benchmark for the task, and on a recently collected virtual try-on dataset with multi-category clothing, Dress Code. Experimental results demonstrate the effectiveness of our solution over previous methods and show that Transformer-based architectures can be a viable alternative for virtual try-on.

show abstract

A Game-theoretical Design Technique for Multi-stage Supply Chains under Uncertainty

Cavone

Dotoli

Morelli

et al. 2018

View full text Add to dashboard Cite

A Convolution Residual Network for Heating-Invariant Defect Segmentation in Composite Materials Inspected by Lock-in Thermography

Morelli

Marani²,

D’Accardi

et al. 2021

IEEE Trans. Instrum. Meas.

View full text Add to dashboard Cite

Defect detection by a deep learning approach with active IR thermography

Guaragnella

Morelli

D’Orazio³

et al. 2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Davide Morelli

Dress Code: High-Resolution Multi-Category Virtual Try-On

Dress Code: High-Resolution Multi-category Virtual Try-On

Design of Modern Supply Chain Networks Using Fuzzy Bargaining Game and Data Envelopment Analysis

Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates

Dual-Branch Collaborative Transformer for Virtual Try-On

A Game-theoretical Design Technique for Multi-stage Supply Chains under Uncertainty

A Convolution Residual Network for Heating-Invariant Defect Segmentation in Composite Materials Inspected by Lock-in Thermography

Defect detection by a deep learning approach with active IR thermography

Contact Info

Product

Resources

About