M. Hadi Kiapour scite author profile

In this paper we demonstrate an effective method for parsing clothing in fashion photographs, an extremely challenging problem due to the large number of possible garment items, variations in configuration, garment appearance, layering, and occlusion. In addition, we provide a large novel dataset and tools for labeling garment items, to enable future research on clothing estimation. Finally, we present intriguing initial results on using clothing estimates to improve pose identification, and demonstrate a prototype application for pose-independent visual garment retrieval.

show abstract

Where to Buy It: Matching Street Clothing Photos in Online Shops

Kiapour

et al. 2015

View full text Add to dashboard Cite

Paper Doll Parsing: Retrieving Similar Styles to Parse Clothing Items

Yamaguchi

Kiapour²,

Berg³

2013

253

249

View full text Add to dashboard Cite

Hipster Wars: Discovering Elements of Fashion Styles

et al. 2014

View full text Add to dashboard Cite

Retrieving Similar Styles to Parse Clothing

Yamaguchi

Kiapour

Ortiz

et al. 2015

IEEE Trans. Pattern Anal. Mach. Intell.

137

View full text Add to dashboard Cite

Abstract-Clothing recognition is a societally and commercially important yet extremely challenging problem due to large variations in clothing appearance, layering, style, and body shape and pose. In this paper, we tackle the clothing parsing problem using a retrieval-based approach. For a query image, we find similar styles from a large database of tagged fashion images and use these examples to recognize clothing items in the query. Our approach combines parsing from: pre-trained global clothing models, local clothing models learned on the fly from retrieved examples, and transferred parse-masks (Paper Doll item transfer) from retrieved examples. We evaluate our approach extensively and show significant improvements over previous state-of-the-art for both localization (clothing parsing given weak supervision in the form of tags) and detection (general clothing parsing). Our experimental results also indicate that the general pose estimation problem can benefit from clothing parsing.

show abstract

Conditional Image-Text Embedding Networks

Plummer

Kordas

Kiapour

et al. 2018

View full text Add to dashboard Cite

This paper presents an approach for grounding phrases in images which jointly learns multiple text-conditioned embeddings in a single end-to-end model. In order to differentiate text phrases into semantically distinct subspaces, we propose a concept weight branch that automatically assigns phrases to embeddings, whereas prior works predefine such assignments. Our proposed solution simplifies the representation requirements for individual embeddings and allows the underrepresented concepts to take advantage of the shared representations before feeding them into concept-specific layers. Comprehensive experiments verify the effectiveness of our approach across three phrase grounding datasets, Flickr30K Entities, ReferIt Game, and Visual Genome, where we obtain a (resp.) 4%, 3%, and 4% improvement in grounding performance over a strong region-phrase embedding baseline 1 .

show abstract

Materials discovery: Fine-grained classification of X-ray scattering images

Kiapour

Yager²,

Berg

et al. 2014

View full text Add to dashboard Cite

We explore the use of computer vision methods for organizing, searching, and classifying x-ray scattering images. X-ray scattering is a technique that shines an intense beam of x-rays through a sample of interest. By recording the intensity of x-ray deflection as a function of angle, scientists can measure the structure of materials at the molecular and nano-scale. Current and planned synchrotron instruments are producing x-ray scattering data at an unprecedented rate, making the design of automatic analysis techniques crucial for future research. In this paper, we devise an attribute-based approach to recognition in x-ray scattering images and demonstrate applications to image annotation and retrieval.

show abstract

ModaNet

Zheng

Yang

Kiapour

et al. 2018

View full text Add to dashboard Cite

Understanding clothes from a single image would have huge commercial and cultural impacts on modern societies. However, this task remains a challenging computer vision problem due to wide variations in the appearance, style, brand and layering of clothing items. We present a new database called "ModaNet", a large-scale collection of images based on Paperdoll dataset [40]. Our dataset provides 55, 176 street images, fully annotated with polygons on top of the 1 million weakly annotated street images in Paperdoll. ModaNet aims to provide a technical benchmark to fairly evaluate the progress of applying the latest computer vision techniques that rely on large data for fashion understanding. The rich annotation of the dataset allows to measure the performance of state-of-the-art algorithms for object detection, semantic segmentation and polygon prediction on street fashion images in detail.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

M. Hadi Kiapour

Parsing clothing in fashion photographs

Where to Buy It: Matching Street Clothing Photos in Online Shops

Paper Doll Parsing: Retrieving Similar Styles to Parse Clothing Items

Hipster Wars: Discovering Elements of Fashion Styles

Retrieving Similar Styles to Parse Clothing

Conditional Image-Text Embedding Networks

Materials discovery: Fine-grained classification of X-ray scattering images

ModaNet

Contact Info

Product

Resources

About