Zhao-Min Chen scite author profile

The task of multi-label image recognition is to predict a set of object labels that present in an image. As objects normally co-occur in an image, it is desirable to model the label dependencies to improve the recognition performance. To capture and explore such important dependencies, we propose a multi-label classification model based on Graph Convolutional Network (GCN). The model builds a directed graph over the object labels, where each node (label) is represented by word embeddings of a label, and GCN is learned to map this label graph into a set of inter-dependent object classifiers. These classifiers are applied to the image descriptors extracted by another sub-net, enabling the whole network to be end-to-end trainable. Furthermore, we propose a novel re-weighted scheme to create an effective label correlation matrix to guide information propagation among the nodes in GCN. Experiments on two multi-label image recognition datasets show that our approach obviously outperforms other existing state-of-the-art methods. In addition, visualization analyses reveal that the classifiers learned by our model maintain meaningful semantic topology.

show abstract

Structure-aware human pose estimation with graph convolutional networks

Bin

Chen

Wei³

et al. 2020

Pattern Recognition

View full text Add to dashboard Cite

Hierarchical Context Embedding for Region-Based Object Detection

Chen

Jin²,

Zhao³

et al. 2020

View full text Add to dashboard Cite

Disentangling, Embedding and Ranking Label Cues for Multi-Label Image Recognition

Chen

Cui

Wei

et al. 2021

IEEE Trans. Multimedia

View full text Add to dashboard Cite

Multi-Label Image Recognition with Joint Class-Aware Map Disentangling and Label Correlation Embedding

Chen

Wei²,

Jin³

et al. 2019

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Zhao-Min Chen

Multi-Label Image Recognition With Graph Convolutional Networks

Structure-aware human pose estimation with graph convolutional networks

Hierarchical Context Embedding for Region-Based Object Detection

Disentangling, Embedding and Ranking Label Cues for Multi-Label Image Recognition

Multi-Label Image Recognition with Joint Class-Aware Map Disentangling and Label Correlation Embedding

Contact Info

Product

Resources

About