A sizable body of work on relative attributes provides evidence that relating pairs of images along a continuum of strength for a visual attribute yields improvements in a variety of vision tasks. In this paper, we show how emerging ideas in graph neural networks can yield a solution to various problems that broadly fall under relative attribute learning. Our central observation is that relative attribute learning naturally benefits from exploiting the graph of dependencies among the different relative attributes of images, especially when only a partial ordering is provided at training time. We use message passing to perform end-to-end learning of the image representations, their relationships, and the interplay between different attributes. Our experiments show that this simple framework achieves accuracy competitive with specialized methods for both relative attribute learning and binary attribute prediction, while relaxing the requirements on the training data, the number of parameters, or both.
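To make the message-passing component concrete, the following is a minimal sketch, in PyTorch, of one propagation round over a graph whose nodes are image embeddings and whose directed edges encode pairwise relative-attribute comparisons. The two-layer message network, sum aggregation, and GRU-based node update are illustrative assumptions, not the exact architecture reported in the paper.

    import torch
    import torch.nn as nn

    class RelAttrMessagePassing(nn.Module):
        # One propagation round over a graph of image embeddings.
        # An edge (src, dst) records a relative comparison between two images.
        def __init__(self, dim):
            super().__init__()
            self.msg = nn.Sequential(nn.Linear(2 * dim, dim), nn.ReLU())
            self.upd = nn.GRUCell(dim, dim)  # node update from aggregated messages

        def forward(self, h, edges):
            # h: (N, dim) node states; edges: (E, 2) long tensor of (src, dst) pairs
            src, dst = edges[:, 0], edges[:, 1]
            m = self.msg(torch.cat([h[src], h[dst]], dim=-1))  # per-edge message
            agg = torch.zeros_like(h).index_add_(0, dst, m)    # sum into receivers
            return self.upd(agg, h)

    # Toy usage: 4 images, 3 pairwise comparisons for one attribute.
    h = torch.randn(4, 16)
    edges = torch.tensor([[0, 1], [1, 2], [3, 2]])
    h = RelAttrMessagePassing(16)(h, edges)

Stacking several such rounds lets information about one attribute of one image influence the representation of related images and attributes, which is the interplay the abstract refers to.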
We introduce a unified framework to jointly model images, text, and human attention traces. Our work builds on the recent Localized Narratives annotation framework [30], in which each word of a given caption is paired with a mouse trace segment. We propose two novel tasks: (1) predict a trace given an image and caption (i.e., visual grounding), and (2) predict a caption and a trace given only an image. Learning the grounding of each word is challenging due to noise in the human-provided traces and the presence of words that cannot be meaningfully visually grounded. We present a novel model architecture that is jointly trained on dual tasks (controlled trace generation and controlled caption generation). To evaluate the quality of the generated traces, we propose a local bipartite matching (LBM) distance metric that allows the comparison of two traces of different lengths. Extensive experiments show that our model is robust to the imperfect training data and outperforms the baselines by a clear margin. Moreover, we demonstrate that our model, pre-trained on the proposed tasks, also benefits the downstream task of COCO's guided image captioning. Our code and project page are publicly available.
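As a rough illustration of how a matching-based distance can compare traces of unequal length, the sketch below matches trace points by minimum-cost bipartite assignment restricted to a local temporal window. The normalized timestamps, the window of 0.2, and the mean-cost reduction are our own assumptions; the abstract does not specify the exact LBM formulation.

    import numpy as np
    from scipy.optimize import linear_sum_assignment

    def lbm_distance(a, b, window=0.2):
        # Match points of two traces one-to-one by minimum total Euclidean
        # cost, but only between points whose normalized timestamps lie
        # within `window` of each other (the "local" constraint).
        a, b = np.asarray(a, float), np.asarray(b, float)
        ta = np.linspace(0.0, 1.0, len(a))  # normalized time along trace a
        tb = np.linspace(0.0, 1.0, len(b))  # normalized time along trace b
        cost = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
        cost[np.abs(ta[:, None] - tb[None, :]) > window] = 1e6  # forbid distant pairs
        rows, cols = linear_sum_assignment(cost)  # handles unequal lengths
        return cost[rows, cols].mean()

    # Toy usage: two mouse traces of different lengths in the unit square.
    print(lbm_distance(np.random.rand(50, 2), np.random.rand(70, 2)))

The local window keeps the matching temporally faithful: an early point in one trace cannot be paired with a late point in the other, even if they happen to be spatially close.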