Abhinav Moudgil scite author profile

Abhinav Moudgil

5Publications

130Citation Statements Received

105Citation Statements Given

How they've been cited

130

How they cite others

105

Affiliations

International Institute of Information Technology, Hyderabad, Georgia Institute of Technology, Indian Institute of Technology Hyderabad

Publications

Order By: Most citations

Long-Term Visual Object Tracking Benchmark

Moudgil

Gandhi

2019

View full text Add to dashboard Cite

We propose a new long video dataset 1 (called Track Long and Prosper -TLP) and benchmark for single object tracking. The dataset consists of 50 HD videos from real world scenarios, encompassing a duration of over 400 minutes (676K frames), making it more than 20 folds larger in average duration per sequence and more than 8 folds larger in terms of total covered duration, as compared to existing generic datasets for visual tracking. The proposed dataset paves a way to suitably assess long term tracking performance and train better deep learning architectures (avoiding/reducing augmentation, which may not reflect real world behaviour). We benchmark the dataset on 17 state of the art trackers and rank them according to tracking accuracy and run time speeds. We further present thorough qualitative and quantitative evaluation highlighting the importance of long term aspect of tracking. Our most interesting observations are (a) existing short sequence benchmarks fail to bring out the inherent differences in tracking algorithms which widen up while tracking on long sequences and (b) the accuracy of trackers abruptly drops on challenging long sequences, suggesting the potential need of research efforts in the direction of long-term tracking.

show abstract

Long-Term Visual Object Tracking Benchmark

Moudgil¹,

Gandhi²

2017

Preprint

View full text Add to dashboard Cite

Contrast and Classify: Training Robust VQA Models

Kant

Moudgil

Batra

et al. 2021

View full text Add to dashboard Cite

Towards Scaling Difference Target Propagation by Learning Backprop Targets

Ernoult¹,

Fabrice²,

Moudgil³

et al. 2022

Preprint

View full text Add to dashboard Cite

Contrast and Classify: Training Robust VQA Models

Kant¹,

Moudgil²,

Batra³

et al. 2020

Preprint

View full text Add to dashboard Cite

Recent Visual Question Answering (VQA) models have shown impressive performance on the VQA benchmark but remain sensitive to small linguistic variations in input questions. Existing approaches address this by augmenting the dataset with question paraphrases from visual question generation models or adversarial perturbations. These approaches use the combined data to learn an answer classifier by minimizing the standard cross-entropy loss. To more effectively leverage the augmented data, we build on the recent success in contrastive learning. We propose a novel training paradigm (ConCAT) that alternately optimizes cross-entropy and contrastive losses. The contrastive loss encourages representations to be robust to linguistic variations in questions while the cross-entropy loss preserves the discriminative power of the representations for answer classification. We find that alternately optimizing both losses is key to effective training. VQA models trained with ConCAT achieve higher consensus scores on the VQA-Rephrasings dataset as well as higher VQA accuracy on the VQA 2.0 dataset compared to existing approaches across a variety of data augmentation strategies.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Abhinav Moudgil

Long-Term Visual Object Tracking Benchmark

Long-Term Visual Object Tracking Benchmark

Contrast and Classify: Training Robust VQA Models

Towards Scaling Difference Target Propagation by Learning Backprop Targets

Contrast and Classify: Training Robust VQA Models

Contact Info

Product

Resources

About