Vijay Kumar B G scite author profile

To solve deep metric learning problems and produce feature embeddings, current methods commonly use a triplet model to minimise the relative distance between samples from the same class and maximise the relative distance between samples from different classes. Though successful, the training convergence of this triplet model can be compromised by the fact that the vast majority of training samples produce gradients with magnitudes close to zero. This issue has motivated the development of methods that explore the global structure of the embedding, and of other methods that explore hard negative/positive mining. The effectiveness of such mining methods is often associated with intractable computational requirements. In this paper, we propose a novel deep metric learning method that combines the triplet model and the global structure of the embedding space. We rely on a smart mining procedure that produces effective training samples at a low computational cost. In addition, we propose an adaptive controller that automatically adjusts the smart mining hyper-parameters and speeds up the convergence of the training process. We show empirically that our proposed method allows for faster and more accurate training of triplet ConvNets than other competing mining methods. Additionally, we show that our method achieves new state-of-the-art embedding results on the CUB-200-2011 and Cars196 datasets.

DeepSetNet: Predicting Sets with Deep Neural Networks

Rezatofighi, Milan et al. 2017

This paper addresses the task of set prediction using deep learning. This is important because the outputs of many computer vision tasks, including image tagging and object detection, are naturally expressed as sets of entities rather than vectors. As opposed to a vector, the size of a set is not fixed in advance, and it is invariant to the ordering of entities within it. We define a likelihood for a set distribution and learn its parameters using a deep neural network. We also derive a loss for predicting a discrete distribution corresponding to set cardinality. Set prediction is demonstrated on the problem of multi-class image classification. Moreover, we show that the proposed cardinality loss can also trivially be applied to the tasks of object counting and pedestrian detection. Our approach outperforms existing methods in all three cases on standard datasets.

Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimizing Global Loss Functions

Carneiro, Reid 2016

Multi-Scale Regularized Deep Network for Retinal Vessel Segmentation

Cherukuri, Bala et al. 2019

Bayesian Semantic Instance Segmentation in Open Set World

Pham, G, Carneiro et al. 2018

Preprint

Smart Mining for Deep Metric Learning

Harwood, G, Carneiro et al. 2017

Preprint

Exploiting Unlabeled Data with Vision and Language Models for Object Detection

Zhao, Zhang, Schulter et al. 2022

Preprint

DeepSetNet: Predicting Sets with Deep Neural Networks

Rezatofighi, G, Milan et al. 2016

Preprint
