2020
DOI: 10.1609/aaai.v34i07.6906

Mis-Classified Vector Guided Softmax Loss for Face Recognition

Abstract: Face recognition has witnessed significant progress due to advances in deep convolutional neural networks (CNNs), whose central task is to improve feature discrimination. To this end, several margin-based (e.g., angular, additive, and additive angular margin) softmax loss functions have been proposed to increase the feature margin between different classes. However, although great achievements have been made, they mainly suffer from three issues: 1) they ignore the importance of in…
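For readers unfamiliar with this family of losses, the sketch below illustrates the general shape of a margin-based softmax with mis-classified-vector re-weighting, in the spirit of the abstract. The additive angular margin f(m, θ) = cos(θ + m), the hard-negative indicator, the fixed re-weight h(t, I_k) = e^{s·t·I_k}, and the hyperparameter values (s, m, t) are all assumptions for illustration, not the paper's exact recipe.

```python
import numpy as np

def mv_style_softmax_loss(x, W, y, s=32.0, m=0.35, t=0.2):
    """Hedged sketch of a mis-classified-vector-guided margin softmax.

    x : (d,) raw feature, W : (d, K) class weights, y : ground-truth label.
    s (scale), m (angular margin), t (re-weight strength) are assumed values.
    """
    # Normalize feature and weights so every logit is a cosine similarity.
    x = x / np.linalg.norm(x)
    W = W / np.linalg.norm(W, axis=0, keepdims=True)
    cos = W.T @ x                                   # cos(theta_{w_k, x})

    # Additive angular margin on the ground-truth class: f(m, theta_y).
    theta_y = np.arccos(np.clip(cos[y], -1.0, 1.0))
    f_y = np.cos(theta_y + m)

    # A non-target vector counts as "mis-classified" when it beats the
    # margined target score; those hard negatives get weight e^{s*t}.
    hard = (cos > f_y).astype(float)
    hard[y] = 0.0
    weights = np.exp(s * t * hard)                  # 1.0 for easy negatives

    logits = s * cos
    logits[y] = s * f_y
    exp_logits = np.exp(logits - logits.max())      # numerically stabilized

    mask = np.ones_like(cos)
    mask[y] = 0.0
    denom = exp_logits[y] + np.sum(mask * weights * exp_logits)
    return -np.log(exp_logits[y] / denom)

# Toy usage with random data (illustration only).
rng = np.random.default_rng(0)
loss = mv_style_softmax_loss(rng.normal(size=128), rng.normal(size=(128, 10)), y=3)
print(f"loss = {loss:.4f}")
```

Setting t = 0 makes every re-weight equal to 1, which recovers a plain additive-angular-margin softmax.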

Cited by 132 publications (93 citation statements). References 27 publications.
“…It is intuitive that the mis-classified samples contribute more to the improvement of the identity discriminability [16]. Given this, we proposed the additive supervision softmax loss (ASsoftmax) to make full use of the prior knowledge of the misclassified samples.…”
Section: Additive Supervision Softmax Loss (mentioning)
confidence: 99%
“…Moreover, end-to-end losses like TE2E [14] and GE2E [15] have been proposed to train the speaker model in an end-to-end fashion. It is worth noting that the aforementioned loss functions do not pay much attention to hard samples, which are beneficial for learning a distinctive representation [16,17].…”
Section: Introduction (mentioning)
confidence: 99%
“…$\{w_1, \ldots, w_K\}$ and the feature $x$ of the last fully connected layer are usually normalized, and their magnitudes are replaced by a scale parameter $s$ (Wang et al., 2017; Deng et al., 2019; Wang et al., 2019b). In consequence, given an input feature vector $x$ with its ground-truth label $y$, the original softmax loss Eq.…”
Section: Preliminary Knowledge (mentioning)
confidence: 99%
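The quotation breaks off just before the equation it introduces. For reference, the normalized softmax that the cited works (NormFace/CosFace-style formulations) build on takes the form below; this is a reconstruction from that literature, not the quoted paper's own display:

```latex
% Weights and features L2-normalized; magnitudes absorbed into the scale s.
\mathcal{L}_{\mathrm{softmax}} \;=\;
  -\log \frac{e^{\, s\cos(\theta_{w_y,\,x})}}
             {\sum_{k=1}^{K} e^{\, s\cos(\theta_{w_k,\,x})}}
```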
“…where $\cos(\theta_{w_k,x}) = w_k^{T} x$ is the cosine similarity and $\theta_{w_k,x}$ is the angle between $w_k$ and $x$. As pointed out by a great many studies (Liu et al., 2016; Wang et al., 2018b; Deng et al., 2019; Wang et al., 2019b), features learned with the softmax loss tend to be separable rather than discriminative for face recognition.…”
Section: Preliminary Knowledge (mentioning)
confidence: 99%
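As a quick numerical check of the identity in this quotation (hypothetical vectors, NumPy for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
w_k = rng.normal(size=128)
x = rng.normal(size=128)

# After L2 normalization, the inner product w_k^T x equals cos(theta).
w_k /= np.linalg.norm(w_k)
x /= np.linalg.norm(x)

cos_theta = w_k @ x
theta = np.arccos(np.clip(cos_theta, -1.0, 1.0))
assert np.isclose(np.cos(theta), cos_theta)
print(f"cos(theta) = {cos_theta:.4f}, theta = {np.degrees(theta):.2f} deg")
```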