AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations

Zhang, Xiao; Zhao, Rui; Qiao, Yu; Wang, Xiaogang; Li, Hongsheng

doi:10.1109/cvpr.2019.01108

Cited by 192 publications

(119 citation statements)

References 38 publications

(158 reference statements)

Supporting

Mentioning

119

Contrasting


“…It also designs an appropriate loss function that can enhance the discriminative power of DCNN-based, large-scale face recognition. However, cosine-based softmax losses [167][168][169] provide better results in deep learning-based face recognition. Highly discriminative features were achieved using an Additive Angular Margin Loss (ArcFace) for face recognition [170].…”

Section: Object Detection In Surveillance (mentioning)

confidence: 99%

Exploring Deep Learning-Based Architecture, Strategies, Applications and Current Trends in Generic Object Detection: A Comprehensive Review

Aziz, Salam, Sheikh, et al. 2020

IEEE Access


Object detection is a fundamental but challenging problem in generic image analysis; it plays an important role in a wide range of applications and has received special attention in recent years. Although numerous methods exist, an in-depth review of the literature concerning generic detection remains lacking. This paper provides a comprehensive survey of recent advances in visual object detection with deep learning. Covering about 300 publications, we survey 1) region proposal-based object detection methods such as R


“…First, we plan to apply GO loss to other datasets for a thorough evaluation of its performance under different application scenarios. Second, we will propose a method to quantitatively determine the value of the hyperparameters, such as by visual analytics [6] or adaptive scaling [47].…”

Section: Results (mentioning)

confidence: 99%

GO Loss: A Gaussian Distribution‐Based Orthogonal Decomposition Loss for Classification

et al. 2019

Self Cite


We present a novel loss function, namely, GO loss, for classification. Most of the existing methods, such as center loss and contrastive loss, dynamically determine the convergence direction of the sample features during the training process. By contrast, GO loss decomposes the convergence direction into two mutually orthogonal components, namely, tangential and radial directions, and conducts optimization on them separately. The two components theoretically affect the interclass separation and the intraclass compactness of the distribution of the sample features, respectively. Thus, separately minimizing losses on them can avoid the effects of their optimization. Accordingly, a stable convergence center can be obtained for each of them. Moreover, we assume that the two components follow Gaussian distribution, which is proved as an effective way to accurately model training features for improving the classification effects. Experiments on multiple classification benchmarks, such as MNIST, CIFAR, and ImageNet, demonstrate the effectiveness of GO loss.


“…The choice of the scaling hyper-parameter usually relies on heuristic trials, which are both time-consuming and inconvenient to use. The automatic selection of this parameter has been discussed in [29]. Inspired by these efforts, we designed a simple scheme to automatically determine this scale parameter for different logits, so that their scale ranges are the same.…”

Section: Automatic Scale Parameter Selection (mentioning)

confidence: 99%

“…Similarly, NormFace [24] normalized both feature vectors and weight vectors to optimize cosine similarity instead of the inner products in the softmax loss, thereby effectively improving the angular discrimination of the features. Besides, the adaptive selection of the scale and margin hyper-parameters was studied in [28,29,47]. Zhang et al. studied the settings of the scale and angular margin parameters in cosine-based softmax losses and proposed AdaCos [29] to adaptively scale cosine logits to enhance the supervision during training.…”

Section: Introduction (mentioning)

confidence: 99%

“…Besides, the adaptive selection of the scale and margin hyper-parameters was studied in [28,29,47]. Zhang et al. studied the settings of the scale and angular margin parameters in cosine-based softmax losses and proposed AdaCos [29] to adaptively scale cosine logits to enhance the supervision during training. Liu et al. proposed an adaptive margin softmax [28] to adaptively adjust the margins for different classes to tackle the problem of imbalanced training data in face recognition.…”

Section: Introduction (mentioning)

confidence: 99%
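The citation statements above refer to scaled cosine logits without showing what they look like. The following is a minimal sketch, not the implementation of any cited paper: features and class weight vectors are L2-normalized so that each logit is a cosine similarity, which is then multiplied by a scale s before the softmax. The function names and the toy vectors are hypothetical. The fixed scale sqrt(2) * log(C - 1) is the heuristic associated with the fixed variant of AdaCos and is included here as an illustration; the dynamically adapted variant is not shown.

```python
import math


def l2_normalize(vector):
    """Return the vector divided by its Euclidean norm."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]


def cosine_logits(feature, class_weights, scale):
    """Scaled cosine similarity between one feature and every class weight."""
    f = l2_normalize(feature)
    logits = []
    for weight in class_weights:
        w = l2_normalize(weight)
        cosine = sum(a * b for a, b in zip(f, w))
        logits.append(scale * cosine)
    return logits


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    largest = max(logits)
    exps = [math.exp(z - largest) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fixed_scale(num_classes):
    """Fixed scale heuristic sqrt(2) * log(C - 1); needs C > 2 to be positive."""
    return math.sqrt(2.0) * math.log(num_classes - 1)


# Toy example (assumed values): one 2-D feature, three class weight vectors.
feature = [3.0, 0.5]
class_weights = [[2.0, 0.0], [0.0, 5.0], [-1.0, -1.0]]
s = fixed_scale(len(class_weights))
probabilities = softmax(cosine_logits(feature, class_weights, s))
```

Because the cosine is bounded in [-1, 1], the scale alone controls how peaked the softmax can become, which is why the cited works treat its selection as a separate problem.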


LinCos-Softmax: Learning Angle-Discriminative Face Representations With Linearity-Enhanced Cosine Logits

Zhou et al. 2020

IEEE Access


In recent years, angle-based softmax losses have significantly improved the performance of face recognition, but these loss functions are all based on the cosine logit. A potential weakness is that the nonlinearity of the cosine function may undesirably saturate the angular optimization between the features and the corresponding weight vectors, thereby preventing the network from fully learning to maximize the angular discriminability of features. As a result, the generalization of learned features may be compromised. To tackle this issue, we propose a Linear-Cosine Softmax Loss (LinCos-Softmax) to more effectively learn angle-discriminative facial features. The main characteristic of the loss function we propose is the use of an approximated linear logit, obtained through Taylor expansion. Compared with the conventional cosine logit, it has a stronger linear relationship with the angle, which enhances angular discrimination. We also propose an automatic scale parameter selection scheme, which can conveniently provide an appropriate scale for different logits without the need for exhaustive parameter search to improve performance. In addition, we propose a margin-enhanced Linear-Cosine Softmax Loss (m-LinCos-Softmax) to further enlarge inter-class distances and reduce intra-class variations. Experimental results on several face recognition benchmarks (LFW, AgeDB-30, CFP-FP, MegaFace Challenge 1) demonstrate the effectiveness of the proposed method and its superiority to existing angular softmax loss variants.
