2019 International Conference on Biometrics (ICB) 2019
DOI: 10.1109/icb45273.2019.8987385
|View full text |Cite
|
Sign up to set email alerts
|

Video Face Recognition: Component-wise Feature Aggregation Network (C-FAN)

Abstract: We propose a new approach to video face recognition. Our component-wise feature aggregation network (C-FAN) accepts a set of face images of a subject as an input, and outputs a single feature vector as the face representation of the set for the recognition task. The whole network is trained in two steps: (i) train a base CNN for still image face recognition; (ii) add an aggregation module to the base network to learn the quality value for each feature component, which adaptively aggregates deep feature vectors… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
35
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 44 publications
(35 citation statements)
references
References 40 publications
(91 reference statements)
0
35
0
Order By: Relevance
“…Results from ArcFace are shown with the prefix Arc-. Two recent works [60] and [61] have reported results on the IJB-S dataset. These works mainly focused on face recognition and not detection so that they built video templates by matching their detections with ground truth bounding boxes provided by the protocols and evaluated their methods using identification accuracy and not EERR metric.…”
Section: Cosmentioning
confidence: 99%
“…Results from ArcFace are shown with the prefix Arc-. Two recent works [60] and [61] have reported results on the IJB-S dataset. These works mainly focused on face recognition and not detection so that they built video templates by matching their detections with ground truth bounding boxes provided by the protocols and evaluated their methods using identification accuracy and not EERR metric.…”
Section: Cosmentioning
confidence: 99%
“…It results that high-quality face made more contribution to the final feature and favors the face images more discriminative. The study [3] firstly provided a component-wise aggregation, which controls the normalized quality of corresponding feature pooling multiple frames together. Some of the works like ref.…”
Section: A Video Face Recognitionmentioning
confidence: 99%
“…Although the compelling progress in deep learning and computer vision, it is still a great challenge to match surveillance face images in different modalities, especially in open-set scenario [1]. There have been varieties of efforts about video-based face recognition [2,3,4]. However, most of them focus on learning an image-level face representation or aggregating face representations through simple pooling from favorable viewing angles.…”
Section: Introductionmentioning
confidence: 99%
“…Chen et al [30,31] proposed a set-based algorithm using sparse representation and dictionary learning. Some adaptive pooling methods [32][33][34] are also proposed, which flatten face sets by performing adaptive weighted average pooling on face features. Based on [2], Zheng et al further introduced a graphical modelbased approach which leverages the contextual information in the videos by identity information propagation to improve face recognition performance.…”
Section: Set/sequence-based Face Matchingmentioning
confidence: 99%