2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00045
|View full text |Cite
|
Sign up to set email alerts
|

Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors

Abstract: In this paper, we present supervision-by-registration, an unsupervised approach to improve the precision of facial landmark detectors on both images and video. Our key observation is that the detections of the same landmark in adjacent frames should be coherent with registration, i.e., optical flow. Interestingly, the coherency of optical flow is a source of supervision that does not require manual labeling, and can be leveraged during detector training. For example, we can enforce in the training loss functio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
177
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 194 publications
(186 citation statements)
references
References 37 publications
2
177
0
Order By: Relevance
“…Similar work [25,26] propagates pose results temporally using optical flow to encourage time consistency of the estimated bodies. Apart from its application in warping between frames, the structural information existing in optical flow alone has been used for pose estimation [27] or in conjunction with an image stream [28,29].…”
Section: Related Workmentioning
confidence: 99%
“…Similar work [25,26] propagates pose results temporally using optical flow to encourage time consistency of the estimated bodies. Apart from its application in warping between frames, the structural information existing in optical flow alone has been used for pose estimation [27] or in conjunction with an image stream [28,29].…”
Section: Related Workmentioning
confidence: 99%
“…Linear regression based methods learn a function that maps the input face image to the normalized landmark coordinates [44,7]. Heatmap regression based methods produce one heatmap for each landmark, where the coordinate is the location of the highest response on this heatmap [41,11,9,30,5]. All above algorithms can be readily integrated into our framework, serving as different student detectors.…”
Section: Supervised Facial Landmark Detectionmentioning
confidence: 99%
“…The 300-W dataset [35] annotates 68 landmarks from five facial landmark datasets, i.e., LFPW, AFW, HELEN, XM2VTS, and IBUG. Following the common settings [11,9,27], we regard all the training samples from LFPW, HE-LEN and the full set of AFW as the training set, in which there is 3148 training images. The common test subset consists of 554 test images from LFPW and HELEN.…”
Section: Datasetsmentioning
confidence: 99%
See 1 more Smart Citation
“…More recently, PCD-CNN [9] uses head pose information to drive the training process. CPM+SBR [5] employs landmark registration to regularize training. SAN [4] uses adversarial networks to convert images from different styles to an aggregated style, upon which regression is performed.…”
Section: Related Workmentioning
confidence: 99%