2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.605
|View full text |Cite
|
Sign up to set email alerts
|

Deep Structured Learning for Facial Action Unit Intensity Estimation

Abstract: We consider the task of automated estimation of facial expression intensity. This involves estimation of multiple output variables (facial action units -AUs) that are structurally dependent. Their structure arises from statistically induced co-occurrence patterns of AU intensity levels. Modeling this structure is critical for improving the estimation performance; however, this performance is bounded by the quality of the input features extracted from face images. The goal of this paper is to model these struct… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
71
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 112 publications
(76 citation statements)
references
References 39 publications
(59 reference statements)
1
71
0
Order By: Relevance
“…It is also worth mentioning recent works on Deep Learning for Action Unit detection [47] and Intensity Estimation [48], [11]. Although these models have a high modelling power, the reported results have not shown significant improvements compared to traditional shallow methods using hand-crafted features.…”
Section: Conclusion and Discussionmentioning
confidence: 94%
See 1 more Smart Citation
“…It is also worth mentioning recent works on Deep Learning for Action Unit detection [47] and Intensity Estimation [48], [11]. Although these models have a high modelling power, the reported results have not shown significant improvements compared to traditional shallow methods using hand-crafted features.…”
Section: Conclusion and Discussionmentioning
confidence: 94%
“…Although these models have a high modelling power, the reported results have not shown significant improvements compared to traditional shallow methods using hand-crafted features. For example, the recently proposed Copula Convolutional Neural Network (CNN) [11] for AU Intensity Estimation is highly-related to our approach, because it combines a CNN with a probabilistic graphical model similar to the one employed in MI-DORF. Even though the Copula CNN requires intensity labels for all the frames during training, the reported results on the DISFA dataset are comparable to those achieved by our method.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…Two main ways of solving multi-label learning in AU recognition are either capturing correlations through fully-connected layers [31,8,15] or inferring structure through probabilistic graphical models (PGM) [30,24,7]. While the former can capture correlations between classes, this is not done explicitly.…”
Section: Methodsmentioning
confidence: 99%
“…Together with patch-learning, Zhao et al [30] used positive and negative competitions among AUs to model a discriminative multi-label classifier. Walecki et al [24] placed a CRF on top of deep representations learned by a CNN. Both components are trained iteratively to estimate AU intensity.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation