2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2022
DOI: 10.1109/cvprw56347.2022.00269
|View full text |Cite
|
Sign up to set email alerts
|

Coarse-to-Fine Cascaded Networks with Smooth Predicting for Video Facial Expression Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 27 publications
(5 citation statements)
references
References 28 publications
0
5
0
Order By: Relevance
“…Since the human face contains strong salient information that is conducive to extracting more refined emotion information, such as micro-expressions [22][23][24], the research on human emotion recognition methods throughout the past decade has focused on facial expression analysis [25][26][27][28][29]. Traditional research either uses facial fiducial points based on the Gabor-feature facial point detector [30] or focuses on facial action unit detection where a set of facial muscle movements is utilized for encoding corresponding facial expressions [31,32].…”
Section: Context-aware Emotion Recognitionmentioning
confidence: 99%
“…Since the human face contains strong salient information that is conducive to extracting more refined emotion information, such as micro-expressions [22][23][24], the research on human emotion recognition methods throughout the past decade has focused on facial expression analysis [25][26][27][28][29]. Traditional research either uses facial fiducial points based on the Gabor-feature facial point detector [30] or focuses on facial action unit detection where a set of facial muscle movements is utilized for encoding corresponding facial expressions [31,32].…”
Section: Context-aware Emotion Recognitionmentioning
confidence: 99%
“…Jeong et al [11] extended the DAN model and achieved 2nd in ABAW3. Xue et al [30] utilized a coarseto-fine cascade network with a temporal smoothing strategy and ranked 3rd in ABAW3. Zhang et al [35] found that AU, VA, and Expr representations are intrinsically associated with each other and proposed a streaming network for multi-task learning.…”
Section: Related Workmentioning
confidence: 99%
“…Jeong et al [15]proposed a multi-head cross attention networks and pretrained on Glint360K [1] and some private commercial datasets. Xue et al [46] proposed the Coarse-to-Fine Cascaded networks (CFC) to address the label ambiguity problem and used smooth predicting method to post-process the extracted features. Savchenko et al [37] proposed the novel frame-level emotion recognition algorithm which can be implemented even for video analytics on mobile devices.…”
Section: Affective Behavior Analysis In-the-wildmentioning
confidence: 99%