2022
DOI: 10.1007/s00371-022-02413-5
|View full text |Cite
|
Sign up to set email alerts
|

A convolution-transformer dual branch network for head-pose and occlusion facial expression recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
13
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 25 publications
(13 citation statements)
references
References 46 publications
0
13
0
Order By: Relevance
“…Fan et al [ 33 ] modeled a hierarchical scale network (HSNet), in which the scale information of facial expression images was enhanced by a dilation convolution block. In [ 34 ], a dual-branch network was projected with one branch using CNN to capture local marginal information and the other applying a visual transformer to obtain compact global representation. Wang et al constructed an architecture similar to U-Net as an attention branch to highlight subtle local facial expression information [ 35 ].…”
Section: Related Workmentioning
confidence: 99%
“…Fan et al [ 33 ] modeled a hierarchical scale network (HSNet), in which the scale information of facial expression images was enhanced by a dilation convolution block. In [ 34 ], a dual-branch network was projected with one branch using CNN to capture local marginal information and the other applying a visual transformer to obtain compact global representation. Wang et al constructed an architecture similar to U-Net as an attention branch to highlight subtle local facial expression information [ 35 ].…”
Section: Related Workmentioning
confidence: 99%
“…Facial expression recognition is a hot topic in computer vision, with a wide range of applications including human behaviour analysis, mental disorder identification, and human-computer interaction, to name a few. Most recent research [1], [2], and [3][4][5][6][7] has concentrated on developing deep ANNs to achieve cutting-edge outcomes. Even though handcrafted feature-based artificial neural network models [8] and [9] provide results that are less accurate than deep learning networks, they have attracted less attention.…”
Section: Introductionmentioning
confidence: 99%
“…Various methods are employed to identify emotions based on face traits. This manuscript examines many recent investigations into the automatic data-driven technique [3][4][5][6][7] and the handcrafted approach [1][2] to facial emotion recognition. In the most difficult real-world dataset, FER-2013, these approaches have computationally complex solutions that give good accuracy while training and testing on the same datasets.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…To alleviate the effectiveness of occlusions and head-pose variants, Liang et al 29 proposed a robust convolution-transformer dual branch network (CT-DBN) which can model local and global facial information for FER. However, these models only use single-modal information.…”
Section: Introductionmentioning
confidence: 99%