2017 IEEE International Conference on Computer Vision Workshops (ICCVW) 2017
DOI: 10.1109/iccvw.2017.374
|View full text |Cite
|
Sign up to set email alerts
|

Facial Expression Recognition via Joint Deep Learning of RGB-Depth Map Latent Representations

Abstract: Humans use facial expressions successfully for conveying their emotional states. However, replicating such success in the human-computer interaction domain is an active research problem. In this paper, we propose deep convolutional neural network (DCNN) for joint learning of robust facial expression features from fused RGB and depth map latent representations. We posit that learning jointly from both modalities result in a more robust classifier for facial expression recognition (FER) as opposed to learning fr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
31
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 41 publications
(32 citation statements)
references
References 27 publications
1
31
0
Order By: Relevance
“…BU-3DFE dataset contains both 3D and 2D data modalities for facial expressions of 100 different subjects. The data processing in [14] is followed for preparing the training data. DepthMap-ResNet50 [14] 61.11 DepthMap-VGG19 [14] 28.06 DepthMap-scratch [14] 84.72 RGB-ResNet50 [14] 82.92 RGB-VGG19 [14] 81.…”
Section: Bu-3dfe Facial Expression Rgb-d Datasetmentioning
confidence: 99%
See 3 more Smart Citations
“…BU-3DFE dataset contains both 3D and 2D data modalities for facial expressions of 100 different subjects. The data processing in [14] is followed for preparing the training data. DepthMap-ResNet50 [14] 61.11 DepthMap-VGG19 [14] 28.06 DepthMap-scratch [14] 84.72 RGB-ResNet50 [14] 82.92 RGB-VGG19 [14] 81.…”
Section: Bu-3dfe Facial Expression Rgb-d Datasetmentioning
confidence: 99%
“…DepthMap-ResNet50 [14] 61.11 DepthMap-VGG19 [14] 28.06 DepthMap-scratch [14] 84.72 RGB-ResNet50 [14] 82.92 RGB-VGG19 [14] 81. [16] 82.30 Distance+slopes+SVM [18] 87.10 2D+3D features fusion+SVM [19] 86.32 Geometric scattering representation+SVM [20] 84.80 Geometric+photometric attributes+VGG19 [21] 84.87 NF:RGB-ResNet50+DepthMap-scratch [14] 87.08 NF: RGB-VGG19+DepthMap-scratch [14] 89.31 Ours: RGB-ResNet50+DepthMap-scratch 89.86 Ours: RGB-VGG19+DepthMap-scratch 90.69 Table 2. Results comparison on BU-3DFE dataset Also, a similar experimental setting is used for extracting latent expresentations from both data modalities; that is, using pre-trained models (ResNet-50 and VGG19) on ImageNet dataset for RGB data and training a DNN from scratch on depth data.…”
Section: Bu-3dfe Facial Expression Rgb-d Datasetmentioning
confidence: 99%
See 2 more Smart Citations
“…The two pieces of information combined provide more dimensions to model and process, e.g. [9,26,16]. This richer information is desired in several scenarios.…”
Section: Introductionmentioning
confidence: 99%