2017
DOI: 10.3390/app7111184
|View full text |Cite
|
Sign up to set email alerts
|

NIRExpNet: Three-Stream 3D Convolutional Neural Network for Near Infrared Facial Expression Recognition

Abstract: Facial expression recognition (FER) under active near-infrared (NIR) illumination has the advantages of illumination invariance. In this paper, we propose a three-stream 3D convolutional neural network, named as NIRExpNet for NIR FER. The 3D structure of NIRExpNet makes it possible to extract automatically, not just spatial features, but also, temporal features. The design of multiple streams of the NIRExpNet enables it to fuse local and global facial expression features. To avoid over-fitting, the NIRExpNet h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 31 publications
(49 reference statements)
0
8
0
Order By: Relevance
“…However, the infrared images records the emotions produced by skin distribution which are not subtle to the illumination variations. In 2017, Wu et al [171] given a 3D CNN architecture to fuse spatial and temporal features in FER images.…”
Section: E Fer On Infrared Datamentioning
confidence: 99%
“…However, the infrared images records the emotions produced by skin distribution which are not subtle to the illumination variations. In 2017, Wu et al [171] given a 3D CNN architecture to fuse spatial and temporal features in FER images.…”
Section: E Fer On Infrared Datamentioning
confidence: 99%
“…For all of the methods, we used the tenfold cross-validation method to obtain an average recognition rate. The results of Deep Temporal Geometry Network (DTAGN), 3D CNN Deformable Facial Action Parts (DAP), and NIRExpNet were obtained from [37], and the result of LBP-TOP was obtained by implementing the algorithm using MatLab software (MathWorks, Natick, MA, USA). SETFNet and SETFNet + global were implemented by using Caffe.…”
Section: Comparisons With Other Methodsmentioning
confidence: 99%
“…Jeni et al [36] proposed a 3D-shape-information-based recognition technique and further proved that an NIR camera configuration is suitable for facial expressions under light-changing conditions. Wu et al [37] proposed a three-stream 3D convolutional network for NIR facial expression recognition, using a combination of global and local features, but did not consider assigning different weights to local features.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In 2006 and later, Hinton proposed the DBN [3] and CD-K [4] algorithms, which has enabled ANNs to develop from a shallow to deep structure, achieving significant performance improvements. As a typical type of deep network [5], DBNs are widely used in image processing [6][7][8][9][10], speech recognition [11][12][13] and nonlinear function prediction [14], yielding excellent performance. However, DBNs still have many problems worth studying, such as the network structure design [15][16][17][18][19], selection and improvement of training algorithms [20,21], introduction of automatic encoders, and implementation of GPU parallel acceleration [22,23].…”
Section: Introductionmentioning
confidence: 99%