NIRExpNet: Three-Stream 3D Convolutional Neural Network for Near Infrared Facial Expression Recognition

Wu, Zhan; Chen, Tong; Chen, Ying; Zhang, Zhihao; Liu, Guangyuan

doi:10.3390/app7111184

Cited by 12 publications

(8 citation statements)

References 31 publications

(49 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, the infrared images records the emotions produced by skin distribution which are not subtle to the illumination variations. In 2017, Wu et al [171] given a 3D CNN architecture to fuse spatial and temporal features in FER images.…”

Section: E Fer On Infrared Datamentioning

confidence: 99%

Facial Sentiment Analysis Using AI Techniques: State-of-the-Art, Taxonomies, and Challenges

et al. 2020

View full text Add to dashboard Cite

With the advancements in machine and deep learning algorithms, the envision of various critical real-life applications in computer vision becomes possible. One of the applications is facial sentiment analysis. Deep learning has made facial expression recognition the most trending research fields in computer vision area. Recently, deep learning-based FER models have suffered from various technological issues like under-fitting or over-fitting. It is due to either insufficient training and expression data. Motivated from the above facts, this paper presents a systematic and comprehensive survey on current state-of-art Artificial Intelligence techniques (datasets and algorithms) that provide a solution to the aforementioned issues. It also presents a taxonomy of existing facial sentiment analysis strategies in brief. Then, this paper reviews the existing novel machine and deep learning networks proposed by researchers that are specifically designed for facial expression recognition based on static images and present their merits and demerits and summarized their approach. Finally, this paper also presents the open issues and research challenges for the design of a robust facial expression recognition system. INDEX TERMS Facial sentiment analysis, machine learning, deep learning, convolutional neural network, deep belief network, artificial intelligence.

show abstract

Section: E Fer On Infrared Datamentioning

confidence: 99%

Facial Sentiment Analysis Using AI Techniques: State-of-the-Art, Taxonomies, and Challenges

et al. 2020

View full text Add to dashboard Cite

show abstract

“…For all of the methods, we used the tenfold cross-validation method to obtain an average recognition rate. The results of Deep Temporal Geometry Network (DTAGN), 3D CNN Deformable Facial Action Parts (DAP), and NIRExpNet were obtained from [37], and the result of LBP-TOP was obtained by implementing the algorithm using MatLab software (MathWorks, Natick, MA, USA). SETFNet and SETFNet + global were implemented by using Caffe.…”

Section: Comparisons With Other Methodsmentioning

confidence: 99%

“…Jeni et al [36] proposed a 3D-shape-information-based recognition technique and further proved that an NIR camera configuration is suitable for facial expressions under light-changing conditions. Wu et al [37] proposed a three-stream 3D convolutional network for NIR facial expression recognition, using a combination of global and local features, but did not consider assigning different weights to local features.…”

Section: Related Workmentioning

confidence: 99%

“…Tables [8][9][10][11] show the confusion matrix of the comparison algorithms, with the labels on the left-hand side representing actual classes and those at the bottom representing the predicted classes. The confusion matrix of NIRExpNet (Table 8) was adopted from [37] directly. The other matrixes were obtained by implementing the algorithms with MatLab code on the database (tenfold cross-validation).…”

Section: Confusion Matrixesmentioning

confidence: 99%

See 1 more Smart Citation

Three-Stream Convolutional Neural Network with Squeeze-and-Excitation Block for Near-Infrared Facial Expression Recognition

et al. 2019

Self Cite

View full text Add to dashboard Cite

Near-infrared (NIR) facial expression recognition is resistant to illumination change. In this paper, we propose a three-stream three-dimensional convolution neural network with a squeeze-and-excitation (SE) block for NIR facial expression recognition. We fed each stream with different local regions, namely the eyes, nose, and mouth. By using an SE block, the network automatically allocated weights to different local features to further improve recognition accuracy. The experimental results on the Oulu-CASIA NIR facial expression database showed that the proposed method has a higher recognition rate than some state-of-the-art algorithms.

show abstract

“…In 2006 and later, Hinton proposed the DBN [3] and CD-K [4] algorithms, which has enabled ANNs to develop from a shallow to deep structure, achieving significant performance improvements. As a typical type of deep network [5], DBNs are widely used in image processing [6][7][8][9][10], speech recognition [11][12][13] and nonlinear function prediction [14], yielding excellent performance. However, DBNs still have many problems worth studying, such as the network structure design [15][16][17][18][19], selection and improvement of training algorithms [20,21], introduction of automatic encoders, and implementation of GPU parallel acceleration [22,23].…”

Section: Introductionmentioning

confidence: 99%

DBN Structure Design Algorithm for Different Datasets Based on Information Entropy and Reconstruction Error

Jiang

Zhang

et al. 2018

Entropy

View full text Add to dashboard Cite

Deep belief networks (DBNs) of deep learning technology have been successfully used in many fields. However, the structure of a DBN is difficult to design for different datasets. Hence, a DBN structure design algorithm based on information entropy and reconstruction error is proposed. Unlike previous algorithms, we innovatively combine network depth and node number and optimizes them simultaneously. First, the mathematical model of the structural design problem is established, and the boundary constraint for node number based on information entropy is derived by introducing the idea of information compression. Moreover, the optimization objective of the network performance based on reconstruction error is proposed by deriving the fact that network energy is proportional to reconstruction error. Finally, the improved simulated annealing (ISA) algorithm is used to adjust the DBN network layers and nodes simultaneously. Experiments were carried out on three public datasets (MNIST, Cifar-10 and Cifar-100). The results show that the proposed algorithm can design its proper structure to different datasets, yielding a trained DBN which has the lowest reconstruction error and prediction error rate. The proposed algorithm is shown to have the best performance compared with other algorithms and can be used to assist the setting of DBN structural parameters for different datasets.

show abstract

NIRExpNet: Three-Stream 3D Convolutional Neural Network for Near Infrared Facial Expression Recognition

Cited by 12 publications

References 31 publications

Facial Sentiment Analysis Using AI Techniques: State-of-the-Art, Taxonomies, and Challenges

Facial Sentiment Analysis Using AI Techniques: State-of-the-Art, Taxonomies, and Challenges

Three-Stream Convolutional Neural Network with Squeeze-and-Excitation Block for Near-Infrared Facial Expression Recognition

DBN Structure Design Algorithm for Different Datasets Based on Information Entropy and Reconstruction Error

Contact Info

Product

Resources

About