2014 13th International Conference on Machine Learning and Applications 2014
DOI: 10.1109/icmla.2014.70
|View full text |Cite
|
Sign up to set email alerts
|

Facial Expression Recognition Using Kinect Depth Sensor and Convolutional Neural Networks

Abstract: Facial expression recognition is an active area of research with applications in the design of Human Computer Interaction (HCI) systems. In this paper, we propose an approach for facial expression recognition using deep convolutional neural networks (CNN) based on features generated from depth information only. The Gradient direction information of depth data is used to represent facial information, due its invariance to distance from the sensor. The ability of a convolutional neural networks (CNN) to learn lo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 32 publications
(11 citation statements)
references
References 13 publications
(13 reference statements)
0
11
0
Order By: Relevance
“…Then, the patient logs into the system through a face recognition technology embedded in the exercise game using Kinect v2. We used face recognition to identify the user because of the following two reasons: first, it allows natural interaction with the system, with high recognition accuracy [ 34 - 37 ], and second, in the future, we can extend the exercise game system with emotion recognition using the camera [ 38 - 40 ].…”
Section: Methodsmentioning
confidence: 99%
“…Then, the patient logs into the system through a face recognition technology embedded in the exercise game using Kinect v2. We used face recognition to identify the user because of the following two reasons: first, it allows natural interaction with the system, with high recognition accuracy [ 34 - 37 ], and second, in the future, we can extend the exercise game system with emotion recognition using the camera [ 38 - 40 ].…”
Section: Methodsmentioning
confidence: 99%
“…Some approaches applied 3D scanning [13] and thermal imaging [14] to recording the facial information. Though 3D scanning is accurate and invariant to illumination changes compared to other approaches, it requires specialized expensive equipment and capture in controlled environments [15]. Therefore, Microsoft Kinect as a 3D sensor is an attractive alternative due to its low cost, portability, and applicability in many interactive applications such as games and action recognition.…”
Section: Introductionmentioning
confidence: 99%
“…In [15], depth information was used to recognize facial expressions with open mouth, occlusion of mouth by hand and occlusion by paper. The Gradient direction information of depth data was used as facial features and sent into the convolutional neural network for classification.…”
Section: Introductionmentioning
confidence: 99%
“…, SIFT, HOG, LBP) at the frame-by-frame level and train off-the-shelf classifiers for the recognition of AUs at the frame level. Representative approaches include neural networks [33], Bayesian networks [35], support vector machine with single margin [5], [7], [24] or multiple margins [42], boosting based approaches [1], and more recently the end-to-end convolutional neural networks [14], [45]. Dynamic approaches consider temporal information by recognizing AUs at the segment level ( i.e.…”
Section: Introductionmentioning
confidence: 99%
“…CNN has become one of the most powerful machine learning methods in large-scale object detection, image classification [21], [30], and more recently AU detection [14], [17]. Other approaches to AU detection first engineer hand-crafted features and then independently train classifiers.…”
Section: Introductionmentioning
confidence: 99%