2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2019
DOI: 10.1109/cvprw.2019.00030
|View full text |Cite
|
Sign up to set email alerts
|

LBVCNN: Local Binary Volume Convolutional Neural Network for Facial Expression Recognition From Image Sequences

Abstract: Recognizing facial expressions is one of the central problems in computer vision. Temporal image sequences have useful spatio-temporal features for recognizing expressions. In this paper, we propose a new 3D Convolution Neural Network (CNN) that can be trained end-to-end for facial expression recognition on temporal image sequences without using facial landmarks. More specifically, a novel 3D convolutional layer that we call Local Binary Volume (LBV) layer is proposed. The LBV layer, when used with our newly p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
24
0
6

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 36 publications
(31 citation statements)
references
References 40 publications
0
24
0
6
Order By: Relevance
“…The average accuracy of 10 runs for seven-class and eight-class are reported. Among the many previous works, some works such as STRNN [ 42 ], LBVCNN [ 41 ], TPOEM [ 38 ], PHRNN-MSCNN [ 39 ], and SAANet [ 43 ] used image sequence as their experimental data, while others used a static image. Although Specific preprocessing [ 16 ], ALAW [ 22 ], Feature loss [ 28 ], OAENet [ 35 ], and S-DSRN [ 23 ] used seven expressions, contempt expression is replaced with neural.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…The average accuracy of 10 runs for seven-class and eight-class are reported. Among the many previous works, some works such as STRNN [ 42 ], LBVCNN [ 41 ], TPOEM [ 38 ], PHRNN-MSCNN [ 39 ], and SAANet [ 43 ] used image sequence as their experimental data, while others used a static image. Although Specific preprocessing [ 16 ], ALAW [ 22 ], Feature loss [ 28 ], OAENet [ 35 ], and S-DSRN [ 23 ] used seven expressions, contempt expression is replaced with neural.…”
Section: Methodsmentioning
confidence: 99%
“…To overcome the limitation that traditional LBP can lose the neighboring pixels related to different scales that can affect the texture of facial images, Yasmin et al [ 30 ] proposed a new extended LBP method based on the bitwise “AND” operation of two rotational kernels to extract facial features. In view of satisfactory performance of the LBP operator, the CNNs that integrate advantages of the LBP have been developed [ 41 , 55 , 56 ]. Lyons et al [ 27 ] used a multiscale, multiorientation set of Gabor filters to code facial expression images through comparing the similarity space derived from semantic ratings of the images by human observers with the one derived from Gabor representation; authors believed that the latter shows a significant degree of psychological plausibility.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Although these approaches are effective methods in extracting spatial information, they fail to capture morphological and contextual variations in the expression process. Recent methods aim to solve this problem by using massive datasets to obtain more efficient features of FER [9][10][11][12][13][14][15]. Some researchers use multimodal fusion to recognize emotions, such as voices, expressions, and actions [16].…”
Section: Related Workmentioning
confidence: 99%