2019 International Conference on Multimodal Interaction 2019
DOI: 10.1145/3340555.3355719
|View full text |Cite
|
Sign up to set email alerts
|

Bi-modality Fusion for Emotion Recognition in the Wild

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
18
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 35 publications
(20 citation statements)
references
References 15 publications
2
18
0
Order By: Relevance
“…Typically, low and high-level feature extractors are combined in a base network architecture. We choose VGG16 [22] as the base network because this network is still used as the base of many recent network for the FER task [31], [32], [33]. From the base network, VGG16 [22], we separated into two parts for two levels of input.…”
Section: Pyramid With Super-resolution (Psr) Networkmentioning
confidence: 99%
“…Typically, low and high-level feature extractors are combined in a base network architecture. We choose VGG16 [22] as the base network because this network is still used as the base of many recent network for the FER task [31], [32], [33]. From the base network, VGG16 [22], we separated into two parts for two levels of input.…”
Section: Pyramid With Super-resolution (Psr) Networkmentioning
confidence: 99%
“…The accuracy measurements of our proposed methods and related methods on the AFEW validation set are shown in Table 6. Our spatiotemporal method outperforms other recently reported methods using the same approach, by around 0.14% compared with Li et al [63]. Recently, Kumar et al [66] used multi-level attention with an unsupervised approach by iterative training between student and teacher models.…”
Section: Discussion and Comparison With Related Workmentioning
confidence: 54%
“…The bias of nth hidden unit is represented as b n . The activation probability of mth visible unit having known hidden vector h(h 1 , … , h n , .., h k ) is computed using Equation (12),…”
Section: Ta B L E 2 Steps Applied By Contrastive Divergence Algorithmmentioning
confidence: 99%
“…Moreover, the advantages of noncontact is found in video recordings, therefore it is applied in worldwide applications. 12 Still now, number of techniques are made available for recognizing the distinct expressions from the human face. Some of the common techniques basically applied for this purpose are convolutional neural network (CNN) and deep belief network (DBN).…”
Section: Introductionmentioning
confidence: 99%