2021
DOI: 10.1109/taffc.2019.2940224
|View full text |Cite
|
Sign up to set email alerts
|

Spatio-Temporal Encoder-Decoder Fully Convolutional Network for Video-Based Dimensional Emotion Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
9
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 28 publications
(10 citation statements)
references
References 62 publications
0
9
0
Order By: Relevance
“…The experimental results demonstrated that the LSTM outperformed SVR dramatically. Compared with RNN based methods, a fully convolutional network was designed by Du et al [6]. It performed emotion recognition in a coarse-to-fine strategy by aggregating multi-level features with various scales.…”
Section: Related Workmentioning
confidence: 99%
“…The experimental results demonstrated that the LSTM outperformed SVR dramatically. Compared with RNN based methods, a fully convolutional network was designed by Du et al [6]. It performed emotion recognition in a coarse-to-fine strategy by aggregating multi-level features with various scales.…”
Section: Related Workmentioning
confidence: 99%
“…Temporal Modelling in Emotion Recognition: Most existing methods model the temporal dynamics of continuous emotions using deterministic approaches such as Time-Delay Neural Networks [43], RNNs, LSTMs and GRUs [32,28,26,71,8,59], multi-head attention models [20], 3D Convolutions [76,35], 3D ConvLSTMs [19], and temporal-hourglass CNNs [10]. While these deterministic models are capable of effectively learning the temporal dynamics, they do not take the inherent stochastic nature of the continuous emotion labels into account.…”
Section: Related Workmentioning
confidence: 99%
“…Moreover, it is difficult for LSTM-RNNs to avoid the exploding/vanishing gradient problem when input sequences are long [9]. Recently, due to their advantageous parallelism, flexible receptive field and stable gradient, temporal convolutional networks [8] have proven effective at capturing long range patterns [10].…”
Section: Introductionmentioning
confidence: 99%