2019
DOI: 10.1049/el.2019.0443
|View full text |Cite
|
Sign up to set email alerts
|

Learning content‐adaptive feature pooling for facial depression recognition in videos

Abstract: Recently, a deep representation of facial depression built on convolutional neural networks has shown impressive performance in videobased depression recognition. However, most existing approaches either fix the weights or using a certain heuristics to integrate the frame-level facial features, resulting in suboptimal feature aggregation in encoding the helpful while discarding noisy information in videos. To address this issue, the authors introduce the memory attention mechanism in a regression network to le… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
12
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(13 citation statements)
references
References 10 publications
1
12
0
Order By: Relevance
“…Our proposed method achieves better results than the schemes in [14,23,24] which are based on handcrafted features. The proposed method also outperforms the deep learning schemes proposed in [21,3,11,22,9] on AVEC2014, confirming the good performance of our model. In [25], the method is based on distribution learning with expectation loss function.…”
Section: Experimental Analysissupporting
confidence: 71%
See 2 more Smart Citations
“…Our proposed method achieves better results than the schemes in [14,23,24] which are based on handcrafted features. The proposed method also outperforms the deep learning schemes proposed in [21,3,11,22,9] on AVEC2014, confirming the good performance of our model. In [25], the method is based on distribution learning with expectation loss function.…”
Section: Experimental Analysissupporting
confidence: 71%
“…In addition, our proposed method achieves better results than the method in [9] which uses a four-stream model to explore multiple facial regions, showing the importance of exploring the temporal information for depression detection. Finally, the authors in [22] explore spatial information and employ attention mechanism to fuse facial features. Our proposed method outperforms such method, which suggest that exploring temporal information between the frames is a better approach.…”
Section: Experimental Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…Mean accuracy achieved was 86.6% and maximum accuracy achieved was 86.9% Zhou et al (2019) [146] T his paper studies the facial expression of the videos. T hey took the content from the adaptive feature by extracting.…”
Section: Multi-modalmentioning
confidence: 98%
“…Mental illnesses have a significant impact on an individual's physical health (1), achievements (2,3), and life satisfaction (4). In addition to scales, behavioral recognition methods have been developed to judge the existence (5) or degree (6,7) of specific mental illnesses. However, identifying an individual's mental health status from a range of perspectives may be more helpful in non-professional scenarios such as self-monitoring or large-scale monitoring.…”
Section: Introductionmentioning
confidence: 99%