Learning content‐adaptive feature pooling for facial depression recognition in videos

Zhou, Xiuzhuang; Huang, Peng; Liu, Haoming; Niu, Sihua

doi:10.1049/el.2019.0443

Cited by 17 publications

(13 citation statements)

References 10 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Our proposed method achieves better results than the schemes in [14,23,24] which are based on handcrafted features. The proposed method also outperforms the deep learning schemes proposed in [21,3,11,22,9] on AVEC2014, confirming the good performance of our model. In [25], the method is based on distribution learning with expectation loss function.…”

Section: Experimental Analysissupporting

confidence: 71%

“…In addition, our proposed method achieves better results than the method in [9] which uses a four-stream model to explore multiple facial regions, showing the importance of exploring the temporal information for depression detection. Finally, the authors in [22] explore spatial information and employ attention mechanism to fuse facial features. Our proposed method outperforms such method, which suggest that exploring temporal information between the frames is a better approach.…”

Section: Experimental Analysismentioning

confidence: 99%

“…Zhou et al [9] used multiple 2D CNNs to investigate different facial regions with a scheme to combine all responses. In [10], the authors proposed to integrate the facial features by using an attention mechanism to fuse features from a video. Such approaches consider the spatial dependencies of extracted features, disregarding the temporal information between the frames, which can impair the exploitation of spatio-temporal relations.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Encoding Temporal Information For Automatic Depression Recognition From Facial Analysis

Melo

Granger

López

2020

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

View full text Add to dashboard Cite

Depression is a mental illness that may be harmful to an individual's health. Using deep learning models to recognize the facial expressions of individuals captured in videos has shown promising results for automatic depression detection. Typically, depression levels are recognized using 2D-Convolutional Neural Networks (CNNs) that are trained to extract static features from video frames, which impairs the capture of dynamic spatio-temporal relations. As an alternative, 3D-CNNs may be employed to extract spatiotemporal features from short video clips, although the risk of overfitting increases due to the limited availability of labeled depression video data. To address these issues, we propose a novel temporal pooling method to capture and encode the spatio-temporal dynamic of video clips into an image map. This approach allows fine-tuning a pre-trained 2D CNN to model facial variations, and thereby improving the training process and model accuracy. Our proposed method is based on two-stream model that performs late fusion of appearance and dynamic information. Extensive experiments on two benchmark AVEC datasets indicate that the proposed method is efficient and outperforms the state-of-the-art schemes.

show abstract

Section: Experimental Analysissupporting

confidence: 71%

Section: Experimental Analysismentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Encoding Temporal Information For Automatic Depression Recognition From Facial Analysis

Melo

Granger

López

2020

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

View full text Add to dashboard Cite

show abstract

“…Mean accuracy achieved was 86.6% and maximum accuracy achieved was 86.9% Zhou et al (2019) [146] T his paper studies the facial expression of the videos. T hey took the content from the adaptive feature by extracting.…”

Section: Multi-modalmentioning

confidence: 98%

A review and meta-analysis of machine intelligence approaches for mental health issues and depression detection

Chahar¹,

Dubey²,

Narang³

2021

IJATEE

View full text Add to dashboard Cite

“…Mental illnesses have a significant impact on an individual's physical health (1), achievements (2,3), and life satisfaction (4). In addition to scales, behavioral recognition methods have been developed to judge the existence (5) or degree (6,7) of specific mental illnesses. However, identifying an individual's mental health status from a range of perspectives may be more helpful in non-professional scenarios such as self-monitoring or large-scale monitoring.…”

Section: Introductionmentioning

confidence: 99%

Identifying Psychological Symptoms Based on Facial Movements

Wang

Zhou

et al. 2020

Front. Psychiatry

View full text Add to dashboard Cite

Background: Many methods have been proposed to automatically identify the presence of mental illness, but these have mostly focused on one specific mental illness. In some non-professional scenarios, it would be more helpful to understand an individual's mental health status from all perspectives.Methods: We recruited 100 participants. Their multi-dimensional psychological symptoms of mental health were evaluated using the Symptom Checklist 90 (SCL-90) and their facial movements under neutral stimulation were recorded using Microsoft Kinect. We extracted the time-series characteristics of the key points as the input, and the subscale scores of the SCL-90 as the output to build facial prediction models. Finally, the convergent validity, discriminant validity, criterion validity, and the split-half reliability were respectively assessed using a multitrait-multimethod matrix and correlation coefficients.Results: The correlation coefficients between the predicted values and actual scores were 0.26 and 0.42 (P < 0.01), which indicated good criterion validity. All models except depression had high convergent validity but low discriminant validity. Results also indicated good levels of split-half reliability for each model [from 0.516 (hostility) to 0.817 (interpersonal sensitivity)] (P < 0.001).Conclusion: The validity and reliability of facial prediction models were confirmed for the measurement of mental health based on the SCL-90. Our research demonstrated that fine-grained aspects of mental health can be identified from the face, and provided a feasible evaluation method for multi-dimensional prediction models.

show abstract

Learning content‐adaptive feature pooling for facial depression recognition in videos

Cited by 17 publications

References 10 publications

Encoding Temporal Information For Automatic Depression Recognition From Facial Analysis

Encoding Temporal Information For Automatic Depression Recognition From Facial Analysis

A review and meta-analysis of machine intelligence approaches for mental health issues and depression detection

Identifying Psychological Symptoms Based on Facial Movements

Contact Info

Product

Resources

About