2021
DOI: 10.1002/int.22704
|View full text |Cite
|
Sign up to set email alerts
|

DepNet: An automated industrial intelligent system using deep learning for video‐based depression analysis

Abstract: As a common mental disorder, depression has attracted many researchers from affective computing field to estimate the depression severity. However, existing approaches based on Deep Learning (DL) are mainly focused on single facial image without considering the sequence information for predicting the depression scale. In this paper, an integrated framework, termed DepNet, for automatic diagnosis of depression that adopts facial images sequence from videos is proposed. Specifically, several pretrained models ar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
14
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 20 publications
(19 citation statements)
references
References 53 publications
0
14
0
Order By: Relevance
“…LR up LR enh (8) Overall, the cross-scale residual feature fusion module developed in this paper possesses powerful texture feature characterization capability and enhanced discriminativeness of the network without the involvement of prior knowledge. In addition, mining the patchlevel facial semantic correlation of multiple reference sources makes the results more convincing.…”
Section: External-mining Modulementioning
confidence: 97%
See 3 more Smart Citations
“…LR up LR enh (8) Overall, the cross-scale residual feature fusion module developed in this paper possesses powerful texture feature characterization capability and enhanced discriminativeness of the network without the involvement of prior knowledge. In addition, mining the patchlevel facial semantic correlation of multiple reference sources makes the results more convincing.…”
Section: External-mining Modulementioning
confidence: 97%
“…It is widely acknowledged that the content and structure of face images naturally exhibit nonlocal resemblance and symmetry, 6,8,40 including left and right eyes, nose, brows, upper and lower lips, etc. The semantic components of the same identity may, however, deviate greatly in different circumstances due to factors like expression, posture, and multiple viewpoints, but the semantic regions of faces that are not the same identity show a high degree of similarity.…”
Section: External-mining Modulementioning
confidence: 99%
See 2 more Smart Citations
“…RenderX depression estimation from audio, video, and text information [22,23]. Thus, the possibility to "see" mental disorders is, per se, an innovative technology.…”
Section: Xsl • Fomentioning
confidence: 99%