DepNet: An automated industrial intelligent system using deep learning for video‐based depression analysis

He, Lang; Guo, Chenguang; Tiwari, Prayag; Su, Rui; Pandey, Hari Mohan; Dang, Wei

doi:10.1002/int.22704

Cited by 20 publications

(19 citation statements)

References 53 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…LR up LR enh (8) Overall, the cross-scale residual feature fusion module developed in this paper possesses powerful texture feature characterization capability and enhanced discriminativeness of the network without the involvement of prior knowledge. In addition, mining the patchlevel facial semantic correlation of multiple reference sources makes the results more convincing.…”

Section: External-mining Modulementioning

confidence: 97%

“…It is widely acknowledged that the content and structure of face images naturally exhibit nonlocal resemblance and symmetry, 6,8,40 including left and right eyes, nose, brows, upper and lower lips, etc. The semantic components of the same identity may, however, deviate greatly in different circumstances due to factors like expression, posture, and multiple viewpoints, but the semantic regions of faces that are not the same identity show a high degree of similarity.…”

Section: External-mining Modulementioning

confidence: 99%

“…However, under controlled conditions (such as outdoor video surveillance), the captured facial images usually have a very low‐resolution (LR) with different illumination conditions and arbitrary poses 3 . In reality, there are many uncertainties causing different degrees of image quality degradation, which leads to severe deterioration of the performance in a wide range of practical face‐analysis applications 4–9 . Therefore, it is essential to create a face super‐resolution model that is more practical.…”

Section: Introductionmentioning

confidence: 99%

“…3 In reality, there are many uncertainties causing different degrees of image quality degradation, which leads to severe deterioration of the performance in a wide range of practical face-analysis applications. [4][5][6][7][8][9] Therefore, it is essential to create a face super-resolution model that is more practical.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Face hallucination using multisource references and cross‐scale dual residual fusion mechanism

Wang

Jian

et al. 2022

Int J of Intelligent Sys

View full text Add to dashboard Cite

There is an increasing interest in enhancing the quality of low-resolution (LR) facial images for various social life applications. Existing methods often use domainspecific prior knowledge, which is effective in improving the face super-resolution model's performance.However, it is challenging to obtain rich and accurate prior information from LR inputs in real-world scenarios, which can limit the robustness and generalization ability of the developed face super-resolution model. In this paper, a multisource reference-based face super-resolution Network, namely MSRNet, is proposed. Without considering the prior knowledge of faces, the network can reconstruct a LR face image with a magnitude factor of 8 under the guidance of multiple reference face images of different identities.By constructing an "appearance-alike" reference data set Face_Ref, the designed MSRNet aims to fully exploit the local and spatially similar high frequency information between the distinct references and the current face. More specifically, to effectively combine the information from multiple references, a cross-scale and cross-space feature fusion mechanism is introduced for external and internal references, and then the enhanced local semantics are finally incorporated into the high-resolution face reconstruction. The

show abstract

Section: External-mining Modulementioning

confidence: 97%

Section: External-mining Modulementioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Face hallucination using multisource references and cross‐scale dual residual fusion mechanism

Wang

Jian

et al. 2022

Int J of Intelligent Sys

View full text Add to dashboard Cite

show abstract

“…RenderX depression estimation from audio, video, and text information [22,23]. Thus, the possibility to "see" mental disorders is, per se, an innovative technology.…”

Section: Xsl • Fomentioning

confidence: 99%

Ethical Implications of the Use of Language Analysis Technologies for the Diagnosis and Prediction of Psychiatric Disorders

Loch¹,

Lopes-Rocha²,

Ara³

et al. 2022

JMIR Ment Health

View full text Add to dashboard Cite

Recent developments in artificial intelligence technologies have come to a point where machine learning algorithms can infer mental status based on someone’s photos and texts posted on social media. More than that, these algorithms are able to predict, with a reasonable degree of accuracy, future mental illness. They potentially represent an important advance in mental health care for preventive and early diagnosis initiatives, and for aiding professionals in the follow-up and prognosis of their patients. However, important issues call for major caution in the use of such technologies, namely, privacy and the stigma related to mental disorders. In this paper, we discuss the bioethical implications of using such technologies to diagnose and predict future mental illness, given the current scenario of swiftly growing technologies that analyze human language and the online availability of personal information given by social media. We also suggest future directions to be taken to minimize the misuse of such important technologies.

show abstract

A deep learning model for depression detection based on MFCC and CNN generated spectrogram features

Das,

Naskar

2024

Biomedical Signal Processing and Control

View full text Add to dashboard Cite

DepNet: An automated industrial intelligent system using deep learning for video‐based depression analysis

Cited by 20 publications

References 53 publications

Face hallucination using multisource references and cross‐scale dual residual fusion mechanism

Face hallucination using multisource references and cross‐scale dual residual fusion mechanism

Ethical Implications of the Use of Language Analysis Technologies for the Diagnosis and Prediction of Psychiatric Disorders

A deep learning model for depression detection based on MFCC and CNN generated spectrogram features

Contact Info

Product

Resources

About