2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00799
|View full text |Cite
|
Sign up to set email alerts
|

Face-Focused Cross-Stream Network for Deception Detection in Videos

Abstract: Automated deception detection (ADD) from real-life videos is a challenging task. It specifically needs to address two problems: (1) Both face and body contain useful cues regarding whether a subject is deceptive. How to effectively fuse the two is thus key to the effectiveness of an ADD model. (2) Real-life deceptive samples are hard to collect; learning with limited training data thus challenges most deep learning based ADD models. In this work, both problems are addressed. Specifically, for face-body multimo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
27
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 40 publications
(28 citation statements)
references
References 53 publications
(129 reference statements)
1
27
0
Order By: Relevance
“…Our unsupervised visual SA model performed comparably to existing fully-supervised, automated approaches that used the same dataset (ACC 75% [11], 77% [13], 79% [12]; AUC 70% [10]). While some prior fully-supervised, automated approaches [21,15,20] outperformed our unsupervised SA, our findings support the potential for introducing unsupervised SA to address the data scarcity problem of modeling high-stakes deception. Our results support our hypothesis that audio-visual representations of low-stakes deception in lab-controlled situations can be leveraged by SA to detect high-stakes deception in real-world situations.…”
Section: Resultssupporting
confidence: 60%
“…Our unsupervised visual SA model performed comparably to existing fully-supervised, automated approaches that used the same dataset (ACC 75% [11], 77% [13], 79% [12]; AUC 70% [10]). While some prior fully-supervised, automated approaches [21,15,20] outperformed our unsupervised SA, our findings support the potential for introducing unsupervised SA to address the data scarcity problem of modeling high-stakes deception. Our results support our hypothesis that audio-visual representations of low-stakes deception in lab-controlled situations can be leveraged by SA to detect high-stakes deception in real-world situations.…”
Section: Resultssupporting
confidence: 60%
“…In addition, Soldner et al [32] introduced dialogue features, consisting of interaction cues. Other multi-modal approaches combined the previously mention verbal and non-verbal features together with micro-expressions [3][4][5], thermal imaging [33], or spatio-temporal features extracted from 3D CNNs [34,35].…”
Section: Related Workmentioning
confidence: 99%
“…In the last decade, there has been a growing interest in the use of facial images to perform lie detection, often based on micro-expressions [3][4][5]13,15] or facial action units [14], achieving the current state-of-the-art accuracy. Table 1 below shows an overview of the major related works outlined in this section.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations