2020
DOI: 10.1007/978-3-030-58577-8_20

Not only Look, But Also Listen: Learning Multimodal Violence Detection Under Weak Supervision

Citations: cited by 186 publications (160 citation statements)
References: 45 publications
“…Modifying the proposed neural network to enhance its real-time characteristic is a direction for future work. [32] 30.77 Sultani et al [12] 73.20 Wu et al [15] 78.64 Ours (master branch) 81.28 Ours (combination) 81.69…”
Section: Discussion (mentioning)
confidence: 99%
“…The overall architecture of our proposed neural network is shown in Figure 1. Following [15], visual features and audio features are first extracted from the pretrained models I3D [18] and VGGish [19], respectively. After that, the two types of features are fused by three different modules shown in Figure 1.…”
Section: Methods (mentioning)
confidence: 99%