2017
DOI: 10.1007/978-3-319-68548-9_70
|View full text |Cite
|
Sign up to set email alerts
|

Deep Appearance Features for Abnormal Behavior Detection in Video

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
63
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 95 publications
(71 citation statements)
references
References 25 publications
0
63
0
Order By: Relevance
“…We evaluate our approach in comparison with a series of state-of-the-art methods [6,7,9,11,12,13,15,21,22,23,24,25,26,27,28,31,32,33,36,37,38] on the Avenue, the ShanghaiTech, the UCSD Ped2 and the UMN data sets. The corresponding results are presented in Table 1.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…We evaluate our approach in comparison with a series of state-of-the-art methods [6,7,9,11,12,13,15,21,22,23,24,25,26,27,28,31,32,33,36,37,38] on the Avenue, the ShanghaiTech, the UCSD Ped2 and the UMN data sets. The corresponding results are presented in Table 1.…”
Section: Resultsmentioning
confidence: 99%
“…Several abnormal event detection approaches [5,6,9,23,29] learn a dictionary of atoms representing normal events during training, then label the events not represented in the dictionary as abnormal. Some recent approaches have employed locality sensitive hashing [38] and deep learning [11,12,21,24,27,28,31,33,36,37] to achieve better results. For instance, Smeureanu et al [33] employed a one-class Support Vector Machines (SVM) model based on deep features provided by convolutional neural networks (CNN) pre-trained on the ILSVRC benchmark [30], while Ravanbakhsh et al [27] combined pre-trained CNN models with low-level optical-flow maps.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Typical structures of image reconstruction and translation are usually employed and the difference between their output and ground truth is used to indicate the frame-level score [11,37,25]. Some researchers apply pretrained classification models (such as VGG [41]) to extract useful features from input videos [42,16]. Results of object detection and/or foreground estimation are also used for the determination of anomalous events in [14,51].…”
Section: Deep Learningmentioning
confidence: 99%