Using autoencoders to reconstruct current video frames or to predict future frames is a popular approach to weakly supervised video anomaly detection. Recent studies have shown, however, that a memory module introduced into an autoencoder can capture only a limited number of normal patterns and therefore cannot cope with new scenarios in the test set. This paper proposes a dual-stream, multi-level memory-enhanced conditional variational autoencoder (TS-MemCVAE), which takes RGB frames and optical-flow images as dual-stream input and adds memory modules at the bottleneck. The memory modules store normal-pattern features at multiple scales, and with the aid of optical-flow information the model sensitively identifies abnormal behaviors, which produce large reconstruction errors. The model consists of two parts: a multi-level memory-enhanced autoencoder, which reconstructs the input video, and a conditional variational autoencoder, which captures the strong correlation between the reconstructed video and the optical-flow images and makes further predictions. The model is validated on two benchmark datasets, UCSD Ped2 and CUHK Avenue, achieving AUC scores of 95.83% and 84.16%, respectively; this performance demonstrates the model's effectiveness.
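The abstract does not specify how the memory modules at the bottleneck operate. A minimal NumPy sketch of one common design for such a module (soft cosine-similarity addressing over learned memory items, as popularized by MemAE-style models) is shown below; the function name, shapes, and addressing scheme are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def memory_read(z, memory):
    """Illustrative memory-module read (not the paper's exact method).

    z:      (D,)  bottleneck feature of the encoder.
    memory: (M, D) learned memory items storing normal-pattern features.

    Returns an attention-weighted combination of memory items. Inputs
    that resemble no stored normal pattern retrieve a poor substitute,
    so the decoder yields a large reconstruction error for anomalies.
    """
    # Cosine similarity between the query and every memory slot
    sims = memory @ z / (
        np.linalg.norm(memory, axis=1) * np.linalg.norm(z) + 1e-8
    )
    # Softmax addressing weights (numerically stabilized)
    w = np.exp(sims - sims.max())
    w /= w.sum()
    # Read step: weighted sum of memory items replaces the raw latent
    return w @ memory

# Toy usage with random features
rng = np.random.default_rng(0)
memory = rng.normal(size=(10, 4))   # 10 memory slots, 4-dim features
z = rng.normal(size=4)              # one bottleneck feature
z_hat = memory_read(z, memory)      # feature passed on to the decoder
```

In a multi-level variant such as the one described here, a read of this kind would be applied at several feature scales rather than at a single bottleneck.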