Spatio-Temporal Unity Networking for Video Anomaly Detection

Li, Yuanyuan; Yuan-hu, Cai; Liu, Jiaqi; Lang, Shuyan; Zhang, Xinfeng

doi:10.1109/access.2019.2954540

Cited by 47 publications

(26 citation statements)

References 20 publications


“…The input pattern of our model is different from the current methods that stack T sequential frames together into the model. Among these methods, the T frames are linked to each corresponding channel in the first output feature data, resulting in the collapse of temporal information [22]. Thus, we feed T frames into the encoder orderly to generate corresponding feature maps.…”

Section: Proposed Methods (mentioning)

confidence: 99%

Spatio-temporal prediction and reconstruction network for video anomaly detection

et al. 2022


The existing anomaly detection methods can be divided into two popular models based on reconstruction or future frame prediction. Due to the strong learning capacity, reconstruction approach can hardly generate significant reconstruction errors for anomalies, whereas future frame prediction approach is sensitive to noise in complicated scenarios. Therefore, a solution has been proposed by balancing the merits and demerits of the two models. However, most methods relied on single-scale information to capture spatial features and lacked temporal continuity between the video frames, affecting anomaly detection accuracy. Thus, we propose a novel method to improve anomaly detection performance. Because of the objects of various scales in each video, we select different receptive fields to extract comprehensive spatial features by the hybrid dilated convolution (HDC) module. Meanwhile, the deeper bidirectional convolutional long short-term memory (DB-ConvLSTM) module can remember the temporal information between the consecutive frames. Experiments prove that our method can detect abnormalities in various video scenes more accurately than the state-of-the-art methods in the anomaly-detection task.
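The input scheme described in the citation statement above (feeding T frames through the encoder one at a time instead of stacking them) can be illustrated with a toy sketch. This is not the citing paper's implementation: the `conv1x1` helper, the frame values, and the weights are hypothetical, and a real encoder would use learned 2-D convolutions. The sketch only shows why a stacked input loses its time axis after the first layer.

```python
def conv1x1(channels, weights):
    """Toy 1x1 convolution: per pixel, a weighted sum over the input channels."""
    n_pixels = len(channels[0])
    return [sum(w * ch[i] for w, ch in zip(weights, channels))
            for i in range(n_pixels)]

T = 3
# T toy grayscale frames with 4 pixels each; frame t has constant value t + 1.
# (Hypothetical data, chosen only to keep the arithmetic easy to follow.)
frames = [[float(t + 1)] * 4 for t in range(T)]

# Stacked input: the T frames act as T input channels, so a single output
# channel already mixes all time steps and no time axis remains.
stacked_map = conv1x1(frames, weights=[0.5, 0.3, 0.2])

# Frame-by-frame input: the same (shared) weight is applied to each frame,
# giving one feature map per time step for a recurrent module to consume.
per_frame_maps = [conv1x1([frame], weights=[0.5]) for frame in frames]

print(len(stacked_map), len(per_frame_maps))
```

In the stacked case the result is a single 4-pixel map, while the frame-by-frame case keeps T separate maps in temporal order, which is the form a module such as DB-ConvLSTM would take as input.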


“…The input mode of our network is different from existing methods that conventionally stack T consecutive frames together into a network. In these methods, all the T frames are connected to each channel in the first output feature map, which results in the collapse of temporal information [29]; thus, we input T frames into the encoder network one by one to generate corresponding feature maps. As shown in Figure 4, the DB-ConvLSTM structure includes a shallow forward layer and a deeper backward layer.…”

Section: Proposed Methods (mentioning)

confidence: 99%

Integrated Multiscale Appearance Features and Motion Information Prediction Network for Anomaly Detection

Liu, Zhang, Wei 2021

Computational Intelligence and Neuroscience


The rise of video-prediction algorithms has largely promoted the development of anomaly detection in video surveillance for smart cities and public security. However, most current methods relied on single-scale information to extract appearance (spatial) features and lacked motion (temporal) continuity between video frames. This can cause a loss of partial spatiotemporal information that has great potential to predict future frames, affecting the accuracy of abnormality detection. Thus, we propose a novel prediction network to improve the performance of anomaly detection. Due to the objects of various scales in each video, we use different receptive fields to extract detailed appearance features by the hybrid dilated convolution (HDC) module. Meanwhile, the deeper bidirectional convolutional long short-term memory (DB-ConvLSTM) module can remember the motion information between consecutive frames. Furthermore, we use RGB difference loss to replace optical flow loss as temporal constraint, which greatly reduces the time for optical flow extraction. Compared with the state-of-the-art methods in the anomaly-detection task, experiments prove that our method can more accurately detect abnormalities in various video surveillance scenes.
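The abstract's point about using different receptive fields through hybrid dilated convolution can be made concrete with a small calculation. The sketch below assumes a stride-1 stack of 3-tap dilated convolutions viewed along one axis. The dilation rates (1, 2, 5) and (2, 2, 2) are illustrative assumptions, not values taken from the abstract, and both helper functions are hypothetical.

```python
from itertools import product

def receptive_field(kernel_size, dilations):
    """Span along one axis of a stride-1 stack of dilated convolutions."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)

def covered_offsets(dilations):
    """Input offsets that reach one output position through a stack of
    3-tap dilated convolutions (taps at -d, 0, +d in each layer)."""
    return sorted({sum(a * d for a, d in zip(taps, dilations))
                   for taps in product((-1, 0, 1), repeat=len(dilations))})

# Assumed mixed rates: the span is wide and every offset inside it is used.
mixed = (1, 2, 5)
print(receptive_field(3, mixed), len(covered_offsets(mixed)))

# Assumed uniform rates: only even offsets are used, leaving gaps in the span.
uniform = (2, 2, 2)
print(receptive_field(3, uniform), len(covered_offsets(uniform)))
```

Under these assumptions the mixed rates span 17 positions and use all of them, while the uniform rates span 13 positions but use only 7, which is one common motivation for mixing dilation rates when objects appear at several scales.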


“…During testing, two networks take each frame as input and the outputs jointly determine whether it is novel or not. Li et al [29] propose a new anomaly score function and a spatio-temporal framework combined by U-net and adversarial learning. Similarly, Dong et al [30] propose a new approach with a dual discriminator-based generative adversarial network and U-net structure.…”

Section: B Aae-based Methods (mentioning)

confidence: 99%