Integrated Multiscale Appearance Features and Motion Information Prediction Network for Anomaly Detection

Comput Intell Neurosci. 2021 Oct 20;2021:6789956. doi: 10.1155/2021/6789956. eCollection 2021.

Abstract

The rise of video-prediction algorithms has greatly advanced anomaly detection in video surveillance for smart cities and public security. However, most current methods rely on single-scale information to extract appearance (spatial) features and neglect motion (temporal) continuity between video frames. This discards part of the spatiotemporal information that is highly predictive of future frames, reducing the accuracy of anomaly detection. We therefore propose a novel prediction network to improve anomaly-detection performance. Because the objects in each video appear at various scales, we use a hybrid dilated convolution (HDC) module with different receptive fields to extract detailed appearance features. Meanwhile, a deeper bidirectional convolutional long short-term memory (DB-ConvLSTM) module captures the motion information between consecutive frames. Furthermore, we replace the optical flow loss with an RGB difference loss as the temporal constraint, which greatly reduces the time spent on optical flow extraction. Experiments show that, compared with state-of-the-art methods on the anomaly-detection task, our method detects abnormalities more accurately across various video-surveillance scenes.

MeSH terms

  • Algorithms*
  • Cities
  • Memory, Long-Term*
  • Rotation