A Transformer-based model for abnormal activity recognition in video

Document Type : Computer Article

Authors

1 Faculty of Electrical and Computer Engineering, Semnan University, Semnan

2 سمنان

3 Semnan University

Abstract

Given the increasing daily volume of videos generated by security cameras in personal and public spaces, monitoring the activities present in videos has become crucial. Many video surveillance systems are designed to verify performance accuracy and provide alerts during the occurrence of abnormal activities. In this regard, various intelligent models have been proposed for detecting activities in videos. Considering recent advances in artificial intelligence, particularly deep learning, this paper introduces a model based on the Transformer network. To reduce computational complexity, keypoints of the human body are utilized in this approach. Fifteen key body points are input into the Transformer model, leveraging parallel processing during training and a self-attention mechanism. This enhances the speed and accuracy of the model. Experimental results on the JHMDB public database indicate an improvement in the accuracy of detecting abnormal activities compared to baseline models.

Keywords: Video processing, Video surveillance, Abnormal activities, Deep learning, Transformer Network.

Keywords

Main Subjects


Volume 22, Issue 76
May 2024
Pages 213-221
  • Receive Date: 07 January 2024
  • Revise Date: 02 February 2024
  • Accept Date: 16 February 2024