Real-Time Video Surveillance and Violent Crowd Behavior Detection

Real-Time Video Surveillance and Violent Crowd Behavior Detection

Automatically detect violent events from the vast amounts of surveillance video data.

Business Challenge

The worldwide utilization of surveillance cameras in smart cities has enabled researchers to analyse a gigantic volume of data to ensure automatic monitoring. An enhanced security system in smart cities, schools, hospitals, and other surveillance domains is mandatory for the detection of violent or abnormal activities to avoid any casualties which could cause social, economic, and ecological damages.

Therefore, the need arises for Real-Time monitoring of crowded events and outbreak of violence. Human surveyor to monitor multiple video screens simultaneously and detect crowds of people in a constantly changing sea of activity is a daunting task. It becomes even more challenging to identify any signs of violence and rule breaking at an early enough stage to alert help beforehand.

A change by Excelledia

When AI-based tools are used for almost everything, why not use the same to maintain law and order? Excelledia thought just that and came up with the perfect and unique solution to counter the business challenge faced. This is a system that analyses crowd activity and, in turn, detects, localizes, and potentially predicts abnormal crowd behaviour and violent activities in them.


It utilizes deep learning-based computer vision models to perform the following:

  • To determine crowd formation,
  • To assess the variation in the distance between the individuals in subsequent frames. The distance is measured based on a previously calibrated pixel-to-real-distance conversion factor.

The model compares the temporal segments in the surveillance videos and assesses the variations in the frames. On noting unusual behaviour amongst the crowd for a certain period, it raises a flag and intimidates the security officials. The model’s final objective is to predict crowd behaviour and inform the concerned officials accordingly.

Our tool is resource-friendly and addresses security concerns through real-time surveillance footage.

The system aims to ease the burden on law-and-order enforcement and focus on more human-centric actions, like dispersal of crowd or arrests. The algorithms used in our tool can also be applied for COVID-19 social distancing norms as well, that would be especially helpful in places like hospitals, malls etc.



  • The system involves a series of modules such as object detection, human motion analysis and classification of identified motion patterns into categories like violent or non-violent.
  • The video is extracted into frames
  • Next, the frames are preprocessed and after that, feature extraction is applied to detect and localize humans using the YOLO convolutional neural network algorithm.
  • Those results are used to calculate pairwise distances between centroids that are fed into the classification module.
  • The last classification process is performed by counting the occurrence of abnormal distance variance and classifying it as violence if it exceeds a threshold.
  • Finally, in the Violent crowd behavior detection app, the video is uploaded where it is classified as violent behavior or non-violent behavior.

Jithin Prem is a digital marketer and aspiring writer who enjoys writing thought-provoking blogs, articles, and marketing articles. His content is easy to interact with and connect with, providing readers an incentive to stick around. Football, travelling, and movies keep his interests and enthusiasm alive, in addition to inventing the latest marketing techniques and learning new things.

Jithin Prem

Leave a Reply