Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Asma M, Dharini K M, Sheetal Shree P, Vennela C S, Prof. Geetha L S
DOI Link: https://doi.org/10.22214/ijraset.2022.45983
Certificate: View Certificate
The rapid development of object detection algorithm as led to its widespread application in security, such as facial recognition and crowd surveillance. However, real-time tracking of an individual is very challenging, especially in crowded places where the person might be in part or entirely occluded for some period. Hence, this paper objective is to create abnormal activity detection in public places focusing only on people. This system does not just detect a person in real-time but in addition, uses the information it as learned to track the trajectory of the person until they exit the frame of the camera. The system uses the algorithm called YOLO for the person detection. The system was able to successfully detect the abnormal activity in public places and detect crowded group of people in live.
I. INTRODUCTION
The surveillance methods can include observation from a distance by means of electronic equipment such as closed-circuit television (CCTV) cameras, or interception of electronically transmitted information such as Internet traffic or phone calls, and it can include simple, relatively low-technology methods such as human intelligence agents and postal interception. Many organizations and people are deploying video surveillance systems at their locations with Closed Circuit TV (CCTV) cameras for better security. The captured video data is useful to prevent the threats before the crime actually happens. These videos also become a good forensic evidence to identify criminals after the occurrence of crime. Traditionally, the video feed from CCTV cameras is monitored by human operators. These operators monitor multiple screens at a time searching for anomalous activities. This is an expensive and inefficient way of monitoring. Also, concentration of an operator will reduce drastically as time passes. One of the methods to cope with this problem is to use automated video surveillance systems (video analytics) instead of human operators. Such a system can monitor multiple screens simultaneously without the disadvantage of dropping concentration.
Analysis of crowd behavior has become a popular research field in recent years. Crowd behavior analysis can be utilized in variety of applications for example automatic detection of panic and escape behavior as a result of violence, riots, natural disasters.
The denser a crowd, the more vulnerable it is to safety hazards, and dangers in public places will cause more casualties and greater economic losses than in other environments. Terrorist attacks in recent years have continually impacted the bottom line of security. The losses caused by these events are much more than personnel and economic losses, and have resulted in more doubts about current protection capabilities in the security field. As the main means of abnormal warning and detection, researchers must enter a new development stage for the diversification of monitoring functions. The in-depth analysis of scene information and potential risks is also a hotspot for future development by scholars. Determining how to effectively monitor public crowds, and how to effectively identify and even predict safety hazards, has become a primary goal of researchers in the field of intelligent monitoring.
Generally, it is challenging to find effective features in crowd, since people in the crowd may be positioned at different locations and may move in diverse directions. We introduce a novel method for abnormal crowd event detection in surveillance videos. Particularly, our work focuses on panic and escape behavior detection that may appear because of violent events and natural disasters and detection of group in public places.
II. RELATED WORK
Muhamad Izham Hadi Azhar[1] Here the objective is to create a people tracking system in crowd surveillance, using Deep SORT framework. Unlike object detection frameworks like CNN, this system does not just detect a person in real-time but on top of that, uses the information it has learned to track the trajectory of the person until they exit the frame of the camera.
The system will use You Only Look Once (YOLO) for the person detection, and then use Deep SORT to process the detected person frame by frame to predict its movement path. This paper proposes a people tracking system in real-time using YOLO and Deep SORT algorithm. The tiny YOLOv3 and custom YOLOv3 dataset is reduced due to the detection part redetecting the same subject and assign it with another id. This system aims was to be able to keep tracking the individual even occlusion occur. This system can be beneficial especially for maintaining security in places with high rates of individuals going in and out the place. The outcome show that the tracking could be improve by providing a reliable and accurate dataset.
Franjo Matkovic [2] It presents an approach to crowd behaviour recognition in surveillance videos. The approach is based on a 4-stage pipelined multi-person tracker adapted to microscopic crowd level representation and crowd behaviour recognition by the evaluation of fuzzy logic functions. The multi-person tracker combines a CNNbased detector and an optical flow-based tracker.
Arun Kumar Jhapate [3] The motion influence map can be drawn easily in python and the pixel level presentation is often easier. Motion requires some python packages to run effectively with high level of accuracy of true recognition. Python also makes it easy to coding and testing simultaneously by adopting the test driven development (TDD) approach. There is a huge library for python as well as for Open CV which allow users to implement a system with fewer codes. Most of the smart phone applications develop in python that interact intellectually with human.
Dong-Gyu Lee [4] Detect abnormal crowd behaviour using motion history image and optical flow technique optical flow: detection using a angle between the optical floor of previous and current frames as the feature for the frames, this feature are later used by sum to detect normal crowd behaviour from abnormal crowd behaviour. This methodology can be classified into to global abnormal crowd detection methods because the group behaviour of the global is abnormal Optical flow for or each frame is calculated using lucas - kanade algorithm. Lucac-kanade algorithm: is simple technique which can provide an estimate of the moment of interesting feature in successive images of a scene.
Rahul Chauhan[5] The article discusses various aspects of deep learning, CNN in particular and performs image recognition and detection on MNIST and CIFAR -10 datasets using CPU unit only. The accuracy of MNIST is good but the accuracy of CIFAR-10 can be improved by training with larger epochs and on a GPU unit.
III. PROPOSED WORK
A. Pre-trained YOLO Model with opencv
The script requires four input arguments.
This particular model is trained on COCO dataset (common objects in context) from Microsoft. It is capable of detecting 80 common objects. Read the input image and get its width and height. Read the text file containing class names in human readable form and extract the class names to a list. Generate different colors for different classes to draw bounding boxes. Read the weights and config file and creates the network. Generally in a sequential CNN network there will be only one output layer at the end. In the YOLO v3 architecture there are multiple output layers giving out predictions.
get_output layers()
function gives the names of the output layers. An output layer is not connected to any next layer.
draw_bounding_box()
function draws rectangle over the given predicted region and writes class name over the box. If needed, we can write the confidence value too.
We need go through each detection from each output layer to get the class id, confidence and bounding box corners and more importantly ignore the weak detections (detections with low confidence value).Even though we ignored weak detection's, there will be lot of duplicate detections with overlapping bounding boxes.
Non-max suppression removes boxes with high overlapping. Finally we look at the detections that are left and draw bounding boxes around them and display the output frame. We used the centroid tracker to associate the person 1 and person 2, if a trackable person exists for the current person id, we use it for group detection and speed estimation of the person.
Firstly we will check calculate the distance between the centroid of one frame to the centroid of another frame convert the distance into ppm. Using obtained distance and Frame Size we will calculate the speed of a person. If speed increases for certain threshold it displays a message as Abnormal Activity.
a. Reading input video
b. Loading YOLO v3 Network
c. Reading frames in the loop
d. Getting blob from the fram3e
e. Implementing Forward Pass
f. Getting Bounding Boxes
g. Non-maximum Suppression
h. Drawing Bounding Boxes with Labels
i. -Writing processed frames
It has the following parameters:
B. Speed Detection
def estimateSpeed(location1, location2, ppm, fs):
d_pixels = math.sqrt(math.pow(location2[0] - location1[0], 2) + math.pow(location2[1] - location1[1], 2))
d_meters = d_pixels/ppm
speed = d_meters*fs
return speed
C. Group Detection
Distance matrix contains the distances computed pairwise between the vectors of matrix/matrices. scipy.spatial package provides us distance_matrix() method to compute the distance matrix. Generally matrices are in the form of 2-D array and the vectors of the matrix are matrix rows ( 1-D array). The object tracker is responsible for keeping track of which object is which by assigning and maintaining identification numbers (IDs). This object tracking which we’re implementing is called centroid tracking as it relies on the Euclidean distance between existing object centroids (i.e., objects the centroid tracker has already seen before) and new object centroids between subsequent frames in a video. The centroid tracking algorithm is a multi-step process.
The five steps include:
Step #1: Accept bounding box coordinates and compute centroids
Step #2: Compute Euclidean distance between new bounding boxes and existing objects
Step #3: Update (x, y)-coordinates of existing objects
Step #4: Register new objects
Step #5: Deregister old objects
The current proposed system is able to recognize human activity in public and analyze whether the action is normal or abnormal by detecting the change in speed of each person in the frame and a buzzer is invoked immediately. In addition, the proposed system finds if there are any group detected in the input video. A gathering of 4 people is considered as group and displays the number of people gathered. This also invokes a buzzer when number of people gathered reaches the given threshold value. This application can be implemented in any restricted areas and in crowd security surveillance so that immediate action can be taken. Further to the proposed system features like detection of abnormal activity using facial expressions can be added. Also since we are focusing only on human being ,it can be applicable to any other object according to the requirements. More features like pause and play of the output video can also be added as additional feature.
[1] Arun Kumar Jhapate, Sunil Malviya, Monika “Unusual Crowd Activity Detection using OpenCV and Motion Influence Map”,2020 [2] Dong-Gyu Lee, Hrung-II Suk, Sung-Kee Park, Seong-Whan Lee “Motion Influence Map for Unsual Human Activity Detection”, 2015 [3] Shubham Lahiri, Nikhil Jyoti, Sohil Pyati, Jaya Dewan “Abnormal Crowd Behavior Detection”,2018 [4] Franjo Matkovic, Darijin, Marcetic, Slobodan Ribaric “Abnormal Crowd Behaviour Recognition in Surveillance Videos”, 2019 [5] Aiquan Li, Shuqiang Guo, Qianlong Bai, Song Gao, Yaoyao Zhang “An Analysis Method of Crowd Abnormal Behavior for Video Service Robot”, 2016 [6] Muhamad Izham Hadi Azhar, Fadhlan Hafizhelmi Kamaru Zaman, Habibah Hashim, Nnooritawati Md. Tahir “People Tracking System” ,2020 [7] Rahul Chauhan, Kamal Kumar Ghanshala, R.C. Joshi “Convolutional Neural Network (CNN) for Image Detection and Recognition”, 2018
Copyright © 2022 Asma M, Dharini K M, Sheetal Shree P, Vennela C S, Prof. Geetha L S. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET45983
Publish Date : 2022-07-25
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here