Video Data Analytics for Smart City Applications: Methods and Trends -  - E-Book

Video Data Analytics for Smart City Applications: Methods and Trends E-Book

0,0
44,81 €

-100%
Sammeln Sie Punkte in unserem Gutscheinprogramm und kaufen Sie E-Books und Hörbücher mit bis zu 100% Rabatt.
Mehr erfahren.
Beschreibung

Video data analytics is rapidly evolving and transforming the way we live in urban environments. Video Data Analytics for Smart City Applications: Methods and Trends, data science experts present a comprehensive review of the latest advances and trends in video analytics technologies and their extensive applications in smart city planning and engineering.

The book covers a wide range of topics including object recognition, action recognition, violence detection, and tracking, exploring deep learning approaches and other techniques for video data analytics. It also discusses the key enabling technologies for smart cities and homes and the scope and application of smart agriculture in smart cities.

Moreover, the book addresses the challenges and security issues in terahertz band for wireless communication and the empirical impact of AI and IoT on performance management. One contribution also provides a review of the progress in achieving the Jal Jeevan Mission Goals for institutional capacity building in the Indian State of Chhattisgarh.

For researchers, computer scientists, data analytics professionals, smart city planners and engineers, this book provides detailed references for further reading and demonstrates how technologies are serving their use-cases in the smart city. The book highlights the advances and trends in video analytics technologies and extensively addresses key themes, making it an essential resource for anyone looking to gain a comprehensive understanding of video data analytics for smart city applications.

Das E-Book können Sie in Legimi-Apps oder einer beliebigen App lesen, die das folgende Format unterstützen:

EPUB

Seitenzahl: 204

Veröffentlichungsjahr: 2009

Bewertungen
0,0
0
0
0
0
0
Mehr Informationen
Mehr Informationen
Legimi prüft nicht, ob Rezensionen von Nutzern stammen, die den betreffenden Titel tatsächlich gekauft oder gelesen/gehört haben. Wir entfernen aber gefälschte Rezensionen.



Table of Contents
BENTHAM SCIENCE PUBLISHERS LTD.
End User License Agreement (for non-institutional, personal use)
Usage Rules:
Disclaimer:
Limitation of Liability:
General:
FOREWORD
PREFACE
List of Contributors
Comprehensive Analysis of Video Surveillance System and Applications
Abstract
INTRODUCTION
LITERATURE REVIEW
OPEN CHALLENGES & ISSUES
DISCUSSION, TRENDS, AND FUTURE WORK
CONCLUSION
CONSENT FOR PUBLICATION
CONFLICT OF INTEREST
ACKNOWLEDGEMENT
REFERENCES
Compressed Video-Based Classification for Efficient Video Analytics
Abstract
INTRODUCTION
RELATED WORK
PROPOSED MODEL
COMPRESSION NETWORK
CLASSIFICATION NETWORK
EXPERIMENTS
Experimental Setup
Dataset
Implementation Details
Loss Function
Evaluation
EXPERIMENTAL RESULTS AND DISCUSSION
Experimental Results and Analysis of Compression Network
Experimental Results and Analysis of Classification Network
CONCLUSION
CONSENT FOR PUBLICATION
CONFLICT OF INTEREST
ACKNOWLEDGEMENT
REFERENCES
Object Detection and Tracking: Exploring a Deep Learning Approach and Other Techniques
Abstract
INTRODUCTION
OBJECT DETECTION AND TRACKING WITH CONVOLUTIONAL NEURAL NETWORK (CNN) AND IMAGE PROCESSING
Example of CNN Model Architectures for Object Detection
R-CNN
Limitations of R-CNN
Fast R-CNN
Advantages of Fast R-CNN
Faster R-CNN
Mask R-CNN
R-FCN
Single Shot Detector (SSD)
Advantages of SSD
You Only Look Once (YOLO)
DEVELOPMENT OF THE NEURAL NETWORK ARCHITECTURE
Hyperparameter Values For Each Model Layer
First Convolutional Layer
Second, Third, Fourth, and Fifth Convolutional Layers
OBJECT DETECTION WITH HAAR CASCADE CLASSIFIER
Haar Features Calculation
Integral Images
Adaboost Training
Haar Cascade Classifier Experiments for Face and Eyes Detection
CANNY EDGE DETECTION
Noise Reduction
Gradient Calculation
Non-Maximum Suppression
Double Threshold
Edge Tracking by Hysteresis
BACKGROUND REMOVAL
Background Generation
Background Modeling
Background Model Update
Foreground Detection
MOTION DETECTION USING KNN BACKGROUND SUBTRACTION
CONCLUSION
CONSENT FOR PUBLICATION
CONFLICT OF INTEREST
ACKNOWLEDGEMENT
REFERENCES
Introduction and Overview of Key Enabling Technologies for Smart Cities and Homes
Abstract
INTRODUCTION
TRENDS IN SMART CITIES AND HOMES
Smart cities
Smart Mobility and Smart Traffic Management
Smart Environment
Smart Living
Smart Economy
Smart Governance
Smart People
Smart Homes
Programmable and Zone-based Smart Thermostat
Wireless Power
Automatic Door Locks
Advanced Security System
CHALLENGES IN SMART CITIES AND HOMES
Security
IoT Challenges
Fragmentation of Standards
Processing big data
Scalability
SURVEY OF MAJOR KEY ENABLING TECHNOLOGIES FOR SMART CITIES AND HOMES
Internet of Things
Smart Dust
Smartphones
Cloud Computing
Smart Grid
SMART CITY DATA PLANE CHALLENGES
Compatibility Between Smart City Devices
Simplicity
Mobility and Geographic Control
SOFTWARE-DEFINED NETWORK-BASED SMART CITY NETWORK MANAGEMENT
Communications in Smart Grids
Purpose of the Smart Grid
The Different Segments of the Smart Grid
CONCLUSION
CONSENT FOR PUBLICATION
CONFLICT OF INTEREST
ACKNOWLEDGEMENT
REFERENCES
Intelligent Processing: Scope and Application of Smart Agriculture in Smart City
Abstract
INTRODUCTION
INTERNET OF THINGS (IoT)
INTELLIGENT PROCESSING
Intelligent Processing in Agriculture
INTELLIGENT PROCESSING MODEL
App Development
App Modules
Integration with sensors
Soil Moisture Sensor
Humidity and Temperature Sensor
Wind velocity Sensor
ADVANTAGES OF THE MODEL
CONCLUSION
CONSENT OF PUBLICATION
CONFLICT OF INTEREST
ACKNOWLEDGEMENTS
REFERENCES
Challenges and Security in Terahertz Band for Wireless Communication
Abstract
INTRODUCTION
CHALLENGES IN TERAHERTZ BAND
TERAHERTZ RADIATION SOURCES
TERAHERTZ COMMUNICATION SYSTEMS
CONCLUSION
CONSENT FOR PUBLICATION
CONFLICT OF INTEREST
ACKNOWLEDGEMENT
REFERENCES
Empirical Impact of AI and IoT on Performance Management: A Review
Abstract
INTRODUCTION
Contribution of this Study
LITERATURE REVIEW
Industry 4.0 Revolution Through AI and IoT
AI Intersects with IoT
Future Potential Growth of Performance Management Through AI and IoT
Theoretical Underpinning
RESEARCH METHODS
RESULT AND DISCUSSION
Systematic Review
Thematic Review
DISCUSSION
CONCLUSION AND FUTURE SCOPE
CONSENT FOR PUBLICATION
CONFLICT OF INTEREST
ACKNOWLEDGEMENT
REFERENCES
A Review of Progress Status in Achieving the Jal Jeevan Mission Goals in the State of Chhattisgarh
Abstract
INTRODUCTION
BACKGROUND OF DRINKING WATER POLICIES THROUGHOUT HISTORY
JAL JEEVAN MISSION
POLICY GUIDELINES AT CENTRE
DECENTRALISED MANAGEMENT OF WATER IN CHHATTISGARH
OBJECTIVES OF THE STUDY
RESEARCH METHODOLOGY
RESULTS AND DISCUSSION
Capacity Monitoring of the Project
Functional Household Tap Connections (FHTC)
Surveillance and Monitoring of Water Quality
Institutional Mechanism
NATIONAL LEVEL - NATIONAL JAL JEEVAN MISSION (NJJM)
State Level - State Water and Sanitation Mission (SWSM)
District Level - District Water and Sanitation Mission (DWSM)
Village Level - Gram Panchayat/Village Water & Sanitation Committee/ Pani Samiti
CONCLUSION
Author's Contribution
Funding
Institutional Review Board Statement
Data Availability Statement
CONSENT FOR PUBLICATION
CONFLICT OF INTEREST
ACKNOWLEDGEMENT
REFERENCES
IoT and Big Data Analytics
(Volume 1)
Video Data Analytics for Smart City Applications:
Methods and Trends
Edited by
Abhishek Singh Rathore
Department of Computer Science & Engineering,
Faculty of Science,
Shri Vaishnav Vidyapeeth Vishwavidyalaya,
Indore, India
Surendra Rahamatkar
Faculty of Engineering & Technology,
Faculty of Information Technology,
Amity School of Engineering & Technology, IQAC
Amity University- Chhattisgarh,
Raipur, India
Syed Imran Ali
Engineering Department, Computer Engineering,
University of Technology and Applied Sciences,
Al Musannah,
Sultanate of Oman
Ramgopal Kashyap
Department of Computer Science & Engineering,
Amity University Chhattisgarh,
Raipur, India
&
Nand Kishore Sharma
Department of Computer Science & Engineering,
Pranveer Singh Institute of Technology,
Kanpur, (U.P.),
India

BENTHAM SCIENCE PUBLISHERS LTD.

End User License Agreement (for non-institutional, personal use)

This is an agreement between you and Bentham Science Publishers Ltd. Please read this License Agreement carefully before using the book/echapter/ejournal (“Work”). Your use of the Work constitutes your agreement to the terms and conditions set forth in this License Agreement. If you do not agree to these terms and conditions then you should not use the Work.

Bentham Science Publishers agrees to grant you a non-exclusive, non-transferable limited license to use the Work subject to and in accordance with the following terms and conditions. This License Agreement is for non-library, personal use only. For a library / institutional / multi user license in respect of the Work, please contact: [email protected].

Usage Rules:

All rights reserved: The Work is the subject of copyright and Bentham Science Publishers either owns the Work (and the copyright in it) or is licensed to distribute the Work. You shall not copy, reproduce, modify, remove, delete, augment, add to, publish, transmit, sell, resell, create derivative works from, or in any way exploit the Work or make the Work available for others to do any of the same, in any form or by any means, in whole or in part, in each case without the prior written permission of Bentham Science Publishers, unless stated otherwise in this License Agreement.You may download a copy of the Work on one occasion to one personal computer (including tablet, laptop, desktop, or other such devices). You may make one back-up copy of the Work to avoid losing it.The unauthorised use or distribution of copyrighted or other proprietary content is illegal and could subject you to liability for substantial money damages. You will be liable for any damage resulting from your misuse of the Work or any violation of this License Agreement, including any infringement by you of copyrights or proprietary rights.

Disclaimer:

Bentham Science Publishers does not guarantee that the information in the Work is error-free, or warrant that it will meet your requirements or that access to the Work will be uninterrupted or error-free. The Work is provided "as is" without warranty of any kind, either express or implied or statutory, including, without limitation, implied warranties of merchantability and fitness for a particular purpose. The entire risk as to the results and performance of the Work is assumed by you. No responsibility is assumed by Bentham Science Publishers, its staff, editors and/or authors for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products instruction, advertisements or ideas contained in the Work.

Limitation of Liability:

In no event will Bentham Science Publishers, its staff, editors and/or authors, be liable for any damages, including, without limitation, special, incidental and/or consequential damages and/or damages for lost data and/or profits arising out of (whether directly or indirectly) the use or inability to use the Work. The entire liability of Bentham Science Publishers shall be limited to the amount actually paid by you for the Work.

General:

Any dispute or claim arising out of or in connection with this License Agreement or the Work (including non-contractual disputes or claims) will be governed by and construed in accordance with the laws of Singapore. Each party agrees that the courts of the state of Singapore shall have exclusive jurisdiction to settle any dispute or claim arising out of or in connection with this License Agreement or the Work (including non-contractual disputes or claims).Your rights under this License Agreement will automatically terminate without notice and without the need for a court order if at any point you breach any terms of this License Agreement. In no event will any delay or failure by Bentham Science Publishers in enforcing your compliance with this License Agreement constitute a waiver of any of its rights.You acknowledge that you have read this License Agreement, and agree to be bound by its terms and conditions. To the extent that any other terms and conditions presented on any website of Bentham Science Publishers conflict with, or are inconsistent with, the terms and conditions set out in this License Agreement, you acknowledge that the terms and conditions set out in this License Agreement shall prevail.

Bentham Science Publishers Pte. Ltd. 80 Robinson Road #02-00 Singapore 068898 Singapore Email: [email protected]

FOREWORD

Data is everything and nowadays, Big Data and Big Data Analytics are given more attention in the field of research. The smart city paradigm completely relies on data and smart frameworks. This framework comprehends the physical infrastructure, networking system, centralized computing center, data storage, and higher-level domain use-cases. The physical infrastructure requires sensors, devices, and cameras to capture and generate data. In the smart city, also known as an innovative city, the devices and sensors involved in smart city applications generate a massive amount of multi-media data especially Video Data. The involvement of multimedia sensors enables to obtain precise and concrete information. This book titled “IoT and Big Data Analytics” Vol. 1 (Video Data Analytics for Smart City Applications: Methods and Trends) aims to explore the various applications and use cases of Big Data & Analytics and processing for smart city applications. The technologies like Machine Learning, Deep Learning, IoT, WSN, IoT, and AI have been contributing to the application and efficient processing of data. Object detection and tracking, intelligent processing, video compression, the performance of IoT & AI in business growth, key enabling technologies and video analytics need to be explored more. The information processing is both challenging and interesting. The researchers are working with passion, and dedication to develop new methods and algorithms for information analysis and to provide a solution to remain in the new direction. The book will provide a valuable window and wide-ranging exposure to concerned technologies and their architectural frameworks for smart city applications. How these technologies are serving themselves to fulfilling the needs of a smart city is also discussed, for the research and industry personnel. This book is a good step in that direction and will surely help researchers to find relevant information in one place.

Vishal Nagar Department of Computer Science & Engineering, Pranveer Singh Institute of Technology, Kanpur, India

PREFACE

This book aims to explore the various applications and use cases of Video Data Analytics and processing for Smart City Applications. Everyday things are embedded together with the help of software, and sensors through the internet to collect and exchange data. Hence these devices generate a massive amount of data. The smart city paradigm entirely depends on data. In the context of video data, cameras capture huge data, and later these video recordings are effectively used in various smart city applications such as surveillance systems to counter potential threats. The generated data is heterogeneous and sparse in most cases. The processing of this Big Data in real-time is a matter of concern. With the participation of multimedia sensors, it is possible to obtain more precise and concrete information. Enormous challenges come in the data collection, analysis, and distribution.Various trending technologies like Machine Learning and Deep Learning are contributing to real-time data processing. Hence, these technologies should also be explored for efficient processing.

The video analytics field is continuously in evolution, because of speedy hardware and software progress that is making the technology more reachable and valued. To transform a large amount of video into actionable intelligence as complementary mechanisms, an effective algorithm and model are required. The book will provide comprehensive coverage of the latest and trending technologies like machine learning, deep learning, blockchain, AI, etc.

The book will highlight the advances and trends in various technologies by targeting the research and industry that are involved in video data analytics via addressing extensive themes. This book will surely demonstrate how technologies are serving their use-cases in the smart city. Interested researchers and academicians will get to know about the latest happenings and research, and more research possibilities can be explored. The industry professional may also benefit from this book by connecting the research with industry needs. This book will be proven as a useful resource for the target audience by outlining the promising future research and providing references to newcomers in the field of video data analytics.

Abhishek Singh Rathore Department of Computer Science & Engineering, Faculty of Science, Shri Vaishnav Vidyapeeth Vishwavidyalaya, Indore, IndiaSurendra Rahamatkar Faculty of Engineering & Technology, Faculty of Information Technology, Amity School of Engineering & Technology, IQAC Amity University- Chhattisgarh, Raipur, IndiaSyed Imran Ali Engineering Department, Computer Engineering, University of Technology and Applied Sciences, Al Musannah, Sultanate of OmanRamgopal Kashyap Department of Computer Science & Engineering, Amity University Chhattisgarh, Raipur, India &Nand Kishore Sharma

List of Contributors

Abhishek Singh RathoreShri Vaishnav Vidyapeeth Vishwavidyalaya, , Indore, IndiaArchana NaikDepartment of Computer Applications, The Bhopal School of Social Sciences, Bhopal, M.P, IndiaKarthika DhanasekarDepartment of Computer Science, SDNB Vaishnav College for Women, Chennai,Dileep SinghSchool of Engineering and Technology, Jagran Lake City University, Bhopal, M.P, IndiaFolasade DurodolaDepartment of Mechatronics Engineering, Federal University of Agriculture, Abeokuta, NigeriaGeetanjli KhambraDepartment of Computer Applications, The Bhopal School of Social Sciences, Bhopal, M.P, IndiaJaved BegOracle, Noida, IndiaDeepika Muni Krishna ReddyDepartment of Computer Science, Crescent University, Chennai, IndiaNand Kishore SharmaAmity University Chhattisgarh, Raipur, IndiaNasib Singh GillDepartment of Computer Science & Applications, Maharshi Dayanand University, Rohtak, IndiaJethro OdeyemiDepartment of Mechatronics Engineering, Federal University of Agriculture, Abeokuta, NigeriaPreeti GuliaDepartment of Computer Science & Applications, Maharshi Dayanand University, Rohtak, IndiaNagarajan RamalingamDepartment of Electrical and Electronics Engineering, Gnanamani College of Technology, Tamilnadu, IndiaKannadhasan SuriyanDepartment of Electronics and Communication Engineering, Cheran College of Engineering, Tamilnadu, IndiaSamuel Oluyemi OwoeyeDepartment of Mechatronics Engineering, Federal University of Agriculture, Abeokuta, NigeriaSangeetaDepartment of Computer Science & Applications, Maharshi Dayanand University, Rohtak, IndiaSatyendra PatnaikAmity University Chhattisgarh, IndiaShahnawaz AhmadDepartment of Electrical Engineering, Jamia Millia Islamia, New Delhi, IndiaShabana MehfuzDepartment of Electrical Engineering, Jamia Millia Islamia, New Delhi, IndiaSini ShibuDepartment of Computer Applications, The Bhopal School of Social Sciences, Bhopal, M.P, IndiaSurendra RahamatkarAmity School Engineering & Technology, Amity University Chhattisgarh, IndiaSweta PatnaikWASH Specialist, UNICEF, Chhattisgarh, India

Comprehensive Analysis of Video Surveillance System and Applications

Nand Kishore Sharma1,*,Surendra Rahamatkar1,Abhishek Singh Rathore2
1 Amity School of Engineering & Technology, Amity University Chhattisgarh, Raipur, India
2 Shri Vaishnav Vidyapeeth Vishwavidyalaya, Indore, India

Abstract

In this growing age of technology, various sensors are used to capture data from their nearby environments. The captured data is multimedia in nature. For example, CCTV cameras are used in those places where security matters or where continuous monitoring is required. Hence object detection, object recognition, and face recognition became key elements of city surveillance applications. Manual surveillance seems time-consuming and requires huge space to store the data; hence video surveillance has a significant contribution to unstructured big data. All surveillance techniques and approaches are based on Object Tracking, Target Tracking, Object Recognition, and Object Mobile Tracking Systems (OMTS). The main difficulty, however, lies in effectively processing them in real time. Therefore, finding a solution still needs careful consideration. This paper mainly targeting to the smart city surveillance system and inspects all existing surveillance systems based on various tremendous technologies like a wireless sensor network, machine learning, and Deep Learning. The author discovered the problems in the existing methods and summarized them in the paper. The motive is to point out the various challenges and offer new research prospects for the multimedia-oriented surveillance system over the traditional surveillance system for the smart city network architecture. The thorough survey in this paper starts with object recognition and goes toward action recognition, image annotation, and scene understanding. This comprehensive survey summarizes the comparative analysis of algorithms, models, and datasets in addition to targeting the methodologies.

Keywords: Deep learning, Face-recognition , Image annotation, Multimedia, Machine learning, OMTS, Object detection, Smart city, Video surveillance.
*Corresponding author Nand Kishore Sharma: Amity School of Engineering & Technology, Amity University Chhattisgarh, Raipur, India; E-mail: [email protected]

INTRODUCTION

Surveillance may be outlined in a variety of ways, including vehicle monitoring at roadside traffic areas as an intelligent traffic management system, theft monitoring & identification, capturing abnormal happenings, monitoring of widely open critical areas, and crowd analysis. Apart from that, it is also important in the smart healthcare system to perform hospital surveillance, secure hospital facilities, detect patient emotion and sentiment, detect patient fraud, and analyze hospital traffic patterns.

The video surveillance system has progressed. Now it is not only aiming to capture and show the video but has upgraded towards an autonomous and intelligent system. Only cutting-edge algorithms have made this possible. Because the purpose of all these algorithms was not only to classify images or videos but also to enhance them. Thus, a modern surveillance system workflow is mentioned in Fig. (1). As a result, video surveillance makes a significant contribution to unstructured big data.

Fig. (1)) Surveillance System workflow.

Intelligent surveillance is the main application of a Smart city, and the objective of the Smart city is to improve the quality of human lives. Smart environments contain sensors & devices that are network-connected and work together to perform operations. Though the last decade, the Internet of Things (IoT) with Machine Learning & Deep Learning has received so much attention. The cause for the accomplishment of this much attention is the services and capabilities offered by it.

The IoT is an interconnection between everyday objects and computing machines. It enables them to communicate in many smart city applications, where smart surveillance is one of them. Intelligent Transport Management Systems, Intelligent Traffic Management systems, Vehicular Ad-Hoc Networks, and Car-to-Car Communication are a few examples of IoT. Here, Intelligence indicates the best utilization of data. This data is generated by aggregating the knowledge and then converted into information through modeling. After that, this information is used for further processing.

In reality, all of this generated data is also known as multimedia data, such as audio and video. Their combination also makes the computation more energy efficient. Wireless Multimedia Sensors are used to collect multimedia data. However, Wireless Multimedia Sensors have exceptional issues such as high bandwidth and energy consumption. Other issues observed include quality of service (QoS), data processing, and compression at the node level. Object detection and recognition schemes have emerged in recent years as a solution for reducing the size of multimedia data at the node level.

The strategy used in it is based on motion detection. The camera only starts recording when it detects motion; otherwise, it does not record. As a result, unnecessary recording and storage are avoided. As a result, no overhead exists.

However, it is inefficient in some ways because it requires the user’s involvement to process forwarded data, so that alert decisions can be made. A simple object classification with few details can work. It is based on a genetic algorithm-based classifier. The classifier used only two features of the objects:

1. Video frames: specifically, the shape of the minimum bounding box, also known as video annotation of the object.

2. The frame rate of the observed region.

This method was tested in a simulated environment on three types of objects: humans, animals, and vehicles. The observation was that as the audio was added, the noise count increased [1].

The evolution of IoT presents enormous challenges in data collection, data analysis, and distribution in the direction of more productive use of information [2].

According to the survey, video surveillance systems have advanced technically over three generations.

The survey said that the video surveillance systems have technically progressed as:

The first generation was introduced in the 1960s with Analogue Close Circuit TV (CCTV). That was primarily for indoor surveillance applications. But the limitations were the recording and distribution process.Digital imaging began to expand surveillance systems in the 1980s. Advances in this area include compression and distribution as well as surveillance algorithms. The system included object tracking as well as an event alert system.The third-generation surveillance systems with fully automated and wide-area surveillance were investigated in the 2000s. The goal was to provide inference frameworks and behavioral analysis.

There are several categorizations of video surveillance that can be drawn to fulfill the requirement. Systems are usually classified into three generations as shown in Fig. (2). And the below Table 1 is representing the same categorization.

Fig. (2)) Surveillance system generations.
Table 1The categorization of the Video Surveillance Systems.CriteriaCategories of SystemImage gain obtained• Number of cameras like one or more. • Cameras are fixed or movable.Applications• To track and recognize the objects. • Re-identification, and behavioral analysis.Architecture• Unaided systems, • Cloud-based and distributed system.
The systems try to emulate the method by which individuals recognize activities and classify them. For instance, backdrop or foreground classification is a typical event detection pre-processing technique. The technique tries to differentiate between static and dynamic foreground scenes. The development of a surveillance system is heavily influenced by the caliber of the acquired input. Some of the video sensor's key characteristics include resolution, frame per second (FPS), and contrast.

LITERATURE REVIEW

Low-cost, compact cameras and microphones are now being introduced by recent technological advancements. And research started to get more precise real-world information. The outcome was distributed Wireless Multimedia Sensor Network. Multimedia sensors make it simpler to gather more accurate and thorough data for a variety of smart city applications. A multilayer automatic surveillance system architecture is suggested in a study [3] for outdoor applications. The system has two layers at the node level. Scalar sensors having the ability to sense motion, vibration, and sound are present in the first layer. This layer activates the second layer to record audio and video. Scalar and multimedia sensors gather data, which is then analyzed and combined in three separate layers.

In computer vision and robotics, automatic human action recognition in any video stream has been a highly addressed challenge. Majorly works were focused on classifying the segmented clips, joint detection, and action recognition. But none of them were dealing with wireless camera networks. To deal with this issue [4], an efficient system is presented by using a wireless smart camera network. The approach was based on Deformable Part Models (DPMs) for object detection in the images. Later, the framework was extended with tight integration inside a centralized video analysis system named Deformable Keyframe Model (DKM) from single-view and multi-view video settings to joint detection and action recognition. The DKM was validated on two different datasets- the publicly available dataset, and Bosch Multiview Complex Action (BMCA) dataset.

In a study [5], a deep learning technique was introduced to predict the type of action in smart cities. Four different types of audio datasets were used: crowded city audio, home appliance sound, household item sound, and human action sound.

The analysis of video surveillance was covered in a study [6]. The study classifies it into two categories, abnormal and normal, as well as object and action recognition. Deep learning architectures is the main topic of this survey. CNN, auto-encoders, and their combinations are the most often used deep learning models for surveillance analysis.

The video data is increasing because of the many networked cameras located in public places around the world for security [7