170,99 €
This edited book brings together leading researchers, academic scientists and research scholars to put forward and share their experiences and research results on all aspects of an inspection system for detection analysis for various machine vision applications. It also provides a premier interdisciplinary platform to present and discuss the most recent innovations, trends, methodology, applications, and concerns as well as practical challenges encountered and solutions adopted in the inspection system in terms of image processing and analytics of machine vision for real and industrial application. Machine vision inspection systems (MVIS) utilized all industrial and non-industrial applications where the execution of their utilities based on the acquisition and processing of images. MVIS can be applicable in industry, governmental, defense, aerospace, remote sensing, medical, and academic/education applications but constraints are different. MVIS entails acceptable accuracy, high reliability, high robustness, and low cost. Image processing is a well-defined transformation between human vision and image digitization, and their techniques are the foremost way to experiment in the MVIS. The digital image technique furnishes improved pictorial information by processing the image data through machine vision perception. Digital image pro-cessing has widely been used in MVIS applications and it can be employed to a wide diversity of problems particularly in Non-Destructive testing (NDT), presence/absence detection, defect/fault detection (weld, textile, tiles, wood, etc.,), automated vision test & measurement, pattern matching, optical character recognition & verification (OCR/OCV), barcode reading and traceability, medical diagnosis, weather forecasting, face recognition, defence and space research, etc. This edited book is designed to address various aspects of recent methodologies, concepts and research plan out to the readers for giving more depth insights for perusing research on machine vision using image processing techniques.
Sie lesen das E-Book in den Legimi-Apps auf:
Seitenzahl: 339
Veröffentlichungsjahr: 2020
Cover
Preface
1 Land-Use Classification with Integrated Data
1.1 Introduction
1.2 Background Study
1.3 System Design
1.4 Implementation Details
1.5 System Evaluation
1.6 Discussion
1.7 Conclusion
References
2 Indian Sign Language Recognition Using Soft Computing Techniques
2.1 Introduction
2.2 Related Works
2.3 Experiments
2.4 Summary
References
3 Stored Grain Pest Identification Using an Unmanned Aerial Vehicle (UAV)-Assisted Pest Detection Model
3.1 Introduction
3.2 Related Work
3.3 Proposed Model
3.4 Results and Discussion
3.5 Conclusion
References
4 Object Descriptor for Machine Vision
4.1 Outline
4.2 Chain Codes
4.3 Polygonal Approximation
4.4 Moments
4.5 HU Invariant Moments
4.6 Zernike Moments
4.7 Fourier Descriptors
4.8 Quadtree
4.9 Conclusion
References
5 Flood Disaster Management
5.1 Flood Management
5.2 Existing Disaster Management Systems
5.3 Advancements in Disaster Management Technologies
5.4 Proposed System
References
6 Temporal Color Analysis of Avocado Dip for Quality Control
6.1 Introduction
6.2 Materials and Methods
6.3 Image Acquisition
6.4 Image Processing
6.5 Experimental Design
6.6 Results and Discussion
6.7 Conclusion
References
7 Image and Video Processing for Defect Detection in Key Infrastructure
7.1 Introduction
7.2 Reasons for Defective Roads and Bridges
7.3 Image Processing for Defect Detection
7.4 Image-Based Defect Detection Methods
7.5 Factors Affecting the Performance
7.6 Achievements and Issues
7.7 Conclusion
References
8 Methodology for the Detection of Asymptomatic Diabetic Retinopathy
8.1 Introduction
8.2 Key Steps of Computer-Aided Diagnostic Methods
8.3 DR Screening and Grading Methods
8.4 Key Observations from Literature Review
8.5 Design of Experimental Methodology
8.6 Conclusion
References
9 Offline Handwritten Numeral Recognition Using Convolution Neural Network
9.1 Introduction
9.2 Related Work Done
9.3 Data Set Used for Simulation
9.4 Proposed Model
9.5 Result Analysis
9.6 Conclusion and Future Work
References
10 A Review on Phishing—Machine Vision and Learning Approaches
10.1 Introduction
10.2 Literature Survey
10.3 Role of Data Mining in Antiphishing
10.4 Conclusion
Acknowledgments
References
Index
End User License Agreement
Chapter 1
Table 1.1 Satellite image classification accuracy vs. learning models.
Table 1.2 Base error matrix.
Table 1.3 Accuracy percentages w.r.t. the Base error matrix.
Chapter 2
Table 2.1 Division of data set into training and testing sets on ISL digits.
Table 2.2 The comparative recognition rates of naive Baÿes and kNN classifier...
Table 2.3 The comparative recognition rates of naive Baÿes and neural network...
Table 2.4 Performance of kNN and neural network on one-handed ISL character.
Table 2.5 Classification results of all three feature extraction methods agai...
Table 2.6 ISL-specific computer-related words.
Table 2.7 Number of image frames per ISL word.
Table 2.8 Performance of kNN classifier on ISL word set.
Table 2.9 Recognition rates of neural network classifier against all feature ...
Table 2.10 Summary of results obtained from all classifiers and all feature v...
Chapter 3
Table 3.1 Evaluation of methods using SNR(dB) value.
Table 3.2 Evaluation of methods using PSNR(dB) value.
Table 3.3 PT (in units) calculation of several methods.
Chapter 5
Table 5.1 Machine learning techniques for flood management.
Chapter 7
Table 7.1 Structural defects and severity scales [23].
Chapter 8
Table 8.1 Brief detail of state-of-the-art related to DR screening.
Chapter 9
Table 9.1 Overall statistic of the handwritten numeral data set.
Table 9.2 Training and Testing Accuracy over the three data sets.
Table 9.3 Recognition Rate reported by existing method with current method.
Chapter 1
Figure 1.1 High-level design diagram.
Figure 1.2 Process view of the land-use visualization Web application.
Figure 1.3 Architectural view of the system.
Figure 1.4 Workflow diagram of the system.
Figure 1.5 Variations in features of built-up areas.
Figure 1.6 Workflow of the built-up area extraction process.
Figure 1.7 Original image (left). Image with building and roads (middle). Bu...
Figure 1.8 A satellite image (left) and directly generated segments (middle ...
Figure 1.9 A satellite image (left) and segmented images (middle and right)....
Figure 1.10 Built-up areas extracted from the first image using the devised ...
Figure 1.11 Built-up area and the related flood-filled area.
Figure 1.12 Built-up area segmentation (left) Felzenszwalb’s method, (middle...
Figure 1.13 Foursquare integration.
Figure 1.14 Original satellite image with the land-use visualization.
Figure 1.15 Common API design.
Figure 1.16 Agreement, commission error, and omission error analysis.
Chapter 2
Figure 2.1 The ISL single-handed alphabet signs data set.
Figure 2.2 The ISL double-handed alphabet signs data set.
Figure 2.3 The ISL digit signs data set.
Figure 2.4 The ISL-specific computer terminology word data set.
Figure 2.5 Image preprocessing steps of ISL signs.
Figure 2.6 (a) Original image and (b) thresholded skin color detected image....
Figure 2.7 Comparison of results obtained from Naive Baÿes classifier versus...
Figure 2.8 Comparison of results obtained from Naive Baÿes classifier vs. kN...
Figure 2.9 Performance of classifier kNN through DPV and HC feature vector o...
Figure 2.10 Performance of neural network classifier with DPV and HC feature...
Figure 2.11 The ISL one-handed and two-handed character recognition system....
Figure 2.12 Neural network classifier’s performance against three feature ex...
Figure 2.13 Misclassification due to shape similarity between single-handed ...
Figure 2.14 Misclassification Due to Shape Similarity Between Single-Handed ...
Figure 2.15 Performance of neural network classifier (double-handed ISL alph...
Figure 2.16 The proposed ISL gesture recognition system for specific ISL wor...
Figure 2.17 Comparison of results of all feature vectors against kNN classif...
Figure 2.18 Comparison of results of all feature vectors against neural netw...
Figure 2.19 Comparison of results obtained from DPV and HC feature extractio...
Chapter 3
Figure 3.1 Pest classification over SG.
Figure 3.2 Proposed UAV-assisted pest detection model for SG.
Figure 3.3 Processing of RW (1280 × 853). (a) Original image and results usi...
Figure 3.4 Processing of LGB(800 × 534). (a) Original image and results usin...
Figure 3.5 Processing of RRFB (640 × 426). (a) Original image and results us...
Figure 3.6 Processing of CFB (750 × 511). (a) Original image and results usi...
Figure 3.7 Processing of GW (800 × 502). (a) Original image and results usin...
Figure 3.8 SNR (dB) representation of several methods.
Figure 3.9 PSNR(dB) representation of several methods.
Figure 3.10 PT (in units) representation of several methods.
Chapter 4
Figure 4.1 Workflow of part selection using bounding box part template.
Figure 4.2 Minimum perimeter polygon approximation.
Figure 4.3 Adjacent pixel for selection of the vertex forming the boundary....
Figure 4.4 Selection of adjacent vertex based on segmentation.
Figure 4.5 (a) Four quadrants of the quadtree. (b) The tree structure of the...
Chapter 5
Figure 5.1 The number of people exposed to flood.
Figure 5.2 The pattern of temperature differences and average rainfall in Au...
Figure 5.3 Landmarks extraction from images
Figure 5.4 Flood detection using feature matching.
Figure 5.5 Flood detection using machine learning.
Chapter 6
Figure 6.1 Diagram of the methodology.
Figure 6.2 Diagram of the image acquisition setup with connection to a lapto...
Figure 6.3 Avocado dip samples without microwave treatment taken at 0, 4, 24...
Figure 6.4 Avocado dip samples with microwave treatment taken at 0, 4, 24, a...
Figure 6.5 Distribution color plots in
a
*–
b
* space for each image of avocado...
Figure 6.6 Distribution color plots in
a
*–
b
* space for each image of avocado...
Figure 6.7 Bivariate frequency histogram for
a
*–
b
* space for each image of a...
Figure 6.8 Bivariate frequency histogram for
a
*–
b
* space for each image of a...
Chapter 7
Figure 7.1 Overview of basic defect detection.
Figure 7.2 Crack detection [8].
Figure 7.3 Potholes detection. (a) Input image, (b) Detection, (c) Output im...
Figure 7.4 Water puddles [18].
Figure 7.5 Water puddles detection [18].
Figure 7.6 Pavement distress detection [22].
Chapter 8
Figure 8.1 The architecture of computer-aided diagnostic method for the dete...
Figure 8.2 Flowchart of the design of experiment.
Chapter 9
Figure 9.1 Sample of handwritten Odia numeral database of IIT, BBSR.
Figure 9.2 Sample of handwritten Bangla numeral database of NIT, Rourkela.
Figure 9.3 Sample of handwritten Odia numeral database of ISI, Kolkata.
Figure 9.4 (a) Input an image to layer C1, (b) forwarding the value from lay...
Figure 9.5 (a) Forwarding from layer S2 to layer C3, (b) forwarding from lay...
Figure 9.6 Various Recognition rate reported for IIT BBS Handwritten Odia Nu...
Figure 9.7 Various Recognition rate reported for ISI Kolkata Handwritten Odi...
Figure 9.8 Various recognition rates reported for NIT Rourkela handwritten B...
Cover
Table of Contents
Begin Reading
ii
iii
iv
xi
xii
xiii
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
60
61
62
63
64
65
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
Scrivener Publishing100 Cummings Center, Suite 541JBeverly, MA 01915-6106
Publishers at ScrivenerMartin Scrivener ([email protected])Phillip Carmical ([email protected])
Edited by
Muthukumaran Malarvel, Soumya Ranjan Nayak, Surya Narayan Panda, Prasant Kumar Pattnaik and Nittaya Muangnak
This edition first published 2020 by John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, USA and Scrivener Publishing LLC, 100 Cummings Center, Suite 541J, Beverly, MA 01915, USA© 2020 Scrivener Publishing LLCFor more information about Scrivener publications please visit www.scrivenerpublishing.com.
All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, except as permitted by law. Advice on how to obtain permission to reuse material from this title is available at http://www.wiley.com/go/permissions.
Wiley Global Headquarters111 River Street, Hoboken, NJ 07030, USA
For details of our global editorial offices, customer services, and more information about Wiley products visit us at www.wiley.com.
Limit of Liability/Disclaimer of WarrantyWhile the publisher and authors have used their best efforts in preparing this work, they make no representations or warranties with respect to the accuracy or completeness of the contents of this work and specifically disclaim all warranties, including without limitation any implied warranties of merchant-ability or fitness for a particular purpose. No warranty may be created or extended by sales representatives, written sales materials, or promotional statements for this work. The fact that an organization, website, or product is referred to in this work as a citation and/or potential source of further information does not mean that the publisher and authors endorse the information or services the organization, website, or product may provide or recommendations it may make. This work is sold with the understanding that the publisher is not engaged in rendering professional services. The advice and strategies contained herein may not be suitable for your situation. You should consult with a specialist where appropriate. Neither the publisher nor authors shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages. Further, readers should be aware that websites listed in this work may have changed or disappeared between when this work was written and when it is read.
Library of Congress Cataloging-in-Publication Data
Names: Malarvel, Muthukumaran, editor. | Nayak, Soumya Ranjan, 1984-editor. | Panda, Sury Narayan, editor. | Pattnaik, Prasant Kumar, 1969-editor. | Muangnak, Nittaya, editor.Title: Machine vision inspection systems / edited by Muthukumaran Malarvel,Soumya Ranjan Nayak, Sury Narayan Panda, Prasant Kumar Pattnaik andNittaya Muangnak.Description: Hoboken, NJ : Wiley-Scrivener, 2020. | Includesbibliographical references and index. | Contents: Volume 1. Imageprocessing, concepts, methodologies and applications —Identifiers: LCCN 2020020076 (print) | LCCN 2020020077 (ebook) | ISBN9781119681809 (hardback) | ISBN 9781119681960 (adobe pdf) | ISBN9781119682097 (epub)Subjects: LCSH: Computer vision. | Computer vision—Industrialapplications. | Engineering inspection—Automation. | Image processing.| Image processing—Digital techniques.Classification: LCC TA1634 .M3354 2020 (print) | LCC TA1634 (ebook) | DDC006.3/7—dc23LC record available at https://lccn.loc.gov/2020020076LC ebook record available at https://lccn.loc.gov/2020020077
ISBN 978-1-119-68180-9
Cover image: Pixabay.ComCover design by Russell Richardson
This edited book aims to bring together leading researchers, academic scientists and research scholars to put forward and share their experiences and research results on all aspects of an inspection system for detection analysis for various machine vision applications. It also provides a premier interdisciplinary platform for educators, practitioners and researchers to present and discuss the most recent innovations, trends, methodology, applications and concerns, as well as practical challenges encountered and solutions adopted in the inspection system in terms of image processing and analytics of machine vision for real and industrial application. The book is organized into ten chapters,
Chapter 1 presents an overview of an automated methodology-based learning model classification technique for identifying the usage and coverage of land use in Sri Lanka by using satellite imagery data. This chapter also discusses the issue related to manual surveys and its limitations about the land-use of different regions.
Chapter 2 focuses on the Indian sign language recognition using machine learning algorithm in machine vision and pattern recognition research areas. The work is to translate acquired images or videos either offline or online to corresponding words, numbers or sentences representing the meaning of the input sign. The Direct Pixel Value, Hierarchical Centroid, Local Histogram features of Image Processing techniques are used as a feature in the experimental analysis. The classifiers used here are k-Nearest Neighbour and Neural Network.
Chapter 3 presents an unmanned aerial vehicle (UAV) assist the pest detection model to track pests in the stored grain (SG). This proposed model consists of four phases such as data acquisition, edge detection, feature extraction, and pest identification. In this model, the edge detection (ED) phase is focused on analyzing the data (pest in the SG images). Many standard edge detection (SED) methods such as Sobel, Prewitt, Roberts, Morphological, Laplacian of Gaussian (LoG), Canny etc. are used to track the shape, location, and quantity of pests in SG. The implementation of the methods are performed using MATLAB R2015b and evaluated using signal to noise ratio (SNR), peak signal to noise ratio (PSNR), and processing time (PT).
Chapter 4 describes object selection as a trade-off between performance and accuracy. Particularly, in machine vision time versus precision for object selection plays a crucial role in image analysis is addressed. These regions are a group of segmented pixels that are used for processing. Such regions are often represented by numbers called “object descriptors”. Using such data, the authors compare and distinguish objects by matching the descriptors. Without loss of generality, these descriptors have certain properties like (a) invariance against geometrical transformations like translation, rotation and scaling, (b) Stability to noise and non-rigid local deformation, (c) Completeness.
Chapter 5 explores flood control and disaster management technologies based on image processing and machine learning. The main objective of this chapter is to develop an understanding of the flood risks, explore the existing systems for managing the risks and devise a flood management model through machine vision. Furthermore, this chapter discusses the limitations of the current technologies and suggests a reliable model to overcome the problems. Finally, this chapter elaborates on the system of how to detect flood-affected areas and determine rescue routes.
Chapter 6 discusses the color changes on the avocado dip under microwave conditions through the machine vision approach. This chapter analyzes the change of color in a* - b* space in terms of different treatments. Also, this chapter discusses real-time experimental analysis by various parameters.
Chapter 7 deliberates the defect detection on defective roads and bridges through computer vision-based techniques. This chapter discusses the basic steps involved in defect detection using image processing along with existing systems and presents the pros and cons of the different existing methods in terms of performance. Also, this chapter applies multiple image processing techniques to solve the various types of defects.
Chapter 8 presents the study and conducts experiments through machine vision techniques on diabetic retinopathy disease present in retinal fundus images. This chapter also discusses various factors of the disease that appears in the image and discusses the possible solutions in terms of image processing techniques. An effective analysis is shown for computer-aided solutions.
Chapter 9 provides a robust method to solve the ambiguities in handwritten the OCR system. This has been resolved using the Convolutional Neural Network (CNN) based approach. This state-of-the-art of CNN-based approach for recognition of multiple handwritten numerals of various scripts is clearly shown here. It is also quite helpful to report the discriminate features of each individual and later lead to reporting a high recognition rate. At the simulation level we have listed the variance nature of the individual’s images and through CNN we have reported a high recognition rate, which is quite helpful in building the automatic recognition system for handwritten numerals to have the solution for real-time problems.
Chapter 10 presents a detailed review of some of the attempts towards avoiding, detecting and preventing phishing in terms of visual methods. This chapter explains the frauds and criminal activities on phishing and, moreover, discusses the various solutions approached in recent years. Additionally, this chapter reviews the role of training and education on the reduction of phishing victims.
We have to start by thanking God Almighty for giving us the ability and opportunity to undertake to edit this book and to complete it satisfactorily. Completion of this book could not have been accomplished without the support of all editors starting from the “Call for Chapters” till their finalization. All the contributors have given their contributions amicably and is a positive sign of significant teamwork. The editors are sincerely thankful to all the members of Scrivener Publishing especially Martin Scrivener for providing constructive inputs and allowing an opportunity to edit this important book. We are equally thankful to all reviewers who hail from different places in and around the globe shared their support and stand firm towards quality chapter submissions. Finally, we are eternally grateful to the authors for contributing quality chapters.
Muthukumaran MalarvelSoumya Ranjan NayakSurya Narayan PandaPrasant Kumar PattnaikNittaya MuangnakApril 2020
D. A. Meedeniya*, J. A. A. M Jayanetti, M. D. N. Dilini, M. H. Wickramapala and J. H. Madushanka
Department of Computer Science and Engineering, University of Moratuwa, Sri Lanka
Abstract
The identification of the usage and coverage of the land is a major part of regional development. Crowdsourced geographic information systems provide valuable information about the land use of different regions. Although these data sources lack reliability and possess some limitations, they are useful in deriving building blocks for the usage of the land, where the manual surveys are not up-to-date, costly, and time consuming. At present, in the context of Sri Lanka, there is a lack of reliable and updated land-use data. Moreover, there is a rapid growth in the construction industry, resulting in frequent changes in land-use and land-cover data. This paper presents a novel and an automated methodology based on learning models for identifying the usage and coverage of the land. The satellite imagery is used to identify the information related to land cover. They are integrated with Foursquare venue data, which is a popular crowdsourced geographic information, thus, enhancing the information level and the quality of land-use visualization. The proposed methodology has shown a kappa coefficient of 74.03%, showing an average land-use classification accuracy within a constrained environment.
Keywords: Geographic information system, land-cover identification, land-use classification, social computing, decision support system, satellite images, Foursquare data
Regional planning and management are major concerns in the development strategy of a country. The information related to the coverage and usage of lands can be used to extract the features in an area and facilitate development activities. The land-use data are related to human activities, whereas the land-cover information represent the natural features and artificial constructions on the earth surface. Crowdsourced geographic information systems provide valuable information about the land use of different regions. At present, up-to-date data on land usage and coverage are not available for all the cities in Sri Lanka. This is due to the cost of labor, lack of the required technologies, and resources associated with the data surveys. Unavailability of a cost-effective way of obtaining such latest and reliable data is a bottleneck to the long-term planning and development of a region. This results in unplanned ad hoc developments, construction of unhealthy residential areas, deterioration of service and infrastructure, environmental pollution, increased traffic congestion, and so on [1], which can be widely seen in many urban areas in Sri Lanka. Therefore, up-to-date data on the usage and coverage of land are important to make strategic decisions on sustainable region planning.
The objective of this research is to design and develop a support system to classify the land-cover and land-use data using Google Satellite imagery [2] and Foursquare data, which is a type of volunteer geographic information (VGI), respectively [3]. The system produces a visualization of different types of land-use in each area (eg. residential, industrial, commercial, agriculture etc.) on a land-use map based on heterogeneous data sources including crowdsourced Foursquare data. Acquiring data on land cover and land use from different data types, which can be integrated into the classification system, will enhance the quality of the processed information [4].
Therefore, this research provides a novel way of identifying and classifying different forms of land-use data, specifically satellite imagery and Foursquare data, with the extensible features for other types of related data. The system refines the land-use mapping with the use of additional parameters, such as context-specific different data sources. Ultimately, the retrieved data can be used to monitor land-use changes in near real time [2]. Moreover, this study focuses on developing a common platform that enables the collaboration of heterogeneous data sources to produce enhanced land-use data. Further, this will increase the utility value of the retrieved information on land-cover and land-use, hence, widening the range of applicable applications from the results. Colombo district is selected as the study area considering the availability and sampling rates of different data sets and issues associated with data validation [4]. The proposed land-use visualization approach identifies and classifies different forms of land cover and land use in a selected area considering the satellite imagery and Foursquare data, respectively, and displays the classification on a land-use map.
The land-use data retrieved from the proposed methodology can be used to monitor land-use changes near real time. Analysis of these detailed snapshots of land-use enables authorities to detect a change and foresee its social and environmental consequences. This, in turn, will enable them to identify long-lasting sustainable solutions to urbanization issues in Sri Lanka.
The paper is structured as follows: Section II explores the related literature and Section III presents the design architectures of the system and Section IV describes the development process with the used techniques. Section V evaluates the validity of the proposed method, and finally, Section VI summarizes the research including future work.
The identification of the usage and coverage of the land is a major part of regional development. Land cover and land use are often interchangeably used in many information systems, despite the distinct difference between those two terms [1, 4]. Land cover refers to observable physical properties of land, such as the areas with trees, grounds, building, roads, water, and so on. On the other hand, land use refers to purposes for which lands are being used, such as residential, commercial, entertainment, and so on. It may be difficult to determine the actual purpose for which land is being used by solely using the information produced by a source of observation. For example, in the absence of additional information sources, it may be difficult to decide whether grasslands are used for agricultural activities or not. Moreover, there is a rapid growth in the construction industry, resulting in frequent changes in land-use and land-cover data.
As a summary, the land coverage and usage data are important to identify correctly and process timely manner in order to make decisions on regional development. However, it is challenging to obtain large-scale, latest data from reliable sources. The unplanned constructions may impact the region with unprogressive infrastructure, unhealthy residentials, and environment issues, such as traffic congestion and pollution.
A Geographical Information Systems (GIS) facilitates collection, store, process, analyze, and visualize data on the surface of the earth. Prior to the discovery of advanced technologies and concepts, a GIS primarily consisted of a database management system (DBMS) and a set of tools that allowed data to be retrieved from the DBMS. With the advent of the Internet and Web-based applications, as well as the increased utilization of mobile devices, the traditional notion of a GIS has been altered significantly. Particularly, the user base of GISs has expanded from traditional users, such as relevant public and private sector to just about anyone who uses an application built on top of a GIS in their mobile or other electronic devices.
Even though the primary functions, such as data storage, retrieval, visualization, and so on, are common to all GISs, the nature of these functions depends largely on the underlying application. Based on the area of application, GIS can be classified into different information system types, such as cadastral, land and property, image-based, natural resource management, spatial information systems, and so on.
The GIS can be used to find solutions to a wide variety of problems, such as determining and predicting the features of a particular location, identifying locations with particular characteristics, monitoring change over periods, determining the best route between different locations, modeling environmental and societal evolution over time, crime location mapping, disaster management, transportation planning, management, and so on.
Point data are used to represent discrete data points that have no adjacent features. For example, in a GIS, the location of a place can be represented by a point location. The GIS data are of two categories, the spatially referenced data and the attribute data table associated with it. The spatially referenced data are represented in vector and raster forms. Foursquare data [3, 5] and OpenStreetMap data [6] are two popular GIS point data sources that can be utilized for the identification of land-use utilization at a detailed level.
Foursquare [3, 5] is a mobile-based social networking service that facilitates to acquire user-location information using check-in and search history; and recommend places near the user’s current location. These data contain the names, locations, and types of places. When providing recommendations, the application makes use of its citizen-contributed database, the places a user goes, and the advice and comments given by other users on a place. This is a volunteered/crowdsourced VGI, that uses geographic data provided by the users. Thus, Foursquare data are used to identify land-use information with proper quality validation and consumes a low-cost mechanism.
OpenStreetMap [6] is a popular GIS data source that provides geographic data. This presents the physical features within a given area, such as commercial buildings, roads, waterways, railways, highways, and amenities using tags, and each of those tags describes a geographical attribute. Moreover, OpenStreetMap releases map data under a free and open license, which makes them available for researchers and as a data validation source.
An aerial data set is prepared by means of airborne methods and aerial photographs and Google satellite imagery some examples [2]. Satellite images of the Earth’s surface captured by remote-sensing techniques have proven to be a useful data source for many research studies and applications in diverse fields [2, 7]. Satellite images enable wider geographical areas to be covered quickly with relative convenience. They provide a way of monitoring the Earth’s surface, hence, eliminating the need to rely solely on labor-intensive processes, such as area frame surveys for maintaining up-to-date information on the Earth’s surface, which tends to be a slow and a tedious process. Moreover, the digital format of satellite images enables to be directly processed digitally and integrated with other data sources with relative ease. Data retrieved from satellite images are used to successfully extract the required information on the land usage for decision making and predicting systems.
High elevation aerial photographs of the Earth’s surface are a remote-sensing data source. They can be used to identify the coverage of lands. These visual interpretations are also used in conjunction with satellite images, particularly to fill areas which are not clear on satellite images due to prevalent atmospheric conditions, such as cloud cover at the time of capturing those. Aerial photographs are of high spatial resolution. Therefore, these images comprise a high level of information which is useful for various analytical and reference purposes associated with land-cover classifications. The spectral range of aerial photography is narrow, and therefore, a considerable number of photographs will have to be patched together to cover wider geographical areas. Further, the cost per surface unit is higher with aerial photographs compared with satellite images.
Among the variety of research studies, the study by Quercia and Saez [5], has described the feasibility of acquiring data from the locations of social-media users, considering the mining urban deprivation information in London city. This study has addressed the reliable, latest and inferring free data retrieval, and shown the use of Foursquare data in monitoring fine-grain resolutions, which cannot be obtained from other land-use data used in practice. However, this gives an insight into the limitations of Foursquare data such as demographic and geographic biases and the Foursquare categories not being fully structured. For instance, these can be biased by the penetration rate of the Foursquare data and a given location can be named with different category types based on the thinking patterns of the users who entered the Foursquare data.
A multidata approach (MDA) is presented by Bareth [4] to generate data related to land use. They have combined the traditional types of data for land usage with the information obtained from the different remote-sensing analysis. As a first step, remote sensing data were classified using supervised classification techniques and then the quality of the classified data was assessed. The second step of the research was to import the classified data into a GIS environment and overlay them with relevant 3rd-party data sources, such as official topographic or land-use data. The importance of this approach is that it enables useful and high-quality land-use information in various spatial databases, such as spatial biotope or biodiversity databases, spatial databases of national parks, and so on to be integrated with the results of remote-sensing analysis. Further, land-use data retrieved from official data sources can be integrated to MDA for cross-checking the results of remote-sensing analysis. Also, by incorporating the results of land-cover change models to MDA, they have simulated the change scenarios for the usage and coverage of the land.
According to the literature, satellite images are mainly used to identify urban land usage that covers a large area without frequent changes. However, it is challenging to obtain high-resolution images. Thus, there is a research direction toward the integration of these remote-sensing data with the social knowledge data for better analysis of the usage and coverage patterns of the land.
The primary focus of this study is to classify land usage into several categories, such as residential, hotels, restaurants, industrial, and so on. Two data sources google satellite images for the coverage and foursquare point data for usage of the land are used for this study. First, the google satellite imagery is used to classify land-cover data that represent primitive categories, such as water, built-up, vegetation, and bare soil. Next, this output would be further differentiated using foursquare point data into the abovementioned land usage classes. Figure 1.1 shows the overall view of the proposed system.
System design mainly consists of four components. The data preprocessing component removes and purifies noises in the input data. The data classification component applies classification and clustering techniques on the preprocessed data. These results of heterogeneous data will be integrated into the data integration component. Then, data analysis component will evaluate the accuracy of the classified data. The main output of the system is a Shape file that adds a layer to Quantum Geographic Information System (QGIS). The QGIS is a free desktop application that supports different platforms. It has features to view, edit, and analyze geospatial data together with quality measures. The final system is expected to be used for urban planning and government officials who want to collect data, and so on.
Figure 1.2 shows the sequence diagram of the proposed Web application. It shows how such a user can easily retrieve the results produced by the system so that he can use this information in his respective work, such as sustainable urban development planning. First, the user will access a Web interface where he is provided with a google map. He can zoom in the map and select a rectangular area using the mouse pointer. The analyst can highlight the area from the map, which he wants the land-use classification. The common interface will send the boundaries (latitudes and longitudes) of the area to the land-cover classification module, which will return the stitched image. Next, the common interface will send the land-cover classified map along with the boundaries to the land usage classification module. This module will get the Foursquare locations and their respective classes from the Foursquare database, perform the land-use classification, and return the final land usage classified map to the analyst. In addition, a legend is also provided with the different categories of land usage classes, such as residential, restaurants, hotels, and so on, and their assigned colors will also appear with the final map.
Figure 1.1 High-level design diagram.
Figure 1.2 Process view of the land-use visualization Web application.
Figure 1.3 shows the architectural view of the proposed system, which complies with the overall view, and the components are described with the workflow given in Figure 1.4. The module “Input Data Manager” handles the input data types to the system. The current system uses satellite images, Foursquare data, and official land-use data. The architecture is designed in an extensible feature to incorporate other data types, such as OpenStreetMap. The module “Visualization Manager” visualizes the processed Foursquare data and point location data in terms of land-use classification with a color-coded layer based on the predefined categories and described in detail in the chapter. The logical layer comprises three main modules responsible to preprocess data, integrate data sources, and the learning process.
Figure 1.3 Architectural view of the system.
Figure 1.4 describes the workflow of the system, which is an extension of our previous work [7]. Initially, Google satellite imagery and Foursquare data of Colombo District, which contains instances of varieties of land uses were collected and preprocessed separately. The satellite imagery was subjected to bilateral filtering [8] to remove the Gaussian noise [9]. Next, edge enhancement techniques were applied to enhance the important features of satellite images, such as sharp edges, which helps to identify built-up areas. Foursquare data also need to be cleaned because they contain irrelevant and unreliable data, such as location names inserted with English and Tamil. Next, the preprocessed satellite images are classified using supervised learning techniques, such as random forest, and unsupervised classification techniques, such as k-means clustering. The foursquare point locations would be further differentiated into nine basic classes. Next, these two outputs will be integrated in order to produce the final land usage classified map. The common platform enables heterogeneous data sources to collaborate to produce enhanced land-use data. The implementation details are further discussed in the Implementation section.
Figure 1.4 Workflow diagram of the system.
The data set considered for this study includes over 20,000 satellite images containing diverse features for each land-cover category. The differences in built-up areas were caused by differences in actual building structures and shapes, as well as due to lighting conditions and quality of the sensors used for capturing the images. Further, Google satellite images only include red, green, blue (RGB) bands, and certain land features are not well represented in these bands. Thus, developing a universal approach to automatically extract built-up areas to a satisfactory degree of accuracy is challenging.
This study has used 3863 foursquare points for the study area of around 700 square kms in Colombo District, All data records were consistent and complete and followed a common format and there were no discrepancies and duplications. Each record consisted of the location data such as name, geographical coordinates, category types the place belonged to, and the number of user and the check-ins. There were 10 main categories for Foursquare data, and 354 subcategories. However, some of the subcategories were not properly categorized under the main categories. In order to overcome this issue, this study has realigned foursquare categories and subcategories into nine land-use types.
Initially, the images are smoothened using the bilateral filter, which effectively removes Gaussian noise while preserving the edge information of the images. The bilateral filter calculates the average of pixels surrounding a pixel and replaces it with the weighted average of intensities in neighboring pixels [10]. Foursquare being a VGI data set, the reliability of data is one of the key issues. In order to remove the unreliable data, the points with a user count below a given threshold value were removed, because the degree of the unreliability of the existence of a point tends to decrease with the number of unique users checked in for a given location [2
