Computer Vision for the Web - Foat Akhmadeev - E-Book

Computer Vision for the Web E-Book

Foat Akhmadeev

0,0
23,99 €

-100%
Sammeln Sie Punkte in unserem Gutscheinprogramm und kaufen Sie E-Books und Hörbücher mit bis zu 100% Rabatt.

Mehr erfahren.
Beschreibung

This book will give you an insight into controlling your applications with gestures and head motion and readying them for the web. Packed with real-world tasks, it begins with a walkthrough of the basic concepts of Computer Vision that the JavaScript world offers us, and you’ll implement various powerful algorithms in your own online application. Then, we move on to a comprehensive analysis of JavaScript functions and their applications. Furthermore, the book will show you how to implement filters and image segmentation, and use tracking.js and jsfeat libraries to convert your browser into Photoshop. Subjects such as object and custom detection, feature extraction, and object matching are covered to help you find an object in a photo. You will see how a complex object such as a face can be recognized by a browser as you move toward the end of the book. Finally, you will focus on algorithms to create a human interface.
By the end of this book, you will be familiarized with the application of complex Computer Vision algorithms to develop your own applications, without spending much time learning sophisticated theory.

Das E-Book können Sie in Legimi-Apps oder einer beliebigen App lesen, die das folgende Format unterstützen:

EPUB
MOBI

Seitenzahl: 133

Veröffentlichungsjahr: 2015

Bewertungen
0,0
0
0
0
0
0
Mehr Informationen
Mehr Informationen
Legimi prüft nicht, ob Rezensionen von Nutzern stammen, die den betreffenden Titel tatsächlich gekauft oder gelesen/gehört haben. Wir entfernen aber gefälschte Rezensionen.



Table of Contents

Computer Vision for the Web
Credits
About the Author
About the Reviewer
www.PacktPub.com
Support files, eBooks, discount offers, and more
Why subscribe?
Free access for Packt account holders
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example code
Downloading the color images of this book
Errata
Piracy
Questions
1. Math Never Was So Simple!
Installation and core structure representation of JSFeat
Initializing the project
Understanding a digital image
Loading an image into a matrix
Basic matrix operations
Going deeper
Matrix displaying
Useful functions and where to use them
Sorting using JSFeat
Linear algebra
A perspective example
Summary
2. Turn Your Browser into Photoshop
Introducing the tracking.js library
Installation and image loading
Conversion between JSFeat and tracking.js image formats
What is filtering and how to use it?
Image convolution
The Gaussian filter and separate convolution
The box blur
The integral image
Basic edge detection
The Sobel filter
Other operators
Advanced image processing
The Canny edge detector
Histogram equalization
Summary
3. Easy Object Detection for Everyone
Detecting color objects
Using predefined colors with the tracking.js library
Using your own colors
Digging into the tracking.js API
Using the <img> and <video> tags
Building a custom tracker
Image features
Detecting key points
A practical example
Descriptors and object matching
The BRIEF and ORB descriptors
Descriptors implementation and their matching
Finding an object location
Summary
4. Smile and Wave, Your Face Has Been Tracked!
Face detection with JSFeat
Face detection using Haar-like features
Brightness binary features
Tagging people with tracking.js
Haar features with tracking.js
Tagging people in photos
Head tracking with Camshift
The idea behind head tracking
The head tracking application
Summary
5. May JS Be with You! Control Your Browser with Motion
Basic tracking with tracking.js
An example of an object tracking application
Controlling objects with the head motion
The Head-coupled perspective
Controlling a simple box
Optical flow for motion estimation
The Lucas-Kanade optical flow
Optical flow map with oflow
Track points with JSFeat
Zooming with gestures
Summary
6. What's Next?
Refresh your memory
Importance of client-side scripting
Other interesting libraries in Computer Vision
CCV library and its extensions
Face detection and more
Object detection with js-objectdetect
Summary
Index

Computer Vision for the Web

Computer Vision for the Web

Copyright © 2015 Packt Publishing

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

First published: October 2015

Production reference: 1091015

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham B3 2PB, UK.

ISBN 978-1-78588-617-1

www.packtpub.com

Credits

Author

Foat Akhmadeev

Reviewer

Noritsuna Imamura

Acquisition Editor

Tushar Gupta

Content Development Editor

Amey Varangaonkar

Technical Editor

Vivek Pala

Copy Editor

Pranjali Chury

Project Coordinator

Francina Pinto

Proofreader

Safis Editing

Indexer

Mariammal Chettiyar

Graphics

Disha Haria

Production Coordinator

Arvindkumar Gupta

Cover Work

Arvindkumar Gupta

About the Author

Foat Akhmadeev has 5 years of experience in software development and research. He completed his master's degree in the year 2014 from the Kazan Federal University, Russia. He has worked on different projects, including development of high-loaded websites written in Java and real-time object detection for mobile phones. He has an extensive background in the field of Computer Vision. He has also written a scientific paper on 3D reconstruction from a single image. For more information, you can visit his website at http://foat.me.

About the Reviewer

Noritsuna Imamura is a specialist in embedded Linux/Android-based Computer Vision, and is one of the main members of SIProp.org (http://siprop.org/). His main works are as follows:

ITRI Smart Glass, which is similar to Google Glass. He worked on this using Android 4.3 and OpenCV 2.4 in June 2014 (https://www.itri.org.tw/chi/Content/techTransfer/tech_tran_cont.aspx?&SiteID=1&MmmID=620622510147005345&Keyword=&MSid=4858).The Treasure Hunting Robot, a brainwave controlling robot that he developed in February 2012 (http://www.siprop.org/en/2.0/index.php?product%2FTreasureHuntingRobot).OpenCV for Android NDK. This has been included since Android 4.0.1 (http://tools.oesf.biz/android-4.0.1_r1.0/search?q=SIProp).The Auto Chasing Turtle, a human face recognition robot with Kinect, which he developed in February 2011 (http://www.siprop.org/ja/2.0/index.php?product%2FAutoChasingTurtle).Feel Sketch—an AR Authoring Tool and AR Browser as an Android application, which he developed in December 2009 (http://code.google.com/p/feelsketch/).

He can be contacted via e-mail at <[email protected]>.

www.PacktPub.com

Support files, eBooks, discount offers, and more

For support files and downloads related to your book, please visit www.PacktPub.com.

Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at <[email protected]> for more details.

At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.

https://www2.packtpub.com/books/subscription/packtlib

Do you need instant solutions to your IT questions? PacktLib is Packt's online digital book library. Here, you can search, access, and read Packt's entire library of books.

Why subscribe?

Fully searchable across every book published by PacktCopy and paste, print, and bookmark contentOn demand and accessible via a web browser

Free access for Packt account holders

If you have an account with Packt at www.PacktPub.com, you can use this to access PacktLib today and view 9 entirely free books. Simply use your login credentials for immediate access.

Preface

Computer Vision is one of the popular areas in computer science that have gained widespread importance lately. Besides, the power of personal computers has grown, thus opening the gate for developers to use Computer Vision algorithms directly on end user machines using client-side scripting. Nowadays, the most popular programming language for the web is JavaScript. It allows us to develop complex algorithms and run them directly in a web browser; this solves several major problems—the user needs nothing but a browser to run a web application, and as a developer, you get a lower load on your server. In this book, we will provide a comprehensive overview of the most popular JavaScript libraries and discuss the techniques they provide to help you in your initial steps in exciting fields, such as image processing and Computer Vision. This book covers Computer Vision methods by providing an intuitive overview of each algorithm and showing clear examples of the usage of libraries.

What this book covers

Chapter 1, Math Never Was So Simple! covers most of the necessary math operations that you need for using Computer Vision libraries and developing your own application.

Chapter 2, Turning Your Browser into Photoshop, introduces the most popular image processing techniques that are commonly used in the Computer Vision area.

Chapter 3, Easy Object Detection for Everyone, provides a detailed overview of object detection in photos and videos, starting from the basic examples, such as detecting objects by color, to the more complex examples such as feature detection.

Chapter 4, Smile and Wave, Your Face Has Been Tracked! covers detection of the face and face particles. In addition to this, it gives you an example of head tracking.

Chapter 5, May JS Be with You! Control Your Browser with Motion, extends the topic of object detection to object tracking and provides exhaustive examples. It also demonstrates how to create a human interface using gestures or head motion.

Chapter 6, What's Next? summarizes all that we will do throughout this book. Moreover, it provides references to several libraries that are not presented here.

What you need for this book

This book provides an overview of various methods in the Computer Vision area using JavaScript. Knowledge of JavaScript at the beginner level is required. It is totally fine if you know nothing about Computer Vision. However, you will need some basic math knowledge to understand the concepts in the book. To start with, you just need a web browser and your favorite text editor. This book will guide you from the basics of Computer Vision to the most complex algorithms in the JavaScript world.

Who this book is for

You will find this book interesting if you want a much easier way to use the power of Computer Vision in web applications. It will also be beneficial for those who want to implement a human interface on their websites, for example, to create a game. Even if you do not want to add the Computer Vision functionality to your website, take a look at what we have here—websites with Computer Vision algorithms will probably be a trend in the future. Even now you can benefit from using client-side scripting for Computer Vision—you will not need to buy extra machines for your backend.

Reader feedback

Feedback from our readers is always welcome. Let us know what you think about this book—what you liked or may have disliked. Reader feedback is important for us to develop titles that you really get the most out of.

To send us general feedback, simply send an e-mail to <[email protected]>, and mention the book title via the subject of your message. If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, see our author guide on www.packtpub.com/authors.

Customer support

Now that you are the proud owner of a Packt book, we have a number of things to help you to get the most from your purchase.

Downloading the example code

You can download the example code files for all Packt books you have purchased from your account at http://www.packtpub.com. If you purchased this book elsewhere, you can visit http://www.packtpub.com/support and register to have the files e-mailed directly to you. Download link for the book: https://github.com/foat/computer-vision-for-the-web.

Downloading the color images of this book

We also provide you with a PDF file that has color images of the screenshots/diagrams used in this book. The color images will help you better understand the changes in the output. You can download this file from https://www.packtpub.com/sites/default/files/downloads/B05004_Computer_Vision_for_the_Web_ColorImage.pdf.

Errata

Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you find a mistake in one of our books—maybe a mistake in the text or the code—we would be grateful if you would report this to us. By doing so, you can save other readers from frustration and help us improve subsequent versions of this book. If you find any errata, please report them by visiting http://www.packtpub.com/submit-errata, selecting your book, clicking on the erratasubmissionform link, and entering the details of your errata. Once your errata are verified, your submission will be accepted and the errata will be uploaded on our website, or added to any list of existing errata, under the Errata section of that title. Any existing errata can be viewed by selecting your title from http://www.packtpub.com/support.

Piracy

Piracy of copyright material on the Internet is an ongoing problem across all media. At Packt, we take the protection of our copyright and licenses very seriously. If you come across any illegal copies of our works, in any form, on the Internet, please provide us with the location address or website name immediately so that we can pursue a remedy.

Please contact us at <[email protected]> with a link to the suspected pirated material.

We appreciate your help in protecting our authors, and our ability to bring you valuable content.

Questions

You can contact us at <[email protected]> if you are having a problem with any aspect of the book, and we will do our best to address it.

Chapter 1. Math Never Was So Simple!

Computer Vision is all about math. When you need to create your own algorithm or implement something, you address a math topic. You should know how it works on the inside because without digging into the basics, it is hard to do anything. But you are not alone! Many smart people have created several useful libraries to simplify your job. One of those libraries is JSFeat (http://inspirit.github.io/jsfeat/), which has a realization of different math methods. Here, we will discuss fundamental elements of the library such as data structures, especially matrices, and simple math algorithms.

We will cover the following topics:

Installation and core structure representation of JSFeatWhat is inside an image? All about matricesUseful functions and where to use them

Installation and core structure representation of JSFeat

JSFeat is a powerful tool to implement something new. To start using it, we need to initialize the project. It is relatively simple; if you have any experience with JavaScript, then it will not cause any trouble for you. The library itself contains various Computer Vision algorithms and it will be a good starting point for anyone who wants a flexible Computer Vision framework. First, you will learn how to install it and see a basic example of what you can do with the library.