This book will give you an insight into controlling your applications with gestures and head motion and readying them for the web. Packed with real-world tasks, it begins with a walkthrough of the basic concepts of Computer Vision that the JavaScript world offers us, and you’ll implement various powerful algorithms in your own online application. Then, we move on to a comprehensive analysis of JavaScript functions and their applications. Furthermore, the book will show you how to implement filters and image segmentation, and use tracking.js and jsfeat libraries to convert your browser into Photoshop. Subjects such as object and custom detection, feature extraction, and object matching are covered to help you find an object in a photo. You will see how a complex object such as a face can be recognized by a browser as you move toward the end of the book. Finally, you will focus on algorithms to create a human interface.
By the end of this book, you will be familiar with applying complex Computer Vision algorithms in your own applications, without spending much time learning sophisticated theory.
Page count: 133
Year of publication: 2015
Copyright © 2015 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
First published: October 2015
Production reference: 1091015
Published by Packt Publishing Ltd.
Livery Place
35 Livery Street
Birmingham B3 2PB, UK.
ISBN 978-1-78588-617-1
www.packtpub.com
Author
Foat Akhmadeev
Reviewer
Noritsuna Imamura
Acquisition Editor
Tushar Gupta
Content Development Editor
Amey Varangaonkar
Technical Editor
Vivek Pala
Copy Editor
Pranjali Chury
Project Coordinator
Francina Pinto
Proofreader
Safis Editing
Indexer
Mariammal Chettiyar
Graphics
Disha Haria
Production Coordinator
Arvindkumar Gupta
Cover Work
Arvindkumar Gupta
Foat Akhmadeev has 5 years of experience in software development and research. He completed his master's degree at Kazan Federal University, Russia, in 2014. He has worked on various projects, including the development of high-load websites written in Java and real-time object detection for mobile phones. He has an extensive background in the field of Computer Vision and has written a scientific paper on 3D reconstruction from a single image. For more information, you can visit his website at http://foat.me.
Noritsuna Imamura is a specialist in embedded Linux/Android-based Computer Vision, and is one of the main members of SIProp.org (http://siprop.org/). His main works are as follows:
He can be contacted via e-mail at <[email protected]>.
For support files and downloads related to your book, please visit www.PacktPub.com.
Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at <[email protected]> for more details.
At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.
https://www2.packtpub.com/books/subscription/packtlib
Do you need instant solutions to your IT questions? PacktLib is Packt's online digital book library. Here, you can search, access, and read Packt's entire library of books.
If you have an account with Packt at www.PacktPub.com, you can use this to access PacktLib today and view 9 entirely free books. Simply use your login credentials for immediate access.
Computer Vision is one of the areas of computer science that has gained widespread importance in recent years. At the same time, personal computers have become more powerful, which allows developers to run Computer Vision algorithms directly on end-user machines with client-side scripting. Today, the most popular programming language for the web is JavaScript. It allows us to develop complex algorithms and run them directly in a web browser. This solves several major problems: the user needs nothing but a browser to run a web application, and as a developer, you get a lower load on your server. In this book, we provide a comprehensive overview of the most popular JavaScript libraries and discuss the techniques they offer to help you take your first steps in exciting fields such as image processing and Computer Vision. The book covers Computer Vision methods by giving an intuitive overview of each algorithm and showing clear examples of library usage.
Chapter 1, Math Never Was So Simple! covers most of the necessary math operations that you need for using Computer Vision libraries and developing your own application.
Chapter 2, Turning Your Browser into Photoshop, introduces the most popular image processing techniques that are commonly used in the Computer Vision area.
Chapter 3, Easy Object Detection for Everyone, provides a detailed overview of object detection in photos and videos, starting from the basic examples, such as detecting objects by color, to the more complex examples such as feature detection.
Chapter 4, Smile and Wave, Your Face Has Been Tracked! covers the detection of faces and facial features. In addition, it gives you an example of head tracking.
Chapter 5, May JS Be with You! Control Your Browser with Motion, extends the topic of object detection to object tracking and provides exhaustive examples. It also demonstrates how to create a human interface using gestures or head motion.
Chapter 6, What's Next? summarizes everything covered in this book. It also provides references to several libraries that are not presented here.
This book provides an overview of various methods in the Computer Vision area using JavaScript. Knowledge of JavaScript at the beginner level is required. It is totally fine if you know nothing about Computer Vision. However, you will need some basic math knowledge to understand the concepts in the book. To start with, you just need a web browser and your favorite text editor. This book will guide you from the basics of Computer Vision to the most complex algorithms in the JavaScript world.
You will find this book interesting if you want a much easier way to use the power of Computer Vision in web applications. It will also be beneficial for those who want to implement a human interface on their websites, for example, to create a game. Even if you do not want to add the Computer Vision functionality to your website, take a look at what we have here—websites with Computer Vision algorithms will probably be a trend in the future. Even now you can benefit from using client-side scripting for Computer Vision—you will not need to buy extra machines for your backend.
Feedback from our readers is always welcome. Let us know what you think about this book—what you liked or may have disliked. Reader feedback is important for us to develop titles that you really get the most out of.
To send us general feedback, simply send an e-mail to <[email protected]>, and mention the book title in the subject of your message. If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, see our author guide at www.packtpub.com/authors.
Now that you are the proud owner of a Packt book, we have a number of things to help you to get the most from your purchase.
You can download the example code files for all Packt books you have purchased from your account at http://www.packtpub.com. If you purchased this book elsewhere, you can visit http://www.packtpub.com/support and register to have the files e-mailed directly to you. Download link for the book: https://github.com/foat/computer-vision-for-the-web.
We also provide you with a PDF file that has color images of the screenshots/diagrams used in this book. The color images will help you better understand the changes in the output. You can download this file from https://www.packtpub.com/sites/default/files/downloads/B05004_Computer_Vision_for_the_Web_ColorImage.pdf.
Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you find a mistake in one of our books (maybe a mistake in the text or the code), we would be grateful if you would report this to us. By doing so, you can save other readers from frustration and help us improve subsequent versions of this book. If you find any errata, please report them by visiting http://www.packtpub.com/submit-errata, selecting your book, clicking on the Errata Submission Form link, and entering the details of your errata. Once your errata are verified, your submission will be accepted and the errata will be uploaded on our website, or added to any list of existing errata, under the Errata section of that title. Any existing errata can be viewed by selecting your title from http://www.packtpub.com/support.
Piracy of copyright material on the Internet is an ongoing problem across all media. At Packt, we take the protection of our copyright and licenses very seriously. If you come across any illegal copies of our works, in any form, on the Internet, please provide us with the location address or website name immediately so that we can pursue a remedy.
Please contact us at <[email protected]> with a link to the suspected pirated material.
We appreciate your help in protecting our authors, and our ability to bring you valuable content.
You can contact us at <[email protected]> if you are having a problem with any aspect of the book, and we will do our best to address it.
Computer Vision is all about math. Whenever you create your own algorithm or implement an existing one, you are working on a math problem. You should know how it works on the inside, because without digging into the basics it is hard to do anything. But you are not alone! Many smart people have created useful libraries to simplify your job. One of those libraries is JSFeat (http://inspirit.github.io/jsfeat/), which implements a range of math methods. Here, we will discuss fundamental elements of the library, such as data structures (especially matrices) and simple math algorithms.
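To make the idea of a matrix data structure more concrete, here is a minimal sketch in plain JavaScript. It does not use JSFeat and is not the library's API. The helper names (createMatrix, getAt, setAt, transpose) are hypothetical and chosen only for illustration. It assumes a single-channel matrix stored row by row in one flat typed array, which is a common layout for image data.

```javascript
// Hypothetical helpers, not part of JSFeat: a single-channel matrix
// stored row by row in one flat typed array.
function createMatrix(columns, rows) {
  return {
    columns: columns,
    rows: rows,
    data: new Float32Array(columns * rows) // zero-initialized
  };
}

// Element (row, col) lives at index row * columns + col.
function getAt(m, row, col) {
  return m.data[row * m.columns + col];
}

function setAt(m, row, col, value) {
  m.data[row * m.columns + col] = value;
}

// Returns a new matrix whose rows are the columns of m.
function transpose(m) {
  var t = createMatrix(m.rows, m.columns);
  for (var r = 0; r < m.rows; r++) {
    for (var c = 0; c < m.columns; c++) {
      setAt(t, c, r, getAt(m, r, c));
    }
  }
  return t;
}

var m = createMatrix(3, 2);  // 3 columns, 2 rows
setAt(m, 0, 2, 5);           // row 0, column 2
var t = transpose(m);        // 2 columns, 3 rows
console.log(getAt(t, 2, 0)); // 5
```

Keeping all values in one flat array avoids nested arrays and keeps neighboring elements of a row next to each other in memory, which tends to suit the pixel-by-pixel loops used in image processing.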
We will cover the following topics:
JSFeat is a powerful tool for building something new. To start using it, we need to initialize the project. This is relatively simple; if you have any experience with JavaScript, it will not cause you any trouble. The library contains various Computer Vision algorithms, and it is a good starting point for anyone who wants a flexible Computer Vision framework. First, you will learn how to install it and see a basic example of what you can do with the library.
