E-Book
23,99 €

Computer Vision for the Web E-Book

Foat Akhmadeev

0,0

23,99 €

Sammeln Sie Punkte in unserem Gutscheinprogramm und kaufen Sie E-Books und Hörbücher mit bis zu 100% Rabatt.

Mehr erfahren.

Herausgeber: Packt Publishing
Kategorie: Fachliteratur
Sprache: Englisch

Beschreibung

This book will give you an insight into controlling your applications with gestures and head motion and readying them for the web. Packed with real-world tasks, it begins with a walkthrough of the basic concepts of Computer Vision that the JavaScript world offers us, and you’ll implement various powerful algorithms in your own online application. Then, we move on to a comprehensive analysis of JavaScript functions and their applications. Furthermore, the book will show you how to implement filters and image segmentation, and use tracking.js and jsfeat libraries to convert your browser into Photoshop. Subjects such as object and custom detection, feature extraction, and object matching are covered to help you find an object in a photo. You will see how a complex object such as a face can be recognized by a browser as you move toward the end of the book. Finally, you will focus on algorithms to create a human interface.
By the end of this book, you will be familiarized with the application of complex Computer Vision algorithms to develop your own applications, without spending much time learning sophisticated theory.

Details

Das E-Book können Sie in Legimi-Apps oder einer beliebigen App lesen, die das folgende Format unterstützen:

EPUB

MOBI

Seitenzahl: 133

Veröffentlichungsjahr: 2015

Bewertungen

0,0

Rezensionen(0 Rezensionen)

Ähnliche

Der Weg zum erfolgreichen Unternehmer

Stefan Merath

Der Weg zum erfolgreichen Unternehmer

Stefan Merath

Denke (nach) und werde reich

Napoleon Hill

30 Minuten Resilienz

Ulrich Siegrist

Krebszellen mögen keine Himbeeren - Der große Bestseller - Vollständig überarbeitet und aktualisiert

Richard Béliveau

Die Hormonrevolution

Michael E Platt

Der Crash ist die Lösung

Matthias Weik

Günter, der innere Schweinehund, lernt verkaufen

Stefan Frädrich

Mission erfüllt

Owen Mark

Die Leber wächst mit ihren Aufgaben

Dr. med. Eckart von Hirschhausen

Macht, was ihr liebt!

Anja Förster

Der größte Raubzug der Geschichte

Matthias Weik

Unsere Hunde - gesund durch Homöopathie

Hans Günter Wolff

Die Jahrhundertlüge, die nur Insider kennen

Heiko Schrang

Organisation für Komplexität

Niels Pfläging

Radikal führen

Reinhard K. Sprenger

30 Minuten Sympathisch und souverän: So geht Vortragen!

Thomas Lorenz

BLACKOUT - Morgen ist es zu spät

Marc Elsberg

The Truth About Employee Engagement

Computer Vision for the Web

Credits

About the Author

About the Reviewer

www.PacktPub.com

Support files, eBooks, discount offers, and more

Why subscribe?

Free access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Downloading the color images of this book

Errata

Piracy

Questions

1. Math Never Was So Simple!

Installation and core structure representation of JSFeat

Initializing the project

Understanding a digital image

Loading an image into a matrix

Basic matrix operations

Going deeper

Matrix displaying

Useful functions and where to use them

Sorting using JSFeat

Linear algebra

A perspective example

Summary

2. Turn Your Browser into Photoshop

Introducing the tracking.js library

Installation and image loading

Conversion between JSFeat and tracking.js image formats

What is filtering and how to use it?

Image convolution

The Gaussian filter and separate convolution

The box blur

The integral image

Basic edge detection

The Sobel filter

Other operators

Advanced image processing

The Canny edge detector

Histogram equalization

Summary

3. Easy Object Detection for Everyone

Detecting color objects

Using predefined colors with the tracking.js library

Using your own colors

Digging into the tracking.js API

Using the <img> and <video> tags

Building a custom tracker

Image features

Detecting key points

A practical example

Descriptors and object matching

The BRIEF and ORB descriptors

Descriptors implementation and their matching

Finding an object location

Summary

4. Smile and Wave, Your Face Has Been Tracked!

Face detection with JSFeat

Face detection using Haar-like features

Brightness binary features

Tagging people with tracking.js

Haar features with tracking.js

Tagging people in photos

Head tracking with Camshift

The idea behind head tracking

The head tracking application

Summary

5. May JS Be with You! Control Your Browser with Motion

Basic tracking with tracking.js

An example of an object tracking application

Controlling objects with the head motion

The Head-coupled perspective

Controlling a simple box

Optical flow for motion estimation

The Lucas-Kanade optical flow

Optical flow map with oflow

Track points with JSFeat

Zooming with gestures

Summary

6. What's Next?

Refresh your memory

Importance of client-side scripting

Other interesting libraries in Computer Vision

CCV library and its extensions

Face detection and more

Object detection with js-objectdetect

Summary

Index

Computer Vision for the Web

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

First published: October 2015

Production reference: 1091015

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham B3 2PB, UK.

ISBN 978-1-78588-617-1

www.packtpub.com

Credits

Author

Foat Akhmadeev

Reviewer

Noritsuna Imamura

Acquisition Editor

Tushar Gupta

Content Development Editor

Amey Varangaonkar

Technical Editor

Vivek Pala

Copy Editor

Pranjali Chury

Project Coordinator

Francina Pinto

Proofreader

Safis Editing

Indexer

Mariammal Chettiyar

Graphics

Disha Haria

Production Coordinator

Arvindkumar Gupta

Cover Work

Arvindkumar Gupta

About the Author

Foat Akhmadeev has 5 years of experience in software development and research. He completed his master's degree in the year 2014 from the Kazan Federal University, Russia. He has worked on different projects, including development of high-loaded websites written in Java and real-time object detection for mobile phones. He has an extensive background in the field of Computer Vision. He has also written a scientific paper on 3D reconstruction from a single image. For more information, you can visit his website at http://foat.me.

About the Reviewer

Noritsuna Imamura is a specialist in embedded Linux/Android-based Computer Vision, and is one of the main members of SIProp.org (http://siprop.org/). His main works are as follows:

ITRI Smart Glass, which is similar to Google Glass. He worked on this using Android 4.3 and OpenCV 2.4 in June 2014 (https://www.itri.org.tw/chi/Content/techTransfer/tech_tran_cont.aspx?&SiteID=1&MmmID=620622510147005345&Keyword=&MSid=4858).The Treasure Hunting Robot, a brainwave controlling robot that he developed in February 2012 (http://www.siprop.org/en/2.0/index.php?product%2FTreasureHuntingRobot).OpenCV for Android NDK. This has been included since Android 4.0.1 (http://tools.oesf.biz/android-4.0.1_r1.0/search?q=SIProp).The Auto Chasing Turtle, a human face recognition robot with Kinect, which he developed in February 2011 (http://www.siprop.org/ja/2.0/index.php?product%2FAutoChasingTurtle).Feel Sketch—an AR Authoring Tool and AR Browser as an Android application, which he developed in December 2009 (http://code.google.com/p/feelsketch/).

He can be contacted via e-mail at <[email protected]>.

www.PacktPub.com

Support files, eBooks, discount offers, and more

For support files and downloads related to your book, please visit www.PacktPub.com.

Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at <[email protected]> for more details.

At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.

https://www2.packtpub.com/books/subscription/packtlib

Do you need instant solutions to your IT questions? PacktLib is Packt's online digital book library. Here, you can search, access, and read Packt's entire library of books.

Why subscribe?

Fully searchable across every book published by PacktCopy and paste, print, and bookmark contentOn demand and accessible via a web browser

Free access for Packt account holders

If you have an account with Packt at www.PacktPub.com, you can use this to access PacktLib today and view 9 entirely free books. Simply use your login credentials for immediate access.

Preface

Computer Vision is one of the popular areas in computer science that have gained widespread importance lately. Besides, the power of personal computers has grown, thus opening the gate for developers to use Computer Vision algorithms directly on end user machines using client-side scripting. Nowadays, the most popular programming language for the web is JavaScript. It allows us to develop complex algorithms and run them directly in a web browser; this solves several major problems—the user needs nothing but a browser to run a web application, and as a developer, you get a lower load on your server. In this book, we will provide a comprehensive overview of the most popular JavaScript libraries and discuss the techniques they provide to help you in your initial steps in exciting fields, such as image processing and Computer Vision. This book covers Computer Vision methods by providing an intuitive overview of each algorithm and showing clear examples of the usage of libraries.

What this book covers

Chapter 1, Math Never Was So Simple! covers most of the necessary math operations that you need for using Computer Vision libraries and developing your own application.

Chapter 2, Turning Your Browser into Photoshop, introduces the most popular image processing techniques that are commonly used in the Computer Vision area.

Chapter 3, Easy Object Detection for Everyone, provides a detailed overview of object detection in photos and videos, starting from the basic examples, such as detecting objects by color, to the more complex examples such as feature detection.

Chapter 4, Smile and Wave, Your Face Has Been Tracked! covers detection of the face and face particles. In addition to this, it gives you an example of head tracking.

Chapter 5, May JS Be with You! Control Your Browser with Motion, extends the topic of object detection to object tracking and provides exhaustive examples. It also demonstrates how to create a human interface using gestures or head motion.

Chapter 6, What's Next? summarizes all that we will do throughout this book. Moreover, it provides references to several libraries that are not presented here.

What you need for this book

This book provides an overview of various methods in the Computer Vision area using JavaScript. Knowledge of JavaScript at the beginner level is required. It is totally fine if you know nothing about Computer Vision. However, you will need some basic math knowledge to understand the concepts in the book. To start with, you just need a web browser and your favorite text editor. This book will guide you from the basics of Computer Vision to the most complex algorithms in the JavaScript world.

Who this book is for

You will find this book interesting if you want a much easier way to use the power of Computer Vision in web applications. It will also be beneficial for those who want to implement a human interface on their websites, for example, to create a game. Even if you do not want to add the Computer Vision functionality to your website, take a look at what we have here—websites with Computer Vision algorithms will probably be a trend in the future. Even now you can benefit from using client-side scripting for Computer Vision—you will not need to buy extra machines for your backend.

Reader feedback

Feedback from our readers is always welcome. Let us know what you think about this book—what you liked or may have disliked. Reader feedback is important for us to develop titles that you really get the most out of.

To send us general feedback, simply send an e-mail to <[email protected]>, and mention the book title via the subject of your message. If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, see our author guide on www.packtpub.com/authors.

Customer support

Now that you are the proud owner of a Packt book, we have a number of things to help you to get the most from your purchase.

Downloading the example code

You can download the example code files for all Packt books you have purchased from your account at http://www.packtpub.com. If you purchased this book elsewhere, you can visit http://www.packtpub.com/support and register to have the files e-mailed directly to you. Download link for the book: https://github.com/foat/computer-vision-for-the-web.

Downloading the color images of this book

We also provide you with a PDF file that has color images of the screenshots/diagrams used in this book. The color images will help you better understand the changes in the output. You can download this file from https://www.packtpub.com/sites/default/files/downloads/B05004_Computer_Vision_for_the_Web_ColorImage.pdf.

Errata

Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you find a mistake in one of our books—maybe a mistake in the text or the code—we would be grateful if you would report this to us. By doing so, you can save other readers from frustration and help us improve subsequent versions of this book. If you find any errata, please report them by visiting http://www.packtpub.com/submit-errata, selecting your book, clicking on the erratasubmissionform link, and entering the details of your errata. Once your errata are verified, your submission will be accepted and the errata will be uploaded on our website, or added to any list of existing errata, under the Errata section of that title. Any existing errata can be viewed by selecting your title from http://www.packtpub.com/support.

Piracy

Piracy of copyright material on the Internet is an ongoing problem across all media. At Packt, we take the protection of our copyright and licenses very seriously. If you come across any illegal copies of our works, in any form, on the Internet, please provide us with the location address or website name immediately so that we can pursue a remedy.

Please contact us at <[email protected]> with a link to the suspected pirated material.

We appreciate your help in protecting our authors, and our ability to bring you valuable content.

Questions

You can contact us at <[email protected]> if you are having a problem with any aspect of the book, and we will do our best to address it.

Chapter 1. Math Never Was So Simple!

Computer Vision is all about math. When you need to create your own algorithm or implement something, you address a math topic. You should know how it works on the inside because without digging into the basics, it is hard to do anything. But you are not alone! Many smart people have created several useful libraries to simplify your job. One of those libraries is JSFeat (http://inspirit.github.io/jsfeat/), which has a realization of different math methods. Here, we will discuss fundamental elements of the library such as data structures, especially matrices, and simple math algorithms.

We will cover the following topics:

Installation and core structure representation of JSFeatWhat is inside an image? All about matricesUseful functions and where to use them

Installation and core structure representation of JSFeat

JSFeat is a powerful tool to implement something new. To start using it, we need to initialize the project. It is relatively simple; if you have any experience with JavaScript, then it will not cause any trouble for you. The library itself contains various Computer Vision algorithms and it will be a good starting point for anyone who wants a flexible Computer Vision framework. First, you will learn how to install it and see a basic example of what you can do with the library.