eBook - ePub

OpenCV with Python By Example

Name: OpenCV with Python By Example
ISBN: 9781785283932

Prateek Joshi,

296 pages
English
ePUB (mobile friendly)
Available on iOS & Android

eBook - ePub

OpenCV with Python By Example

Prateek Joshi,

About this book

Build real-world computer vision applications and develop cool demos using OpenCV for Python

About This Book

Learn how to apply complex visual effects to images using geometric transformations and image filters
Extract features from an image and use them to develop advanced applications
Build algorithms to help you understand the image content and perform visual searches

Who This Book Is For

This book is intended for Python developers who are new to OpenCV and want to develop computer vision applications with OpenCV-Python. This book is also useful for generic software developers who want to deploy computer vision applications on the cloud. It would be helpful to have some familiarity with basic mathematical concepts such as vectors, matrices, and so on.

What You Will Learn

Apply geometric transformations to images, perform image filtering, and convert an image into a cartoon-like image
Detect and track various body parts such as the face, nose, eyes, ears, and mouth
Stitch multiple images of a scene together to create a panoramic image
Make an object disappear from an image
Identify different shapes, segment an image, and track an object in a live video
Recognize an object in an image and build a visual search engine
Reconstruct a 3D map from images
Build an augmented reality application

In Detail

Computer vision is found everywhere in modern technology. OpenCV for Python enables us to run computer vision algorithms in real time. With the advent of powerful machines, we are getting more processing power to work with. Using this technology, we can seamlessly integrate our computer vision applications into the cloud. Web developers can develop complex applications without having to reinvent the wheel.

This book will walk you through all the building blocks needed to build amazing computer vision applications with ease. We start off with applying geometric transformations to images. We then discuss affine and projective transformations and see how we can use them to apply cool geometric effects to photos. We will then cover techniques used for object recognition, 3D reconstruction, stereo imaging, and other computer vision applications.

This book will also provide clear examples written in Python to build OpenCV applications. The book starts off with simple beginner's level tasks such as basic processing and handling images, image mapping, and detecting images. It also covers popular OpenCV libraries with the help of examples.

The book is a practical tutorial that covers various examples at different levels, teaching you about the different functions of OpenCV and their actual implementation.

Style and approach

This is a conversational-style book filled with hands-on examples that are really easy to understand. Each topic is explained very clearly and is followed by a programmatic implementation so that the concept is solidified. Each topic contributes to something bigger in the following chapters, which helps you understand how to piece things together to build something big and complex.

Tools to learn more effectively

Saving Books

Keyword Search

Annotating Text

Listen to it instead

Information

Publisher

Packt Publishing

Year

2015

eBook ISBN

9781785283932

Edition

Topic

Computer Science

Subtopic

Open Source Programming

Index

Computer Science

OpenCV with Python By Example

Credits

About the Author

About the Reviewers

www.PacktPub.com

Support files, eBooks, discount offers, and more

Why subscribe?

Free access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Downloading the color images of this book

Errata

Piracy

Questions

1. Applying Geometric Transformations to Images

Installing OpenCV-Python

Windows

Mac OS X

Linux (for Ubuntu)

Reading, displaying, and saving images

What just happened?

Loading and saving an image

Image color spaces

Converting between color spaces

What just happened?

Image translation

What just happened?

Image rotation

What just happened?

Image scaling

What just happened?

Affine transformations

What just happened?

Projective transformations

What just happened?

Image warping

Summary

2. Detecting Edges and Applying Image Filters

2D convolution

Blurring

The size of the kernel versus the blurriness

Edge detection

Motion blur

Under the hood

Sharpening

Understanding the pattern

Embossing

Erosion and dilation

Afterthought

Creating a vignette filter

What's happening underneath?

How do we move the focus around?

Enhancing the contrast in an image

How do we handle color images?

Summary

3. Cartoonizing an Image

Accessing the webcam

Under the hood

Keyboard inputs

Interacting with the application

Mouse inputs

What's happening underneath?

Interacting with a live video stream

How did we do it?

Cartoonizing an image

Deconstructing the code

Summary

4. Detecting and Tracking Different Body Parts

Using Haar cascades to detect things

What are integral images?

Detecting and tracking faces

Understanding it better

Fun with faces

Under the hood

Detecting eyes

Afterthought

Fun with eyes

Positioning the sunglasses

Detecting ears

Detecting a mouth

It's time for a moustache

Detecting a nose

Detecting pupils

Deconstructing the code

Summary

5. Extracting Features from an Image

Why do we care about keypoints?

What are keypoints?

Detecting the corners

Good Features To Track

Scale Invariant Feature Transform (SIFT)

Speeded Up Robust Features (SURF)

Features from Accelerated Segment Test (FAST)

Binary Robust Independent Elementary Features (BRIEF)

Oriented FAST and Rotated BRIEF (ORB)

Summary

6. Creating a Panoramic Image

Matching keypoint descriptors

How did we match the keypoints?

Understanding the matcher object

Drawing the matching keypoints

Creating the panoramic image

Finding the overlapping regions

Stitching the images

What if the images are at an angle to each other?

Why does it look stretched?

Summary

7. Seam Carving

Why do we care about seam carving?

How does it work?

How do we define "interesting"?

How do we compute the seams?

Can we expand an image?

Can we remove an object completely?

How did we do it?

Summary

8. Detecting Shapes and Segmenting an Image

Contour analysis and shape matching

Approximating a contour

Identifying the pizza with the slice taken out

How to censor a shape?

What is image segmentation?

How does it work?

Watershed algorithm

Summary

9. Object Tracking

Frame differencing

Colorspace based tracking

Building an interactive object tracker

Feature based tracking

Background subtraction

Summary

10. Object Recognition

Object detection versus object recognition

What is a dense feature detector?

What is a visual dictionary?

What is supervised and unsupervised learning?

What are Support Vector Machines?

What if we cannot separate the data with simple straight lines?

How do we actually implement this?

What happened inside the code?

How did we build the trainer?

Summary

11. Stereo Vision and 3D Reconstruction

What is stereo correspondence?

What is epipolar geometry?

Why are the lines different as compared to SIFT?

Building the 3D map

Summary

12. Augmented Reality

What is the premise of augmented reality?

What does an augmented reality system look like?

Geometric transformations for augmented reality

What is pose estimation?

How to track planar objects?

What happened inside the code?

How to augment our reality?

Mapping coordinates from 3D to 2D

How to overlay 3D objects on a video?

Let's look at the code

Let's add some movements

Summary

Index

OpenCV with Python By Example

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

First published: September 2015

Production reference: 1150915

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham B3 2PB, UK.

ISBN 978-1-78528-393-2

www.packtpub.com

Credits

Author

Prateek Joshi

Reviewers

Will Brennan

Gabriel Garrido Calvo

Pavan Kumar Pavagada Nagaraja

Marvin Smith

Commissioning Editor

Deepika Gaonkar

Acquisition Editor

Tushar Gupta

Content Development Editor

Sumeet Sawant

Technical Editor

Ryan Kochery

Copy Editors

Merilyn Pereira

Angad Singh

Project Coordinator

Shweta H Birwatkar

Proofreader

Safis Editing

Indexer

Tejal Daruwale Soni

Graphics

Jason Monteiro

Production Coordinator

Manu Joseph

Cover Work

Manu Joseph

About the Author

Prateek Joshi is a computer vision researcher with a primary focus on content-based analysis. He is particularly interested in intelligent algorithms that can understand images to produce scene descriptions in terms of constituent objects. He has a master's degree from the University of Southern California, specializing in computer vision. He was elected to become a member of the Honor Society for academic excellence and an ambassador for the School of Engineering. Over the course of his career, he has worked for companies such as Nvidia, Microsoft Research, Qualcomm, and a couple of early stage start-ups in Silicon Valley.

His work in this field has resulted in multiple patents, tech demos, and research papers at major IEEE conferences. He has won many hackathons using a wide variety of technologies related to image recognition. He enjoys blogging about topics such as artificial intelligence, abstract mathematics, and cryptography. His blog has been visited by users in more than 200 countries, and he ...

OpenCV with Python By Example

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription

No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn how to download books offline

Perlego offers two plans: Essential and Complete

Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.

Both plans are available with monthly, semester, or annual billing cycles.

We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 990+ topics, we’ve got you covered! Learn about our mission

Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more about Read Aloud

Yes! You can use the Perlego app on both iOS and Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app

Yes, you can access OpenCV with Python By Example by Prateek Joshi in PDF and/or ePUB format, as well as other popular books in Computer Science & Open Source Programming. We have over one million books available in our catalogue for you to explore.

OpenCV with Python By Example

OpenCV with Python By Example

About this book

Tools to learn more effectively

Information

OpenCV with Python By Example

Table of Contents

OpenCV with Python By Example

Credits

About the Author

Table of contents

Frequently asked questions