Image Processing Masterclass with Python
eBook - ePub

Image Processing Masterclass with Python

50+ Solutions and Techniques Solving Complex Digital Image Processing Challenges Using Numpy, Scipy, Pytorch and Keras

Sandipan Dey

  1. English
  2. ePUB (disponibile sull'app)
  3. Disponibile su iOS e Android
eBook - ePub

Image Processing Masterclass with Python

50+ Solutions and Techniques Solving Complex Digital Image Processing Challenges Using Numpy, Scipy, Pytorch and Keras

Sandipan Dey

Dettagli del libro
Anteprima del libro
Indice dei contenuti
Citazioni

Informazioni sul libro

Over 50 problems solved with classical algorithms + ML / DL models

Key Features

  • Problem-driven approach to practice image processing.
  • Practical usage of popular Python libraries: Numpy, Scipy, scikit-image, PIL and SimpleITK.
  • End-to-end demonstration of popular facial image processing challenges using MTCNN and Microsoft's Cognitive Vision APIs.

Description
This book starts with basic Image Processing and manipulation problems and demonstrates how to solve them with popular Python libraries and modules. It then concentrates on problems based on Geometric image transformations and problems to be solved with Image hashing. Next, the book focuses on solving problems based on Sampling, Convolution, Discrete Fourier transform, Frequency domain filtering and image restoration with deconvolution. It also aims at solving Image enhancement problems using different algorithms such as spatial filters and create a super resolution image using SRGAN.Finally, it explores popular facial image processing problems and solves them with Machine learning and Deep learning models using popular python ML / DL libraries.

What you will learn

  • Develop strong grip on the fundamentals of Image Processing and Image Manipulation.
  • Solve popular Image Processing problems using Machine Learning and Deep Learning models.
  • Working knowledge on Python libraries including numpy, scipy and scikit-image.
  • Use popular Python Machine Learning packages such as scikit-learn, Keras and pytorch.
  • Live implementation of Facial Image Processing techniques such as Face Detection / Recognition / Parsing dlib and MTCNN.

Who this book is for
This book is designed specially for computer vision users, machine learning engineers, image processing experts who are looking for solving modern image processing/computer vision challenges.

Table of Contents
1. Chapter 1: Basic Image & Video Processing
2. Chapter 2: More Image Transformation and Manipulation
3. Chapter 3: Sampling, Convolution and Discrete Fourier Transform
4. Chapter 4: Discrete Cosine / Wavelet Transform and Deconvolution
5. Chapter 5: Image Enhancement
6. Chapter 6: More Image Enhancement
7. Chapter 7: Face Image Processing

About the Author
Sandipan Dey is a Data Scientist with a wide range of interests, covering topics such as Machine Learning, Deep Learning, Image Processing and Computer Vision. He has worked in numerous data science fields, such as recommender systems, predictive models for the events industry, sensor localization models, sentiment analysis, and device prognostics. He earned his master's degree in Computer Science from the University of Maryland, Baltimore County, and has published in a few IEEE data mining conferences and journals. He has also authored a couple of Image Processing books, published from an international publication house. He has earned certifications from 100+ MOOCs on data science and related courses. He is a regular blogger (at sandipanweb @wordpress, medium and data science central) and is a Machine Learning education enthusiast. LinkedIn Profile: https://www.linkedin.com/in/sandipan-dey-0370276

Domande frequenti

Come faccio ad annullare l'abbonamento?
È semplicissimo: basta accedere alla sezione Account nelle Impostazioni e cliccare su "Annulla abbonamento". Dopo la cancellazione, l'abbonamento rimarrà attivo per il periodo rimanente già pagato. Per maggiori informazioni, clicca qui
È possibile scaricare libri? Se sì, come?
Al momento è possibile scaricare tramite l'app tutti i nostri libri ePub mobile-friendly. Anche la maggior parte dei nostri PDF è scaricabile e stiamo lavorando per rendere disponibile quanto prima il download di tutti gli altri file. Per maggiori informazioni, clicca qui
Che differenza c'è tra i piani?
Entrambi i piani ti danno accesso illimitato alla libreria e a tutte le funzionalità di Perlego. Le uniche differenze sono il prezzo e il periodo di abbonamento: con il piano annuale risparmierai circa il 30% rispetto a 12 rate con quello mensile.
Cos'è Perlego?
Perlego è un servizio di abbonamento a testi accademici, che ti permette di accedere a un'intera libreria online a un prezzo inferiore rispetto a quello che pagheresti per acquistare un singolo libro al mese. Con oltre 1 milione di testi suddivisi in più di 1.000 categorie, troverai sicuramente ciò che fa per te! Per maggiori informazioni, clicca qui.
Perlego supporta la sintesi vocale?
Cerca l'icona Sintesi vocale nel prossimo libro che leggerai per verificare se è possibile riprodurre l'audio. Questo strumento permette di leggere il testo a voce alta, evidenziandolo man mano che la lettura procede. Puoi aumentare o diminuire la velocità della sintesi vocale, oppure sospendere la riproduzione. Per maggiori informazioni, clicca qui.
Image Processing Masterclass with Python è disponibile online in formato PDF/ePub?
Sì, puoi accedere a Image Processing Masterclass with Python di Sandipan Dey in formato PDF e/o ePub, così come ad altri libri molto apprezzati nelle sezioni relative a Computer Science e Software Development. Scopri oltre 1 milione di libri disponibili nel nostro catalogo.

Informazioni

Anno
2021
ISBN
9789389898644

CHAPTER 1

Basic Image and Video Processing

Introduction

Image processing refers to the automatic processing, manipulation, analysis, and interpretation of images using algorithms and codes on a computer. Video processing refers to a special case of image processing that often employs video filters and where the input and output signals are video files or video streams. Image and video processing have applications in many disciplines and fields in science and technology such as television, photography, robotics, remote sensing, medical diagnosis (CT scan/X-Ray/MRI), and industrial inspection. Social networking sites such as Facebook and Instagram, which we have got used to in our daily lives and where we upload tons of images/videos every day, are typical examples of the industries that need to use/innovate many image/video processing algorithms to process the images/videos we upload.
In this chapter, we shall solve a few initial image and video processing problems that will help us understand the basic concepts of image and video processing. Before we start processing/analysing an image/video, we need to be able to load the image into memory using a suitable data structure and also be able to save the processed image/video back to the disk. It is also important to be able to visualize (plot) the image on the computer screen (to see the impact of an image processing algorithm on an image immediately). Often an image/a video needs to be pre-processed before it can be used in some complex image/video processing algorithms (such as classification or segmentation that you will get to know more in the later chapters); some transformation/manipulation techniques (such as resizing/cropping/changing brightness and contrast) are very useful. Similarly, as a post-processing step, we may need to apply some image/video manipulation/transformation techniques to get back the desired output. With image transformation and manipulation, we can also enhance the appearance of an image (for example, by applying a filter).
In this chapter, you are going to learn how to use different Python libraries (numpy, scipy, scikit-image, opencv-python, and matplotlib) for basic image/video processing, manipulation, and transformation. We shall start by displaying the three channels of an RGB image with 3D visualizations. Next, we shall demonstrate how to capture a video from a camera and extract frames. Then, we shall show how to implement Instagram-like Gotham filter. Finally, we shall explore the following few problems on image manipulations and see how to solve them using python libraries:
  • Plot image montage, crop/resize images, and draw contours
  • Convert PNG image with a palette to grayscale
  • Rotate an image and convert RGB to YUV color space (using scikit-image, PIL, python-opencv, and scipy.ndimage/misc)

Structure

This chapter is organized as follows:
  • Objectives
  • Problems
    Display RGB image color channels in 3D
    Video I/O
    Read/write video files
    Capture video from camera and extract frames with OpenCV-Python
    Implement Instagram-like Gotham filter
    Explore image manipulations (using scikit-image, PIL, python-opencv, and scipy ndimage/misc)
    Plot image montage with scikit-image
    Crop/resize images with SciPy ndimage module
    Draw contours with OpenCV-Python
    Counting objects in an image
    Convert a PNG image with a palette to grayscale with PIL
    Different ways to convert an RGB image to grayscale
    Rotating an image with scipy.ndimage
    Image differences with PIL
    Converting RGB to HSV and YUV color spaces with scikit-image
    Resizing an image with OpenCV-Python
    Add a logo to an image with scikit-image
    Change brightness/contrast of an image with linear transformation and gamma correction with OpenCV-Python
    Detecting colors and changing colors of objects with OpenCV-Python
    Object removal with seam carving
    Creating fake miniature effect
  • Summary
  • Questions
  • Key terms
  • References

Objectives

After studying this Chapter, you should be able to:
  • Understand the image/video storage and data structures in python
  • Do image/video file I/O in python using different libraries
  • Write python code to do basic image/video manipulations

Problems

Display RGB image color channels in 3D

It is very useful to be able to conceptualize an image as a function and visualize it to understand what it is and then do further analysis/processing. A grayscale image can be thought of a 2-D function f(x, y) of the pixel locations (x, y), a function that maps each pixel into its corresponding grey level (for example, an integer in [0,255] or equivalently a floating-point number in [0,1]), that is:
f : (x, y)R
For an RGB image, there are three such functions that can be denoted as:
fR (x, y), fG (x. y) and fB(x. y)
which is corresponding to each of the channels R, G, and B, respectively. The library matplotlib’s 3-D plot functions can be used to plot each of these functions. The following Python code shows how to plot the RGB channels separately in 3D.
The following are the steps you need to follow:
  1. First, start by importing all the required packages by using the following code. For reading an image, we need the imread() function from the scikit-image library’s io module. For array operations, ...

Indice dei contenuti

Stili delle citazioni per Image Processing Masterclass with Python

APA 6 Citation

Dey, S. (2021). Image Processing Masterclass with Python ([edition unavailable]). BPB Publications. Retrieved from https://www.perlego.com/book/2661000/image-processing-masterclass-with-python-50-solutions-and-techniques-solving-complex-digital-image-processing-challenges-using-numpy-scipy-pytorch-and-keras-pdf (Original work published 2021)

Chicago Citation

Dey, Sandipan. (2021) 2021. Image Processing Masterclass with Python. [Edition unavailable]. BPB Publications. https://www.perlego.com/book/2661000/image-processing-masterclass-with-python-50-solutions-and-techniques-solving-complex-digital-image-processing-challenges-using-numpy-scipy-pytorch-and-keras-pdf.

Harvard Citation

Dey, S. (2021) Image Processing Masterclass with Python. [edition unavailable]. BPB Publications. Available at: https://www.perlego.com/book/2661000/image-processing-masterclass-with-python-50-solutions-and-techniques-solving-complex-digital-image-processing-challenges-using-numpy-scipy-pytorch-and-keras-pdf (Accessed: 15 October 2022).

MLA 7 Citation

Dey, Sandipan. Image Processing Masterclass with Python. [edition unavailable]. BPB Publications, 2021. Web. 15 Oct. 2022.