Top 8 Image-Processing Python Libraries Used in Machine Learning –

According to IDC, digital knowledge will skyrocket as much as 175 zettabytes, and the large a part of this knowledge is pictures. Data scientists must (pre) course of these pictures earlier than feeding them into any machine studying fashions. They must do the essential (and generally soiled) work earlier than the enjoyable half begins.  

To course of a considerable amount of knowledge with effectivity and pace with out compromising the outcomes knowledge scientists want to make use of picture processing instruments for machine studying and deep studying duties.

In this text, I’m going to checklist out probably the most helpful picture processing libraries in Python that are getting used closely in machine studying duties.

1. OpenCV

Source: OpenCV

OpenCV is an open-source library that was developed by Intel within the yr 2000. It is usually utilized in laptop imaginative and prescient duties comparable to object detection, face detection, face recognition, picture segmentation, and many others but in addition comprises a whole lot of helpful capabilities that you could be want in ML.


import cv2 as cv
import numpy as np
import matplotlib.pyplot as plt
img = cv.imread('goku.jpeg')
gray_image = cv.cvtColor(img, cv.COLOR_BGR2GRAY)
fig, ax = plt.subplots(1, 2, figsize=(16, 8))
ax[0].imshow(cv.cvtColor(img, cv.COLOR_BGR2RGB))
ax[1].imshow(cv.cvtColor(gray_image, cv.COLOR_BGR2RGB))

A coloured picture consists of three colour channels the place a grey picture solely consists of 1 Color channel which carries depth data for every pixel displaying the picture as black-and-white.

The following code separates every colour channel:

import cv2 as cv
import numpy as np
import matplotlib.pyplot as plt
img = cv.imread('goku.jpeg')
b, g, r = cv.cut up(img)
fig, ax = plt.subplots(1, 3, figsize=(16, 8))
ax[0].imshow(cv.cvtColor(r, cv.COLOR_BGR2RGB))
ax[1].imshow(cv.cvtColor(g, cv.COLOR_BGR2RGB))
ax[2].imshow(cv.cvtColor(b, cv.COLOR_BGR2RGB))
Gray-scaling effect

Image translation

import cv2 as cv
import numpy as np
import matplotlib.pyplot as plt
picture = cv.imread("pics/goku.jpeg")
h, w = picture.form[:2]
half_height, half_width = h//4, w//8
transition_matrix = np.float32([[1, 0, half_width],
                               [0, 1, half_height]])
img_transition = cv.warpAffine(picture, transition_matrix, (w, h))
plt.imshow(cv.cvtColor(img_transition, cv.COLOR_BGR2RGB))
image translation

Above code interprets a picture from one coordinate to a unique coordinate.

Image rotation

import cv2 as cv
import numpy as np
import matplotlib.pyplot as plt
picture = cv.imread("pics/goku.jpeg")
h, w = picture.form[:2]
rotation_matrix = cv.getRotationMatrix2D((w/2,h/2), -180, 0.5)
rotated_image = cv.warpAffine(picture, rotation_matrix, (w, h))
plt.imshow(cv.cvtColor(rotated_image, cv.COLOR_BGR2RGB))
image rotation

Rotation of a picture for the X or Y-axis.

Scaling and resizing

import cv2 as cv
import numpy as np
import matplotlib.pyplot as plt
picture = cv.imread("pics/goku.jpeg")
fig, ax = plt.subplots(1, 3, figsize=(16, 8))
image_scaled = cv.resize(picture, None, fx=0.15, fy=0.15)
ax[0].imshow(cv.cvtColor(image_scaled, cv.COLOR_BGR2RGB))
ax[0].set_title("Linear Interpolation Scale")
image_scaled_2 = cv.resize(picture, None, fx=2, fy=2, interpolation=cv.INTER_CUBIC)
ax[1].imshow(cv.cvtColor(image_scaled_2, cv.COLOR_BGR2RGB))
ax[1].set_title("Cubic Interpolation Scale")
image_scaled_3 = cv.resize(picture, (200, 400), interpolation=cv.INTER_AREA)
ax[2].imshow(cv.cvtColor(image_scaled_3, cv.COLOR_BGR2RGB))
ax[2].set_title("Skewed Interpolation Scale")
scaling and resizing image

Scaling of a picture refers to changing a picture array into decrease or greater dimensions.

These are a number of the most simple operations that may be carried out with the OpenCV on a picture. Apart from this, OpenCV can carry out operations comparable to Image Segmentation, Face Detection, Object Detection, 3-D reconstruction, function extraction as effectively.

If you need to take a look at how these photos have been generated utilizing OpenCV then you possibly can take a look at this GitHub repository.

2. Sci-kit Image

sci-kit image

Source: sci-kit image

sci-kit image is a python-based picture processing library that has some components written in Cython (Cython is a programming language which is a superset of Python programming language designed to have efficiency like C programming language.) to attain good efficiency. It consists of algorithms for:

  • Segmentation, 
  • Geometric transformations, 
  • Color house manipulation,
  • Analysis,
  • Filtering, 
  • Morphology,
  • Feature detection, and extra 

You will discover it helpful for just about any laptop imaginative and prescient activity.

The sci-kit image makes use of NumPy arrays as picture objects.

Operation utilizing sci-kit image

Active contour

active contour

In laptop imaginative and prescient, contour fashions describe the boundaries of shapes in a picture.

“Active contour models are defined for image segmentation based on the curve flow, curvature, and contour to obtain the exact target region or segment in the image.”

Following code produces the above output:

import numpy as np
import matplotlib.pyplot as plt
from skimage.colour import rgb2gray
from skimage import knowledge
from skimage.filters import gaussian
from skimage.segmentation import active_contour
img = knowledge.astronaut()
s = np.linspace(0, 2*np.pi, 400)
x = 220 + 100*np.cos(s)
y = 100 + 100*np.sin(s)
init = np.array([x, y]).T
cntr = active_contour(gaussian(img, 3),init, alpha=0.015, beta=10, gamma=0.001)
fig, ax = plt.subplots(1, 2, figsize=(7, 7))
ax[0].set_title("Original Image")
ax[1].plot(init[:, 0], init[:, 1], '--r', lw=3)
ax[1].plot(cntr[:, 0], cntr[:, 1], '-b', lw=3)
ax[1].set_title("Active Contour Image")

3. SciPy


Source: Scipy

Scipy is used for mathematical and scientific computations however may carry out multi-dimensional picture processing utilizing the submodule scipy.ndimage. It gives capabilities to function on n-dimensional Numpy arrays and on the finish of the day pictures are simply that.

Scipy provides probably the most generally used picture processing operations like: 

  • Reading Images
  • Image Segmentation
  • Convolution
  • Face Detection
  • Feature Extraction and so forth.

Blurring a picture with scipy

from scipy import misc,ndimage
from matplotlib import pyplot as plt
face = misc.face()
blurred_face = ndimage.gaussian_filter(face, sigma=3)
fig, ax = plt.subplots(1, 2, figsize=(16, 8))
ax[0].set_title("Original Image")
ax[1].set_title("Blurred Image")


blurring image effect

You can discover all operations here.

4. Pillow/PIL

PIL logo

PIL (Python Imaging Library) is an open-source library for picture processing duties that requires python programming language. PIL can carry out duties on a picture comparable to studying, rescaling, saving in numerous picture codecs.

PIL can be utilized for Image archives, Image processing, Image show.

Image enhancement with PIL

For instance, let’s improve the next picture by 30% distinction.

from PIL import Image, ImageFilter
im ='cat_inpainted.png')
from PIL import ImageImprove
enh = ImageImprove.Contrast(im)
enh.improve(1.8).present("30% more contrast")


cat enhanced

For extra data go here

5. NumPy

numpy logo

An picture is actually an array of pixel values the place every pixel is represented by 1 (greyscale) or 3 (RGB) values. Therefore, NumPy can simply carry out duties comparable to picture cropping, masking, or manipulation of pixel values.

For instance to extract pink/inexperienced/blue channels from the next picture:

color extract

We can use numpy and “penalize” every channel separately by changing all of the pixel values with zero. 

from PIL import Image
import numpy as np
im = np.array('goku.png'))
im_R = im.copy()
im_R[:, :, (1, 2)] = 0
im_G = im.copy()
im_G[:, :, (0, 2)] = 0
im_B = im.copy()
im_B[:, :, (0, 1)] = 0
im_RGB = np.concatenate((im_R, im_G, im_B), axis=1)
pil_img = Image.fromarray(im_RGB)'goku.jpg')
colors extracted effect

6. Mahotas

Mahotas is one other picture processing and laptop imaginative and prescient library that was designed for bioimage informatics. It reads and writes pictures in NumPy array, and is applied in C++ with a easy python interface.

The hottest capabilities of Mahotas are

Let’s see how Template Matching might be achieved with Mahotas for discovering the wally.

The following code snippet helps to find the Wally within the crowd.

from pylab import imshow, present
import mahotas
import mahotas.demos
import numpy as np
wally = mahotas.demos.load('Wally')
wfloat = wally.astype(float)
r,g,b = wfloat.transpose((2,0,1))
w = wfloat.imply(2)
sample = np.ones((24,16), float)
for i in vary(2):
    sample[i::4] = -1
    v = mahotas.convolve(r-w, sample)
    masks = (v == v.max())
    masks = mahotas.dilate(masks, np.ones((48,24)))
    np.subtract(wally, .8*wally * ~masks[:,:,None], out=wally, casting='unsafe')

7. SimpleITK

ITK or Insight Segmentation and Registration Toolkit is an open-source platform that’s broadly used for Image Segmentation and Image Registration (a course of that overlays two or extra pictures).

Image segmentation

ITK makes use of the CMake construct surroundings and the library is applied in C++ which is wrapped for Python.

registration visualization

You can test this Jupyter Notebook for studying and analysis functions.

8. Pgmagick

Pgmagick is a GraphicsMagick binding for Python that gives utilities to carry out on pictures comparable to resizing, rotation, sharpening, gradient pictures, drawing textual content, and many others.

Blurring a picture

from pgmagick.api import Image 
img = Image('leena.jpeg') 
img.blur(10, 5)
blurring images
blurred image

Scaling of a picture

from pgmagick.api import Image 
img = Image('leena.png') 
img.scale((150, 100), 'leena_scaled')
scaling image
scaling image

For extra information, you possibly can test the curated checklist of Jupyter Notebooks here.

Final ideas

We have lined the highest Eight picture processing libraries for machine studying. Hopefully, you now have an concept of which a type of will work greatest on your mission. Best of luck. 🙂

Python & Machine Learning Instructor | Founder of


Please enter your comment!
Please enter your name here