Computer Vision Lab Manual 2023
Computer Vision
(3171614)
B.E. Semester 7
Place: __________
Date: __________
Preface
The main aim of any laboratory, practical or field work is to enhance the required skills and to
develop in students the ability to solve real-time problems by building the relevant competencies
in the psychomotor domain. Keeping this in view, GTU has designed a competency-focused,
outcome-based curriculum for its engineering degree programmes in which sufficient weightage is
given to practical work. This underlines the importance of skill enhancement and encourages
students, instructors and faculty members to use every moment allotted to practical work to achieve
the relevant outcomes by actually performing the experiments, rather than treating them as mere
study exercises. For effective implementation of a competency-focused, outcome-based curriculum,
every practical must be carefully designed to serve as a tool for developing and enhancing the
industry-relevant competencies in every student. These psychomotor skills are very difficult to
develop through the traditional chalk-and-board method of content delivery in the classroom.
Accordingly, this lab manual is designed to focus on industry-defined, relevant outcomes rather
than the old practice of conducting practicals merely to prove a concept or theory.
By using this lab manual, students can go through the relevant theory and procedure in advance of
the actual performance, which creates interest and gives them a basic idea before the session; this
in turn strengthens the intended outcomes. Each experiment in this manual begins with the
competency, industry-relevant skills, course outcomes and practical outcomes (objectives). Students
are also made aware of the safety measures and necessary precautions to be taken while performing
the practical.
This manual also provides guidelines to faculty members for facilitating student-centric lab
activities in each experiment by arranging and managing the necessary resources, so that students
follow the procedures with the required safety and precautions to achieve the outcomes. It also
indicates, through rubrics, how students will be assessed.
Computer vision is a professional elective course which deals with principles of image
formation, image processing algorithms and recognition from single or multiple images (video).
This course emphasizes the core vision tasks of scene understanding and recognition.
Applications to object recognition, image analysis, image retrieval and object tracking will be
discussed.
Utmost care has been taken while preparing this lab manual; however, there is always scope for
improvement. We therefore welcome constructive suggestions for improvement and for the removal
of any errors.
Computer Vision (3171614)
The following industry relevant competencies are expected to be developed in the student by
undertaking the practical work of this laboratory.
1. Will be able to solve open design problems
2. Will be able to apply the knowledge, techniques, skills and modern tools to become
successful professionals in computer vision industries.
Index
(Progressive Assessment Sheet)
Experiment No: 1
Date:
Objectives:
Theory:
Read Image: The function to read an image essentially takes the grey values of all the pixels in a
greyscale image and puts them into a matrix. This matrix then becomes a variable of the
programming platform we use, such as MATLAB or Python (with OpenCV). For a greyscale image the size
of this matrix is MxN, whereas for a colour image with an RGB (Red, Green, Blue) colour palette the
matrix holds 3 x (MxN) values. Here MxN is the resolution of the image. In general, the read
function reads the pixel values from an image file and returns a matrix of all the pixel values.
Write Image: Once we have captured image data, i.e. a matrix with MxN resolution, either by
digitally capturing it, extracting it from a video sequence or by processing an input image, we
would want to save the image on the computer, i.e. write the image. The write function enables us
to write this data from the matrix variable onto the hard disk, at the desired location and in the
file format corresponding to the matrix. An MxN matrix generates a greyscale image, while a
3 x (MxN) matrix containing data for the R, G and B colour planes generates a colour image.
Image types: digital images are commonly handled as one of three types:
1. Binary images
2. Greyscale images
3. RGB images
Binary, as the name suggests, is an image whose pixels are either black or white. Greyscale images
have pixel values from 0 (black) to 255 (white). RGB images are true colour images with values
between 0 and 255 for each of the Red, Green and Blue components. Within the limitations of
arithmetic conversion, we can convert images from one type to another; RGB to grey and grey to RGB
are examples of such image conversions.
Image Complement: In the complement of a binary image, zeros become ones and ones become
zeros. Black and white are reversed. In the complement of a grayscale or color image, each pixel
value is subtracted from the maximum pixel value supported by the class (or 1.0 for double-
precision images). The difference is used as the pixel value in the output image. In the output
image, dark areas become lighter and light areas become darker. For color images, reds become
cyan, greens become magenta, blues become yellow, and vice versa.
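For illustration, a minimal OpenCV sketch of the complement of an 8-bit greyscale image (the image path is an assumption):
import cv2
img = cv2.imread('input_image.jpg', cv2.IMREAD_GRAYSCALE)
complement = 255 - img      # equivalent to cv2.bitwise_not(img) for 8-bit images
cv2.imwrite('complement_output.jpg', complement)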
Procedure:
1. Image read :
matrix variable = read image ( From_Location )
Display the image
2. Image Write: write image( To_Location, matrix variable)
3. Image conversion:
y=rgb2gray(x);
y=gray2rgb(x)
4. Complement of Image:
for each value of x in the image
y= 255-x
save y
Program:
import cv2
# Read the input image (path assumed; adjust as needed)
img = cv2.imread('input_image.jpg')
if img is not None:
    # Convert to greyscale and take the complement, then write the results to disk
    cv2.imwrite('output.jpg', img)
    cv2.imwrite('gray_output.jpg', cv2.cvtColor(img, cv2.COLOR_BGR2GRAY))
    cv2.imwrite('complement_output.jpg', 255 - img)
    cv2.imshow('Original image', img)
    cv2.waitKey(0)
    cv2.destroyAllWindows()
else:
    print("Image not found or couldn't be loaded.")
Output:
output.jpg
gray_output.jpg
complement_output.jpg
Conclusion:
Basic image processing operations are essential for various computer vision and image analysis
tasks. Understanding how to read, manipulate, and save images, as well as perform
conversions and enhancements, lays the foundation for more advanced image processing
techniques and applications. These operations are the building blocks for more complex
image analysis tasks such as object detection, image recognition, and image segmentation.
Quiz:
1. If you have access to a digital camera capable of capturing images with 1024x768
resolution for a fixed scene, using all possible camera settings, what is the smallest file you
can create?
➢ In general, the smallest file size can be achieved by using strong image
compression techniques (e.g., JPEG compression) and capturing a scene with
minimal detail or changes in color. However, the specific file size can vary widely
depending on the camera's compression algorithm, the image content, and the
desired image quality.
2. Is it possible to convert an original greyscale image to RGB?
➢ Yes, a greyscale image can be converted to an RGB (Red, Green, Blue) image, typically by
replicating the grey values into all three channels; the original colours, however, cannot be recovered.
Suggested Reference:
1. Digital Image Processing by S. Sridhar. Oxford Press.
2. https://www.mathworks.com/help/matlab/ref/imwrite.html
Criteria 1 2 3 4 5 Total
Marks
Experiment No: 2
Date:
Objectives:
Theory:
Contrast stretching: It is an image enhancement technique that tries to improve the contrast by
stretching the intensity values of an image to fill the entire dynamic range. The transformation
function used is always linear and monotonically increasing. If the minimum intensity value (r_min)
present in the image is, say, 100, it is stretched down to the lowest possible intensity value 0.
Likewise, if the maximum intensity value (r_max) is less than the highest possible intensity value
255, it is stretched up to 255. The range 0-255 is taken as the standard minimum and maximum
intensity for 8-bit images. The general formula for contrast stretching is given by equation (2.1):

    s = (r - r_min) * (s_max - s_min) / (r_max - r_min) + s_min        eq. (2.1)

where r is the current pixel intensity value, r_min and r_max are the minimum and maximum intensity
values present in the whole image, and s_min and s_max are the intended (output) minimum and
maximum intensity values (0 and 255 for 8-bit images).
(a) Input image before contrast stretching along with its histogram
(b) Input image after contrast stretching along with its histogram
Figure 2.1: Results of contrast stretching
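As an illustration of eq. (2.1), the following is a minimal NumPy sketch of contrast stretching for an 8-bit greyscale image (the function name and default output range are assumptions made for this example):
import numpy as np

def contrast_stretch(img, out_min=0, out_max=255):
    # eq. (2.1): map [r_min, r_max] of the input linearly onto [out_min, out_max]
    r_min, r_max = int(img.min()), int(img.max())
    stretched = (img.astype(np.float32) - r_min) * (out_max - out_min) / (r_max - r_min) + out_min
    return stretched.astype(np.uint8)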
Procedure in Matlab:
Contrast stretching:
I = imread('<input image>');
figure
imshow(I)
J = imadjust(I,stretchlim(I),[]);
figure
imshow(J)
Histogram Equalization:
I = imread('<input image>');
figure
subplot(1,3,1)
imshow(I)
subplot(1,3,2:3)
imhist(I)
J = histeq(I);
figure
subplot(1,3,1)
imshow(J)
subplot(1,3,2:3)
imhist(J)
Program:
import cv2
import numpy as np
import matplotlib.pyplot as plt
# Read the input image in greyscale (path assumed; adjust as needed)
image = cv2.imread('input_image.jpg', cv2.IMREAD_GRAYSCALE)
# Contrast stretching: linearly map [min, max] of the image onto [0, 255]
min_val, max_val = int(np.min(image)), int(np.max(image))
contrast_adjusted_image = ((image - min_val) * (255.0 / (max_val - min_val))).astype(np.uint8)
# Histogram equalization
equalized_image = cv2.equalizeHist(image)
cv2.imwrite('contrast_adjusted_output.jpg', contrast_adjusted_image)
cv2.imwrite('equalized_output.jpg', equalized_image)
cv2.waitKey(0)
cv2.destroyAllWindows()
# Original Image Histogram
plt.subplot(2, 2, 1)
plt.hist(image.ravel(), 256, [0, 256])
plt.title('Original Image Histogram')
# Contrast Adjusted Image Histogram
plt.subplot(2, 2, 2)
plt.hist(contrast_adjusted_image.ravel(), 256, [0, 256])
plt.title('Contrast Adjusted Histogram')
# Equalized Image Histogram
plt.subplot(2, 2, 3)
plt.hist(equalized_image.ravel(), 256, [0, 256])
plt.title('Equalized Image Histogram')
plt.show()
Output:
Original image Histogram
Conclusion:
This practical has provided valuable hands-on experience in image enhancement
techniques. By adjusting image contrast using contrast stretching and equalizing image
histograms, we have learned important tools for improving the quality and interpretability
of digital images, ultimately enhancing our skills in image analysis and processing.
Quiz:
1. Differentiate between contrast stretching and histogram equalization
→ Contrast stretching primarily stretches the intensity range, while histogram
equalization redistributes intensity values to achieve a more uniform histogram. The
choice between these techniques depends on the specific requirements and
characteristics of the image being processed
2. Is it possible to revert to the original image after applying contrast stretching or histogram
equalization?
→ Although one can attempt to revert to the original image after applying contrast
stretching or histogram equalization, the process may not give a perfect reconstruction,
because information is lost during enhancement (rounding in contrast stretching, and the
merging of intensity levels in histogram equalization). How well the reversal works depends
on the characteristics of the original image and the extent of the enhancement applied.
Suggested Reference:
Rubric wise marks obtained:
Program execution: Excellent (4) - program executes correctly with no syntax or runtime errors;
Good (3) - executes with a minor (easily fixed) error; Fair (2) - executes with multiple minor
errors; Beginning (0-1) - program does not execute.
Design - Correctness of output: Excellent (4) - program displays correct output with no errors;
Good (3) - output/design of output has minor errors; Fair (2) - output/design of output has
multiple errors; Beginning (0-1) - output is incorrect.
Design of logic: Excellent (4) - program is logically well designed; Good (3) - slight logic errors
that do not significantly affect the results; Fair (2) - significant logic errors;
Beginning (0-1) - logic is incorrect.
Standards: Excellent (4) - program is stylistically well designed; Good (3) - few inappropriate
design choices (i.e. poor variable names, improper indentation); Fair (2) - several inappropriate
design choices; Beginning (0-1) - program is poorly written.
Documentation: Excellent (4) - program is well documented; Good (3) - missing one required comment;
Fair (2) - missing two or more required comments; Beginning (0-1) - most or all documentation missing.
Criteria 1 2 3 4 5 Total
Marks
Experiment No: 3
Implement the various low pass and high pass filtering mechanisms.
Date:
Objectives:
1. Image enhancement such as smoothing, sharpening and edge enhancement using various
filters.
Theory:
Filtering is a technique for modifying or enhancing an image. For example, you can filter an
image to emphasize certain features or remove other features. Image processing operations
implemented with filtering include smoothing, sharpening, and edge enhancement.
Low pass filter (smoothing): Low pass filter is the type of frequency domain filter that is used
for smoothing the image. It attenuates the high-frequency components and preserves the low-
frequency components. High frequency content corresponds to boundaries of the objects. An
image is smoothed by decreasing the disparity between pixel values by averaging nearby pixels.
The low-pass filters usually employ moving window operator which affects one pixel of the
image at a time, changing its value by some function of a local region (window) of pixels. The
operator moves over the image to affect all the pixels in the image.
Mean filtering: It is used as a method of smoothing images, reducing the amount of intensity
variation between one pixel and the next resulting in reducing noise in images. The idea of mean
filtering is simply to replace each pixel value in an image with the mean (average) value of its
neighbors, including itself. This has the effect of eliminating pixel values which are
unrepresentative of their surroundings.
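For example, a 3x3 mean filter is a convolution with an averaging kernel; a minimal OpenCV sketch (the image path is an assumption):
import cv2
import numpy as np
img = cv2.imread('input_image.jpg', cv2.IMREAD_GRAYSCALE)
kernel = np.ones((3, 3), np.float32) / 9        # each output pixel is the mean of its 3x3 neighbourhood
mean_filtered = cv2.filter2D(img, -1, kernel)   # equivalent to cv2.blur(img, (3, 3))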
Median filter: Median filtering is a nonlinear operation often used in image processing to reduce
"salt and pepper" noise. Median filter replaces the pixel at the center of the filter with the median
value of the pixels falling beneath the mask. Median filter does not blur the image but it rounds
the corners.
Figure 3.1: Original image, mean filtered output and median filtered output in the order of left to
right
High pass filter (sharpening and edge enhancement): High pass filter is the type of frequency
domain filter that is used for sharpening the image. It attenuates the low-frequency components
and preserves the high-frequency components. A high-pass filter can be used to make an image
appear sharper. These filters emphasize fine details in the image - the opposite of the low-pass
filter. High-pass filtering works in the same way as low-pass filtering; it just uses a different
convolution kernel. Prewitt and Sobel are derivative filters used as edge detectors.
Laplacian filter: One of the best-known high-pass filters is the Laplacian edge enhancement. Its
meaning can be understood as follows: we subtract the image from a blurred version of itself created
by averaging the four nearest neighbours. This enhances edges and isolated pixels with
extreme values. Because this method is very sensitive to noise, the Laplacian of Gaussian (LoG) is often used instead.
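A minimal OpenCV sketch of this idea, smoothing with a Gaussian before applying the Laplacian (the image path and kernel sizes are assumptions):
import cv2
import numpy as np
img = cv2.imread('input_image.jpg', cv2.IMREAD_GRAYSCALE)
blurred = cv2.GaussianBlur(img, (3, 3), 0)                 # suppress noise first
log_edges = cv2.Laplacian(blurred, cv2.CV_64F, ksize=3)    # Laplacian of the smoothed image
log_edges = np.uint8(np.absolute(log_edges))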
Procedure in Matlab:
Low Pass Filters:
I = imread('<input image>');
h = 1/3*ones(3,1);
H = h*h';
imfilt = filter2(H,I);   % Mean filter for 3x3
J = medfilt2(I);         % Median filter
Program:
import cv2
import numpy as np
# Read the input image in greyscale (path assumed; adjust as needed)
img = cv2.imread('input_image.jpg', cv2.IMREAD_GRAYSCALE)
cv2.imshow('Mean filter (low pass)', cv2.blur(img, (3, 3)))
cv2.imshow('Median filter (low pass)', cv2.medianBlur(img, 3))
cv2.imshow('Laplacian filter (high pass)', np.uint8(np.absolute(cv2.Laplacian(img, cv2.CV_64F))))
cv2.waitKey(0)
cv2.destroyAllWindows()
Output:
Conclusion:
In this practical, we explored various image filtering techniques, including low-pass filtering and
high-pass filtering, using the OpenCV library in Python. The practical aimed to develop skills in
image enhancement for tasks such as smoothing, sharpening, and edge enhancement.
Quiz:
3. It is necessary to use Gaussian smoothing before using Laplacian filter. Justify.
→ Yes, it's necessary to use Gaussian smoothing before the Laplacian filter to reduce noise
and avoid amplifying noise artifacts during edge enhancement.
Suggested Reference:
Criteria 1 2 3 4 5 Total
Marks
Experiment No: 4
Date:
Objectives:
Theory:
Fourier Transform is an important image processing tool which is used to decompose an image
into its sine and cosine components. The output of the transformation represents the image in
the Fourier or frequency domain, while the input image is the spatial domain equivalent. In the
Fourier domain image, each point represents a particular frequency contained in the spatial domain
image. The Fourier Transform is used in a wide range of applications, such as image analysis,
image filtering, image reconstruction and image compression.
Procedure in Matlab:
Program:
import cv2
import numpy as np
from matplotlib import pyplot as plt
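# A minimal sketch of the DFT computation (the input path is an assumption):
# compute the 2-D DFT with NumPy and display the centred magnitude spectrum.
img = cv2.imread('input_image.jpg', cv2.IMREAD_GRAYSCALE)
f = np.fft.fft2(img)                          # 2-D Fourier transform of the image
fshift = np.fft.fftshift(f)                   # shift the zero-frequency component to the centre
magnitude_spectrum = 20 * np.log(np.abs(fshift) + 1)
plt.subplot(121), plt.imshow(img, cmap='gray'), plt.title('Input Image')
plt.subplot(122), plt.imshow(magnitude_spectrum, cmap='gray'), plt.title('Magnitude Spectrum')
plt.show()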
Output:
Conclusion:
Quiz:
1. Discuss Properties of Fourier Transform
Suggested Reference:
Criteria 1 2 3 4 5 Total
Marks
Experiment No: 5
Date:
Objectives:
Theory:
Scale-Invariant Feature Transform (SIFT): SIFT is invariant to image scale and rotation. In
general, the SIFT algorithm can be decomposed into four steps: (a) feature point (also called
keypoint) detection, (b) feature point localization, (c) orientation assignment and (d) feature
descriptor generation.
Histogram of Oriented Gradients (HOG): This feature descriptor is used for the purpose of
object detection. The technique counts occurrences of gradient orientation in localized portions of
an image. This method is similar to that of edge orientation histograms, scale-invariant feature
transform descriptors, and shape contexts, but differs in that it is computed on a dense grid of
uniformly spaced cells and uses overlapping local contrast normalization for improved accuracy.
The HOG feature vector is arranged by HOG blocks. The cell histogram, H(Cyx), is 1-by-NumBins.
The figure below shows the HOG feature vector with a 1-by-1 cell overlap between blocks.
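As a worked example of this arrangement (the 64x128 detection window is an assumption chosen for illustration): with 8x8-pixel cells, 2x2-cell blocks, a one-cell block stride and 9 orientation bins, a 64x128 window has 8x16 cells and (8-1) x (16-1) = 105 overlapping blocks, so the final HOG feature vector has 105 x 2 x 2 x 9 = 3780 elements.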
Procedure in OpenCV:
SIFT:
img = cv2.imread('image_name')
imgGray = cv2.cvtColor(img,cv2.COLOR_BGR2GRAY)
sift = cv2.SIFT_create()
keypoints,descriptors = sift.detectAndCompute(img, None)
sift_image = cv2.drawKeypoints(imgGray, keypoints, img)
HOG:
img = cv2.imread('image_name')
(hog, hog_image) = feature.hog(img, orientations=9,pixels_per_cell = (8,8),
cells_per_block=(2,2),block_norm='L2-Hys', visualize=True, transform_sqrt=True)
cv2.imshow("Ori", img)
cv2.imshow('HOG IMAGE', hog_image)
Program:
import cv2
from skimage import feature
import matplotlib.pyplot as plt

# Read the input image and convert it to greyscale
image_path = './input_image.jpg'
img = cv2.imread(image_path)
imgGray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

# Detect SIFT keypoints and compute descriptors
sift = cv2.SIFT_create()
keypoints, descriptors = sift.detectAndCompute(imgGray, None)
# Draw SIFT keypoints on the image
sift_image = cv2.drawKeypoints(imgGray, keypoints, img)

# Compute HOG features and the HOG visualisation image
(hog_vec, hog_image) = feature.hog(imgGray, orientations=9, pixels_per_cell=(8, 8),
    cells_per_block=(2, 2), block_norm='L2-Hys', visualize=True, transform_sqrt=True)

plt.subplot(121)
plt.title('SIFT Keypoints')
plt.imshow(cv2.cvtColor(sift_image, cv2.COLOR_BGR2RGB))
plt.axis('off')
plt.subplot(122)
plt.title('HOG Features')
plt.imshow(hog_image, cmap=plt.cm.gray)
plt.axis('off')
plt.show()
Output:
Conclusion:
In this experiment, we successfully utilized Scale-Invariant Feature Transform (SIFT) and
Histogram of Oriented Gradients (HOG) features for image analysis. SIFT provided robust
keypoint detection, while HOG described object shapes effectively. These techniques
enhance image processing and feature extraction for various computer vision applications.
Quiz:
1. Compare HOG and SIFT feature descriptors
◈ HOG is well-suited for tasks where capturing object shapes in various scales is critical,
whereas SIFT excels in scenarios where keypoint matching and recognition under scale
and rotation variations are important. The choice between them depends on the specific
requirements of the computer vision application.
Suggested Reference:
1. Digital Image Processing by S. Sridhar. Oxford Press.
2. https://in.mathworks.com/help/vision/ref/extracthogfeatures.html
3. https://towardsdatascience.com/hog-histogram-of-oriented-gradients-67ecd887675f
Criteria 1 2 3 4 5 Total
Marks
Experiment No: 6
Date:
Objectives:
Theory:
Segmentation: Instead of processing the entire image, a common practice is to extract the Region
of Interest (RoI). Image segmentation is a method of dividing a digital image into subgroups called
image segments, reducing the complexity of the image and enabling further processing or analysis
of each image segment. Technically, segmentation is the assignment of labels to pixels to identify
objects, people, or other important elements in the image. Image segmentation could involve
separating foreground from background, or clustering regions of pixels based on similarities in
color or shape. For example, a common application of image segmentation in medical imaging is
to detect and label pixels in an image or voxels of a 3D volume that represent a tumor in a patient’s
brain or other organs. Image segmentation is typically used to locate objects and boundaries (lines,
curves, etc.) in images. Types of segmentation are as below:
Edge-Based Segmentation: This technique identifies the edges of various objects in a given
image. It helps locate features of associated objects in the image using the information from the
edges. Edge detection helps strip images of redundant information, reducing their size and
facilitating analysis. Edge-based segmentation algorithms identify edges based on contrast, texture,
color, and saturation variations. They can accurately represent the borders of objects in an image
using edge chains comprising the individual edges.
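A minimal OpenCV sketch of edge-based segmentation using the Canny edge detector (the image path and hysteresis thresholds are assumptions):
import cv2
img = cv2.imread('input_image.jpg', cv2.IMREAD_GRAYSCALE)
edges = cv2.Canny(img, 100, 200)     # lower and upper hysteresis thresholds
cv2.imwrite('edges.jpg', edges)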
Threshold Based: It is the simplest image segmentation method, dividing pixels based on their
intensity relative to a given value or threshold. It is suitable for segmenting objects with higher
intensity than other objects or backgrounds. The threshold value T can work as a constant in low-
noise images. In some cases, it is possible to use dynamic thresholds.
Once the mask is ready then the RoI can be segmented out of the given image with the help of the
mask.
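A minimal sketch of threshold-based segmentation followed by masking out the RoI, using Otsu's method to pick the threshold T automatically (the image path is an assumption):
import cv2
img = cv2.imread('input_image.jpg')
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
_, mask = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)  # binary mask
roi = cv2.bitwise_and(img, img, mask=mask)   # keep only the pixels selected by the mask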
Procedure:
Program:
import cv2
import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans  # Import the KMeans class

# Read the image (path assumed) and reshape its pixels into an (N, 3) array of RGB values
image = cv2.cvtColor(cv2.imread('input_image.jpg'), cv2.COLOR_BGR2RGB)
pixels = image.reshape(-1, 3)

# Cluster the pixel colours with K-means and rebuild a label map with the image shape
num_clusters = 3
labels = KMeans(n_clusters=num_clusters, n_init=10).fit(pixels).labels_.reshape(image.shape[:2])

# One masked image per cluster (the segments)
segmented_images = [np.where(labels[..., None] == i, image, 0).astype(np.uint8) for i in range(num_clusters)]

plt.subplot(1, num_clusters + 1, 1), plt.imshow(image), plt.title('Original')
for i in range(num_clusters):
    plt.subplot(1, num_clusters + 1, i + 2)
    plt.imshow(segmented_images[i])
    plt.title(f'Segment {i + 1}')
plt.show()
Output:
Conclusion:
This practical exercise provided hands-on experience in image segmentation, showcasing
how this technique can be employed to extract meaningful information from complex
images. It opens the door to further exploration and experimentation with image
processing techniques for various real-world applications.
Quiz:
1. Discuss applications of different segmentation techniques
→ Each segmentation technique has its strengths and weaknesses, making them suitable for
specific tasks. The choice of technique depends on the nature of the data and the objectives
of the image analysis task. In many cases, a combination of these techniques or more
advanced methods like deep learning-based segmentation is used to achieve more accurate
and robust results.
Suggested Reference:
Rubric wise marks obtained:
Program execution: Excellent (4) - program executes correctly with no syntax or runtime errors;
Good (3) - executes with a minor (easily fixed) error; Fair (2) - executes with multiple minor
errors; Beginning (0-1) - program does not execute.
Design - Correctness of output: Excellent (4) - program displays correct output with no errors;
Good (3) - output/design of output has minor errors; Fair (2) - output/design of output has
multiple errors; Beginning (0-1) - output is incorrect.
Design of logic: Excellent (4) - program is logically well designed; Good (3) - slight logic errors
that do not significantly affect the results; Fair (2) - significant logic errors;
Beginning (0-1) - logic is incorrect.
Standards: Excellent (4) - program is stylistically well designed; Good (3) - few inappropriate
design choices (i.e. poor variable names, improper indentation); Fair (2) - several inappropriate
design choices; Beginning (0-1) - program is poorly written.
Documentation: Excellent (4) - program is well documented; Good (3) - missing one required comment;
Fair (2) - missing two or more required comments; Beginning (0-1) - most or all documentation missing.
Criteria 1 2 3 4 5 Total
Marks
Experiment No: 7
Date:
Objectives:
Theory:
Optical flow: It is the motion of objects between consecutive frames of a sequence, caused by the
relative movement between the object and the camera. The problem of optical flow may be expressed
as follows: between consecutive frames, we can express the image intensity (I) as a function of
space (x, y) and time (t). In other words, if we take the first image I(x, y, t) and move its pixels
by (dx, dy) over time dt, we obtain the new image I(x+dx, y+dy, t+dt).
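Assuming the brightness of a point does not change between the two frames, I(x, y, t) = I(x+dx, y+dy, t+dt); a first-order Taylor expansion of the right-hand side gives the optical flow (brightness constancy) constraint

    I_x u + I_y v + I_t = 0,

where I_x, I_y and I_t are the partial derivatives of the image intensity and (u, v) = (dx/dt, dy/dt) is the flow vector. This single equation has two unknowns per pixel, so the differential methods below add further assumptions in order to solve for the flow.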
Differential methods estimate optical flow from partial derivatives of the image signal and/or the
sought flow field, and from higher-order partial derivatives. Well-known examples include:
1. Lucas–Kanade method – regarding image patches and an affine model for the flow field
2. Horn–Schunck method – optimizing a functional based on residuals from the brightness
constancy constraint, and a particular regularization term expressing the expected
smoothness of the flow field
3. Buxton–Buxton method – based on a model of the motion of edges in image sequences
4. Black–Jepson method – coarse optical flow via correlation
1. Setting up your environment and opening sparse-starter.py with your text editor
2. Configuring OpenCV to read a video and setting up parameters
3. Grayscaling
4. Shi-Tomasi Corner Detector - selecting the pixels to track
5. Tracking Specific Objects
6. Lucas-Kanade: Sparse Optical Flow
7. Visualizing
Program:
import numpy as np
import cv2 as cv

cap = cv.VideoCapture('./input_video.mp4')

# Take the first frame and define an initial tracking window (x, y, w, h) - assumed values
ret, frame = cap.read()
tracker = (300, 200, 100, 50)
x, y, w, h = tracker

# Hue histogram of the region of interest, used for back-projection
roi_hsv = cv.cvtColor(frame[y:y+h, x:x+w], cv.COLOR_BGR2HSV)
reg_hist = cv.calcHist([roi_hsv], [0], None, [180], [0, 180])
cv.normalize(reg_hist, reg_hist, 0, 255, cv.NORM_MINMAX)

# Termination criteria: 10 iterations or movement smaller than 1 pixel
criteria = (cv.TERM_CRITERIA_EPS | cv.TERM_CRITERIA_COUNT, 10, 1)

while(1):
    ret, frame = cap.read()
    if ret == True:
        hsv = cv.cvtColor(frame, cv.COLOR_BGR2HSV)
        dst = cv.calcBackProject([hsv], [0], reg_hist, [0, 180], 1)
        # apply meanshift to find the new window position
        ret, tracker = cv.meanShift(dst, tracker, criteria)
        # Draw it on image
        x, y, w, h = tracker
        img = cv.rectangle(frame, (x, y), (x+w, y+h), 255, 2)
        cv.imshow('img', img)
        if cv.waitKey(30) & 0xFF == 27:  # press Esc to stop
            break
    else:
        break
cap.release()
cv.destroyAllWindows()
Output:
Conclusion:
In this experiment, we explored the concept of optical flow and implemented the Lucas-
Kanade Sparse Optical Flow algorithm using OpenCV. Optical flow is a critical technique
in computer vision that allows us to estimate the motion of objects between consecutive
frames in a video sequence. This technique finds application in various domains such as
object tracking, video stabilization, and motion analysis.
Quiz:
1. Compare Sparse vs. Dense Optical Flow
◈ The choice between sparse and dense optical flow depends on the specific requirements of
the computer vision task. Sparse optical flow is suitable when computational efficiency
and tracking specific features are essential, while dense optical flow is preferred for tasks
that require a detailed analysis of motion across the entire frame, even though it comes at
the cost of higher computational requirements.
Suggested Reference:
1. The Computation of Optical Flow by S. S. Beauchemin and J. L. Barron. ACM Digital Library.
2. https://nanonets.com/blog/optical-flow/#what-is-optical-flow
Criteria 1 2 3 4 5 Total
Marks
Experiment No: 8
Date:
Objectives:
Theory:
We can create a simple application which tracks some points in a video. To decide the points, we
use cv.goodFeaturesToTrack(): we take the first frame, detect some Shi-Tomasi corner points in
it, and then iteratively track those points using Lucas-Kanade optical flow. To the function
cv.calcOpticalFlowPyrLK() we pass the previous frame, the previous points and the next frame. It
returns the next points along with a status array whose entries are 1 if the corresponding next
point was found and 0 otherwise. We iteratively pass these next points as the previous points in
the next step.
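A minimal Python sketch of the same idea for a single pair of frames (the frame file names are assumptions):
import cv2
prev_gray = cv2.imread('frame1.jpg', cv2.IMREAD_GRAYSCALE)
next_gray = cv2.imread('frame2.jpg', cv2.IMREAD_GRAYSCALE)
# Shi-Tomasi corners in the first frame, then Lucas-Kanade flow into the second frame
p0 = cv2.goodFeaturesToTrack(prev_gray, maxCorners=100, qualityLevel=0.3, minDistance=7)
p1, status, err = cv2.calcOpticalFlowPyrLK(prev_gray, next_gray, p0, None)
good_new, good_old = p1[status == 1], p0[status == 1]   # keep only the points that were found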
#include <iostream>
#include <opencv2/core.hpp>
#include <opencv2/highgui.hpp>
#include <opencv2/imgproc.hpp>
#include <opencv2/videoio.hpp>
#include <opencv2/video.hpp>
using namespace cv;
using namespace std;
int main(int argc, char **argv)
{
const string about =
    "This sample demonstrates Lucas-Kanade Optical Flow calculation.\n"
    "The example file can be downloaded from:\n"
    "https://www.bogotobogo.com/python/OpenCV_Python/images/mean_shift_tracking/slow_traffic_small.mp4";
const string keys =
"{ h help | | print this help message }"
"{ @image | vtest.avi | path to image file }";
CommandLineParser parser(argc, argv, keys);
parser.about(about);
if (parser.has("help"))
{
parser.printMessage();
return 0;
}
string filename = samples::findFile(parser.get<string>("@image"));
if (!parser.check())
{
parser.printErrors();
return 0;
}
VideoCapture capture(filename);
if (!capture.isOpened()){
//error in opening the video input
cerr<< "Unable to open file!" <<endl;
return 0;
}
// Create some random colors
vector<Scalar> colors;
RNG rng;
for(int i = 0; i< 100; i++)
{
int r = rng.uniform(0, 256);
int g = rng.uniform(0, 256);
int b = rng.uniform(0, 256);
colors.push_back(Scalar(r,g,b));
}
Mat old_frame, old_gray;
vector<Point2f> p0, p1;
// Take first frame and find corners in it
capture >>old_frame;
cvtColor(old_frame, old_gray, COLOR_BGR2GRAY);
goodFeaturesToTrack(old_gray, p0, 100, 0.3, 7, Mat(), 7, false, 0.04);
// Create a mask image for drawing purposes
Mat mask = Mat::zeros(old_frame.size(), old_frame.type());
while(true){
Mat frame, frame_gray;
capture >> frame;
if (frame.empty())
break;
cvtColor(frame, frame_gray, COLOR_BGR2GRAY);
// calculate optical flow
vector<uchar> status;
vector<float> err;
TermCriteria criteria = TermCriteria((TermCriteria::COUNT) + (TermCriteria::EPS), 10, 0.03);
calcOpticalFlowPyrLK(old_gray, frame_gray, p0, p1, status, err, Size(15,15), 2, criteria);
vector<Point2f> good_new;
for(uint i = 0; i < p0.size(); i++)
{
// Select good points
if(status[i] == 1) {
good_new.push_back(p1[i]);
// draw the tracks
line(mask,p1[i], p0[i], colors[i], 2);
circle(frame, p1[i], 5, colors[i], -1);
}
}
Mat img;
add(frame, mask, img);
imshow("Frame", img);
int keyboard = waitKey(30);
if (keyboard == 'q' || keyboard == 27)
break;
// Now update the previous frame and previous points
old_gray = frame_gray.clone();
p0 = good_new;
}
}
Program:
import cv2
import numpy as np

# Open the video source (path assumed; adjust as needed)
cap = cv2.VideoCapture('./input_video.mp4')

# Create some random colors for drawing the optical flow tracks
colors = np.random.randint(0, 255, (100, 3))

# Take the first frame and detect Shi-Tomasi corners to track
ret, old_frame = cap.read()
old_gray = cv2.cvtColor(old_frame, cv2.COLOR_BGR2GRAY)
p0 = cv2.goodFeaturesToTrack(old_gray, maxCorners=100, qualityLevel=0.3, minDistance=7, blockSize=7)
mask = np.zeros_like(old_frame)   # mask image for drawing the tracks

while True:
    ret, frame = cap.read()
    if not ret:
        break
    frame_gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    # Calculate sparse optical flow with the Lucas-Kanade method
    p1, status, err = cv2.calcOpticalFlowPyrLK(old_gray, frame_gray, p0, None, winSize=(15, 15), maxLevel=2)
    good_new, good_old = p1[status == 1], p0[status == 1]
    # Draw the tracks
    for i, (new, old) in enumerate(zip(good_new, good_old)):
        a, b = new.ravel().astype(int)
        c, d = old.ravel().astype(int)
        mask = cv2.line(mask, (a, b), (c, d), colors[i].tolist(), 2)
        frame = cv2.circle(frame, (a, b), 5, colors[i].tolist(), -1)
    # Combine the frame and the mask to visualize the optical flow
    img = cv2.add(frame, mask)
    cv2.imshow('Optical Flow', img)
    if cv2.waitKey(30) & 0xFF == 27:  # press Esc to stop
        break
    # Update the previous frame and points
    old_gray = frame_gray.copy()
    p0 = good_new.reshape(-1, 1, 2)

cap.release()
cv2.destroyAllWindows()
Output:
Conclusion:
In this experiment, we successfully demonstrated the practical application of optical flow
in image processing using the Lucas-Kanade method with OpenCV. Optical flow is a
valuable technique in computer vision that allows us to analyze the motion of objects
within a video sequence or between consecutive image frames. We applied this technique
to track points in a video and visualize their motion, offering insights into the movement
patterns present in the video.
Quiz:
1. Is it necessary to detect corner points in particular intervals ?
◈ The decision to detect corner points at particular intervals or adaptively depends on the
specific demands of your image processing or computer vision application. You should
consider factors such as the nature of the scene, changes in the scene over time, and the
available computational resources when determining the best strategy for feature point
detection in optical flow and motion tracking.
Suggested Reference:
1. The Computation of Optical Flow by S. S. Beauchemin and J. L. Barron. ACM Digital Library.
2. https://nanonets.com/blog/optical-flow/#what-is-optical-flow
3. https://docs.opencv.org/3.4/d4/dee/tutorial_optical_flow.html
Criteria 1 2 3 4 5 Total
Marks
Experiment No: 9
Date:
Objectives:
Theory:
Object detection and object recognition are similar techniques for identifying objects, but they
vary in their execution. Object detection is the process of finding instances of objects in images.
In the case of deep learning, object detection is a subset of object recognition, where the object is
not only identified but also located in an image. This allows for multiple objects to be identified
and located within the same image. Object recognition is a key technology behind driverless cars,
enabling them to recognize a stop sign or to distinguish a pedestrian from a lamppost. It is also useful in a
variety of applications such as disease identification in bioimaging, industrial inspection, and robotic
vision.
Procedure:
Program:
from imageai.Detection import ObjectDetection
# Define the paths to the model, input image, and output image
model_path = "yolo-tiny.h5"
input_path = "cars.jpg"
output_path = "output_image.jpg"
# Create the detector and load the pre-trained Tiny YOLOv3 model
detector = ObjectDetection()
detector.setModelTypeAsTinyYOLOv3()
detector.setModelPath(model_path)
detector.loadModel()
# Perform object detection on the input image and save the output image
detections = detector.detectObjectsFromImage(
input_image=input_path,
output_image_path=output_path
)
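Each element of the returned detections list is a dictionary describing one detected object; a short usage sketch for listing them:
# Print the name and confidence of every detected object
for detection in detections:
    print(detection["name"], ":", detection["percentage_probability"])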
Conclusion:
In this experiment, we successfully performed object detection and recognition using a
pre-trained model (Tiny YOLOv3). We applied it to an online dataset image, detecting and
labeling multiple objects. This demonstrates the practicality of object detection and
recognition in various applications.
Quiz:
1. Differentiate between machine learning approach and deep learning approach for object
recognition
◈ The primary difference lies in the feature engineering and the ability to learn features
directly from data. Deep learning excels in tasks like object recognition, where the data is
high-dimensional and complex, but the models can be less interpretable compared to
traditional machine learning approaches.
Suggested Reference:
Criteria 1 2 3 4 5 Total
Marks
Experiment No: 10
Date:
1. Problem solving
Objectives:
Theory:
Face detection, also called facial detection, is an artificial intelligence-based computer technology
used to find and identify human faces in digital images and video. Face detection technology is
often used for surveillance and tracking of people in real time. It is used in various fields
including security, biometrics, law enforcement, entertainment and social media.
To perform face recognition, face detection is first carried out to determine the position of the face in the
picture; OpenCV provides functions for this. The detector is built by extracting Haar features of faces from a
large sample set of images and then using the AdaBoost algorithm to train the face detector. In face detection,
the algorithm can adapt effectively to complex conditions such as insufficient illumination and background blur,
which greatly improves the accuracy of detection. For a given training set, different training sets are obtained
for subsequent work by changing the distribution probabilities of the individual samples; each training set is
trained to obtain a weak classifier, and these classifiers are then combined with appropriate weights.
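A minimal sketch of this Haar-cascade approach on a single image, using the cascade file that ships with OpenCV (the image path is an assumption); the program below applies the same detector frame by frame to a video:
import cv2
face_cascade = cv2.CascadeClassifier(cv2.data.haarcascades + 'haarcascade_frontalface_default.xml')
img = cv2.imread('input_image.jpg')
faces = face_cascade.detectMultiScale(cv2.cvtColor(img, cv2.COLOR_BGR2GRAY), scaleFactor=1.1, minNeighbors=5)
for (x, y, w, h) in faces:
    cv2.rectangle(img, (x, y), (x + w, y + h), (0, 255, 0), 2)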
Procedure:
Program:
import cv2

# Path to the input video and the Haar cascade shipped with OpenCV (video path assumed)
video_file_path = './input_video.mp4'
face_cascade = cv2.CascadeClassifier(cv2.data.haarcascades + 'haarcascade_frontalface_default.xml')

# Open the video source (use the provided video file path)
cap = cv2.VideoCapture(video_file_path)

while True:
    ret, frame = cap.read()
    if not ret:
        break
    # Detect faces in the greyscale frame and draw a rectangle around each one
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    for (x, y, w, h) in face_cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5):
        cv2.rectangle(frame, (x, y), (x + w, y + h), (0, 255, 0), 2)
    cv2.imshow('Face Detection', frame)
    if cv2.waitKey(30) & 0xFF == 27:  # press Esc to stop
        break

# Release the video capture object and close the OpenCV windows
cap.release()
cv2.destroyAllWindows()
Output:
Conclusion:
Overall, this experiment provides a fundamental understanding of face detection, which is
a crucial component in many computer vision applications, including face recognition,
surveillance, and security systems.
For more advanced applications, such as face recognition, deep learning models and
custom datasets would be required. However, this experiment serves as a starting point for
understanding the basic concepts of face detection in computer vision.
Quiz:
1. Compare approaches for face detection from images and videos
◈ Face detection in images and videos relies on similar techniques, like Haar cascades, but
videos require real-time processing. Videos present added challenges due to frame rate,
tracking, and performance optimization.
Suggested Reference:
1. https://www.hindawi.com/journals/js/2021/4796768/
Criteria 1 2 3 4 5 Total
Marks