Stages of Computer Vision Process

The document outlines the five stages of the computer vision process: image acquisition, preprocessing, feature extraction, detection/segmentation, and high-level processing. Each stage is crucial for enhancing image quality, identifying relevant features, and interpreting visual data for applications such as autonomous driving and medical imaging. The document details techniques and algorithms used at each stage to improve the effectiveness of computer vision systems.

Uploaded by Pratham Bhatia

Unit 3: Making Machines See

COMPUTER VISION – PROCESS:


The computer vision process typically involves five stages.
1. Image Acquisition:
 Image acquisition is the initial stage in the process of computer vision, involving the capture of digital images or videos.
 It provides the raw data on which all subsequent analysis is based.
 Digital images can be acquired through digital cameras, by scanning physical photographs or documents, or even by generating them with design software.
 The quality and characteristics of the acquired images greatly influence the effectiveness of subsequent processing and analysis.
 The resolution of the imaging device plays a significant role in determining the quality of acquired images: higher-resolution devices capture finer details and produce clearer images than lower-resolution ones.
 Lighting conditions and camera angles can also influence the effectiveness of image acquisition techniques.

In scientific and medical fields, specialized imaging techniques like MRI (Magnetic Resonance
Imaging) or CT (Computed Tomography) scans are employed to acquire highly detailed images of
biological tissues or structures.

2. Preprocessing:
Preprocessing in computer vision aims to enhance the quality of the acquired image. Some common techniques are:
a. Noise Reduction: Removes unwanted elements such as blurriness, random spots, or distortions. This makes the image clearer and reduces distractions for algorithms.
Example: Removing grainy effects from low-light photos.
b. Image Normalization: Adjusts the pixel values of an image so they fall within a consistent range (e.g., 0–1 or -1 to 1). This ensures all images in a dataset have a similar scale, helping the model learn better.
Example: Scaling pixel values down from 0–255 to 0–1.
c. Resizing/Cropping: Changes the size or aspect ratio of the image so that all images have the same dimensions for analysis.
Example: Resizing all images to 224×224 pixels before feeding them into a neural network.
d. Histogram Equalization: Adjusts the brightness and contrast of an image by spreading pixel intensity values more evenly, enhancing details in dark or bright areas.
Example: Making a low-contrast image look sharper and more detailed.

The main goal for preprocessing is to prepare images for computer vision tasks by:
Removing noise (disturbances).
Highlighting important features.
Ensuring consistency and uniformity across the dataset.
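The normalization and histogram-equalization steps above can be sketched in a few lines of NumPy (a minimal sketch; the tiny sample image and its pixel values are invented for illustration):

```python
import numpy as np

def normalize(img):
    # Scale 8-bit pixel values from the 0-255 range into 0-1.
    return img.astype(np.float32) / 255.0

def equalize_histogram(img):
    # Spread out pixel intensities using the cumulative distribution function.
    hist = np.bincount(img.ravel(), minlength=256)
    cdf = hist.cumsum().astype(np.float32)
    cdf = (cdf - cdf.min()) / (cdf.max() - cdf.min())  # rescale CDF to 0-1
    return (cdf[img] * 255).astype(np.uint8)

img = np.array([[50, 50, 60], [60, 200, 200]], dtype=np.uint8)
n = normalize(img)           # all values now lie in 0-1
eq = equalize_histogram(img) # intensities spread across the full 0-255 range
```

In practice a library such as OpenCV or Pillow would perform these operations, but the arithmetic is the same.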
3. Feature Extraction:
Feature extraction involves identifying and extracting relevant visual patterns or attributes from the
pre-processed image.
(i). Edge detection identifies the boundaries between different regions in an image where there
is a significant change in intensity.
(ii). Corner detection identifies points where two or more edges meet. These points are areas
of high curvature in an image, focused on identifying sharp changes in image gradients, which
often correspond to corners or junctions in objects.
(iii). Texture analysis extracts features like smoothness, roughness, or repetition in an image.
(iv). Colour-based feature extraction quantifies colour distributions within the image, enabling
discrimination between different objects or regions based on their colour characteristics.
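Edge detection (item i) can be illustrated with a hand-rolled Sobel filter. This is a minimal sketch using a synthetic step-edge image; a real pipeline would use an optimized library routine:

```python
import numpy as np

def sobel_edges(img):
    # Sobel kernels respond to horizontal (kx) and vertical (ky) intensity changes.
    kx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=np.float32)
    ky = kx.T
    h, w = img.shape
    out = np.zeros((h - 2, w - 2), dtype=np.float32)
    for i in range(h - 2):
        for j in range(w - 2):
            patch = img[i:i + 3, j:j + 3].astype(np.float32)
            gx = (patch * kx).sum()
            gy = (patch * ky).sum()
            out[i, j] = np.hypot(gx, gy)  # gradient magnitude
    return out

# A vertical step edge: left half dark, right half bright.
img = np.zeros((5, 6), dtype=np.uint8)
img[:, 3:] = 255
edges = sobel_edges(img)  # strong response only along the boundary
```

The filter produces large values exactly where intensity changes sharply, which is the boundary the text describes.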

In deep learning-based approaches, feature extraction is often performed automatically by convolutional neural networks (CNNs) during the training process.
4. Detection/Segmentation:
Detection and segmentation are fundamental tasks in computer vision, focusing on identifying
objects or regions of interest within an image. These tasks play a pivotal role in applications like
autonomous driving, medical imaging, and object tracking. This crucial stage is categorized into two
primary tasks:
1. Single Object Tasks
2. Multiple Object Tasks
Single Object Tasks: Single object tasks focus on analysing or delineating individual objects within an image, with two main objectives:

i. Classification: This task involves determining the category or class to which a single object
belongs, providing insights into its identity or nature. KNN (K-Nearest Neighbour) algorithm
may be used for supervised classification while K-means clustering algorithm can be used for
unsupervised classification.
ii. Classification + Localization: In addition to classifying objects, this task also involves
precisely localizing the object within the image by predicting bounding boxes that tightly
enclose it.
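The KNN classification mentioned in (i) can be sketched as a majority vote over the nearest training samples (a minimal sketch; the 2-D feature vectors and class names are invented for illustration):

```python
import numpy as np

def knn_classify(train_X, train_y, query, k=3):
    # Distance from the query to every training sample.
    dists = np.linalg.norm(train_X - query, axis=1)
    nearest = np.argsort(dists)[:k]                  # indices of the k closest samples
    labels, counts = np.unique(train_y[nearest], return_counts=True)
    return labels[np.argmax(counts)]                 # majority vote

# Hypothetical feature vectors (e.g. colour and texture scores) for two classes.
train_X = np.array([[0.1, 0.2], [0.2, 0.1], [0.9, 0.8], [0.8, 0.9]])
train_y = np.array(["cat", "cat", "dog", "dog"])
print(knn_classify(train_X, train_y, np.array([0.85, 0.85])))  # dog
```

In practice the features would come from the feature-extraction stage rather than being hand-written constants.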
Multiple Object Tasks: Multiple object tasks deal with scenarios where an image contains multiple
instances of objects or different object classes. These tasks aim to identify and distinguish between
various objects within the image, and they include:

i. Object Detection:
 Object detection focuses on identifying and locating multiple objects of interest within
the image.
 It involves analysing the entire image and drawing bounding boxes around detected
objects, along with assigning class labels to these boxes.
 The main difference between classification and detection is that classification
considers the image as a whole and determines its class whereas detection identifies
the different objects in the image and classifies all of them.
 In detection, bounding boxes are drawn around multiple objects and these are labelled
according to their particular class.
 Object detection algorithms typically use extracted features and learning algorithms to
recognize instances of an object category.
 Some of the algorithms used for object detection are: R-CNN (Region-Based
Convolutional Neural Network), R-FCN (Region-based Fully Convolutional Network),
YOLO (You Only Look Once) and SSD (Single Shot Detector).
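Detectors such as these compare predicted and ground-truth bounding boxes with intersection-over-union (IoU). A minimal sketch, assuming boxes in (x1, y1, x2, y2) corner format:

```python
def iou(box_a, box_b):
    # Overlapping region of the two boxes.
    xa, ya = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    xb, yb = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, xb - xa) * max(0, yb - ya)
    # Union = sum of areas minus the double-counted intersection.
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

print(iou((0, 0, 2, 2), (1, 1, 3, 3)))  # 1/7, about 0.143
```

A predicted box is typically counted as a correct detection when its IoU with the ground-truth box exceeds a threshold such as 0.5.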

[Figure: Object Detection]

ii. Image segmentation:

 It creates a mask around pixels with similar characteristics and identifies their class in the given input image.
 Image segmentation helps to gain an understanding of the image at a more granular level.
 Pixels are assigned a class, and a pixel-wise mask is created for each object in the image.
 This makes it easy to identify each object separately from the others.
 Two popular types of segmentation are:

a. Semantic Segmentation: Classifies pixels as belonging to a particular class. Objects belonging to the same class are not differentiated. For example, all animal pixels may be identified under the class "animal" without identifying the type of animal.

b. Instance Segmentation: Classifies pixels as belonging to a particular instance. All objects in the image are differentiated, even if they belong to the same class. For example, two animals of the same class receive separate pixel masks.
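The difference between the two can be shown with toy label masks (a minimal sketch; the 4×4 grid, class ids, and instance ids are invented for illustration):

```python
import numpy as np

# A toy scene with two separate objects of the same class ("animal" = class 1).
semantic = np.array([[1, 1, 0, 0],
                     [1, 1, 0, 0],
                     [0, 0, 1, 1],
                     [0, 0, 1, 1]])   # semantic mask: both objects share label 1

instance = np.array([[1, 1, 0, 0],
                     [1, 1, 0, 0],
                     [0, 0, 2, 2],
                     [0, 0, 2, 2]])   # instance mask: each object gets its own id

num_classes = len(np.unique(semantic)) - 1    # ignore background (0)
num_instances = len(np.unique(instance)) - 1
print(num_classes, num_instances)             # 1 2
```

The semantic mask answers "what is here?" (one class), while the instance mask also answers "which one is it?" (two distinct objects).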

5. High-Level Processing:

 In the final stage of computer vision, high-level processing plays a crucial role in interpreting
and extracting meaningful information from the detected objects or regions within digital
images.
 This advanced processing enables computers to achieve a deeper understanding of visual
content and make informed decisions based on the visual data.
 Tasks involved in high-level processing include recognizing objects, understanding scenes,
and analysing the context of the visual content.
 Ultimately, high-level processing empowers computer vision systems to extract valuable
insights and drive intelligent decision-making in various applications, ranging from
autonomous driving to medical diagnostics.
