Unit 2 : Classification
What is Classification?
• Classification is the process of categorizing a given
set of data into classes. It can be performed on
both structured and unstructured data.
• The process starts with predicting the class of
given data points. The classes are often referred
to as targets, labels, or categories.
• Classification predictive modeling is the task
of approximating a mapping function from
input variables to discrete output variables.
• The main goal is to identify which class/category
the new data will fall into.
Example:
• Heart disease detection can be framed as a
classification problem. It is a binary
classification, since there are only two classes:
has heart disease or does not have heart
disease.
• The classifier needs training data to
understand how the given input variables are
related to the class. Once the classifier is
trained accurately, it can be used to detect
whether heart disease is present for a
particular patient.
• Since classification is a type of supervised learning,
the targets are provided along with the input data.
Basic Terminologies used
• Classifier – It is an algorithm that is used to
map the input data to a specific category.
• Classification Model – The model draws a
conclusion from the input data given for
training; it predicts the class or category
for new data.
• Feature – A feature is an individual
measurable property of the phenomenon
being observed.
• Label – The output variable (the class to be predicted).
Example of Linear Classification
• Red points: patterns
belonging to class C1.
• Blue points: patterns
belonging to class C2.
• Goal: find a linear decision
boundary separating C1
from C2.
• Points on one side of the line will be classified as belonging to
C1, points on the other side will be classified as C2.
• The red line is one example of such a decision boundary;
it misclassifies a few patterns.
• The green line is another example.
Linear Classification
• Mathematically, assume
input patterns are
D-dimensional vectors x ∈ R^D.
• We are looking for a decision
boundary in the form of a
(D-1)-dimensional hyperplane
separating the two classes.
• Points on one side of the
hyperplane will be classified
as belonging to C1, points on the
other side will be classified as C2.
• If inputs are 2-dimensional vectors, the decision boundary
is a line.
• If inputs are 3-dimensional vectors, the decision boundary
is a 2-dimensional surface.
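As a sketch of the decision rule described above, the following classifies 2-dimensional points by which side of a hyperplane they fall on. The weight vector `w` and offset `b` are invented for illustration, not values from the text.

```python
# Hedged sketch of a linear decision boundary in 2-D.
# The weights w and offset b are assumptions chosen for illustration.

def linear_classify(x, w, b):
    """Classify x as C1 or C2 depending on which side of the
    hyperplane w . x - b = 0 it falls on."""
    score = sum(wi * xi for wi, xi in zip(w, x)) - b
    return "C1" if score >= 0 else "C2"

w = [1.0, 1.0]   # assumed normal vector of the boundary
b = 3.0          # assumed offset

print(linear_classify([4.0, 2.0], w, b))  # score 4 + 2 - 3 = 3    -> C1
print(linear_classify([0.5, 1.0], w, b))  # score 0.5 + 1 - 3 = -1.5 -> C2
```

In 2-D this boundary is the line x1 + x2 = 3; with D-dimensional inputs the same code describes a (D-1)-dimensional hyperplane.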
Types of Learners
• Lazy Learners –
– Lazy learners simply store the training
data and wait until test data
appears.
– Classification is then done using the
most closely related data in the stored
training data.
– They take more time to predict compared
to eager learners. E.g. k-nearest neighbor,
case-based reasoning.
Types of Learners
• Eager Learners –
– Eager learners construct a
classification model based on the
given training data before getting
data for predictions.
– They must be able to commit to a single
hypothesis that covers the entire instance
space.
– Due to this, they take a lot of time in
training and less time for prediction. E.g.
Decision Tree, Naive Bayes, Artificial Neural
Networks.
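The contrast can be sketched in a few lines: a 1-nearest-neighbour "model" (a lazy learner) does no real work at training time and defers everything to prediction time. The toy points and labels below are invented for illustration.

```python
# Lazy learning in miniature: "training" only memorises the examples,
# and all the work happens when a prediction is requested.
# The toy data and labels are invented for illustration.

def nn_train(X, y):
    return list(zip(X, y))          # lazy: just store the training data

def nn_predict(model, x):
    # deferred work: scan the stored data for the closest point
    def dist2(a, b):
        return sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    _, label = min(model, key=lambda pair: dist2(pair[0], x))
    return label

model = nn_train([(0, 0), (1, 0), (5, 5)],
                 ["no_disease", "no_disease", "disease"])
print(nn_predict(model, (4, 4)))    # closest stored point is (5, 5)
```

An eager learner such as a decision tree would instead spend its time up front building a hypothesis, making each later prediction cheap.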
Types of Classification
• Binary Classification
• Multi-Class Classification
• Multi-Label Classification
• Imbalanced Classification
Types of Classification Algorithms
• Linear Models
– Logistic Regression
– Support Vector Machines
• Nonlinear models
– K-nearest Neighbors (KNN)
– Kernel Support Vector Machines
(SVM)
– Naïve Bayes
– Decision Tree Classification
– Random Forest Classification
Binary Classification
• Binary classification refers to those
classification tasks that have two class
labels.
• Examples include:
– Email spam detection (spam or not)
– Churn prediction (churn or not).
– Conversion prediction (buy or not).
• Typically, binary classification tasks involve
one class that is the normal state and
another class that is the abnormal state.
Binary Classification – Example
• For example “not spam” is the normal state and
“spam” is the abnormal state. Another example is
“cancer not detected” is the normal state of a task
that involves a medical test and “cancer detected”
is the abnormal state.
• The class for the normal state is assigned the
class label 0 and the class with the abnormal
state is assigned the class label 1.
• It is common to model a binary classification
task with a model that predicts a Bernoulli
probability distribution for each example.
Binary Classification – Algorithms
• Popular algorithms that can be used for
binary classification include:
– Logistic Regression
– k-Nearest Neighbors
– Decision Trees
– Support Vector Machine
– Naive Bayes
Evaluation of Binary Classifier
• There are many metrics that can be used to measure
the performance of a classifier or predictor; different
fields have different preferences for specific metrics
due to different goals.
• In medicine, sensitivity and specificity are often
used, while in information retrieval, precision and
recall are preferred.
• An important distinction is between metrics that are
independent of how often each category occurs in the
population (the prevalence), and metrics that depend
on the prevalence – both types are useful, but they
have very different properties.
Evaluation of Binary Classifier
• Given a classification of a specific data set, there
are four basic combinations of actual data
category and assigned category: true positives TP
(correct positive assignments), true negatives TN
(correct negative assignments), false positives FP
(incorrect positive assignments), and false
negatives FN (incorrect negative assignments).
Confusion Matrix
• In the field of machine learning, and specifically the
problem of statistical classification, a confusion matrix,
also known as an error matrix, is a specific table layout
that allows visualization of the performance of an
algorithm, typically a supervised learning one (in
unsupervised learning it is usually called a matching
matrix).
• Each row of the matrix represents the instances in
a predicted class, while each column represents
the instances in an actual class (or vice versa).
• The name stems from the fact that it makes it easy to see
whether the system is confusing two classes (i.e.
commonly mislabeling one as another).
True Positive (TP)
•The predicted value matches the actual value
•The actual value was positive and the model predicted a positive value
True Negative (TN)
•The predicted value matches the actual value
•The actual value was negative and the model predicted a negative value
False Positive (FP) – Type 1 error
•The predicted value was falsely predicted
•The actual value was negative but the model predicted a positive value
•Also known as the Type 1 error
False Negative (FN) – Type 2 error
•The predicted value was falsely predicted
•The actual value was positive but the model predicted a negative value
•Also known as the Type 2 error
Confusion Matrix
• Given a sample of 13 pictures, 8 of cats and 5 of dogs,
where cats belong to class 1 and dogs belong to class 0:
– actual = [1,1,1,1,1,1,1,1,0,0,0,0,0]
• Assume that a classifier that distinguishes between cats and
dogs is trained, and we take the 13 pictures and run them
through the classifier. The classifier makes 8 accurate
predictions and misses 5: 3 cats wrongly predicted as dogs (first
3 predictions) and 2 dogs wrongly predicted as cats (last 2
predictions).
– prediction = [0,0,0,1,1,1,1,1,0,0,0,1,1]
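These counts can be verified with a few lines of plain Python, using the actual and prediction lists from the slide above:

```python
# Counting TP/TN/FP/FN by hand for the cat/dog example
# (cats = class 1 = positive, dogs = class 0 = negative).

actual     = [1,1,1,1,1,1,1,1,0,0,0,0,0]
prediction = [0,0,0,1,1,1,1,1,0,0,0,1,1]

tp = sum(1 for a, p in zip(actual, prediction) if a == 1 and p == 1)
tn = sum(1 for a, p in zip(actual, prediction) if a == 0 and p == 0)
fp = sum(1 for a, p in zip(actual, prediction) if a == 0 and p == 1)
fn = sum(1 for a, p in zip(actual, prediction) if a == 1 and p == 0)

print(tp, tn, fp, fn)   # 5 cats right, 3 dogs right, 2 FP, 3 FN
```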
Confusion Matrix
F1 Score / Harmonic Mean
F1 = 2 · (precision · recall) / (precision + recall), i.e. the
harmonic mean of precision and recall.
Sklearn has two
functions: confusion_matrix() and classification_report().
• Sklearn confusion_matrix() returns the values of the confusion
matrix.
• Sklearn classification_report() outputs precision, recall and f1-
score for each target class. In addition, it also reports some
extra values: micro avg, macro avg, and weighted avg.
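As a quick illustration (assuming scikit-learn is installed), the two helpers can be applied to the cat/dog labels from the earlier example:

```python
# Illustrative use of the two sklearn helpers named above, on the
# cat/dog labels from the earlier confusion-matrix example.
from sklearn.metrics import confusion_matrix, classification_report

actual     = [1,1,1,1,1,1,1,1,0,0,0,0,0]
prediction = [0,0,0,1,1,1,1,1,0,0,0,1,1]

# sklearn's convention: rows = actual class, columns = predicted class
print(confusion_matrix(actual, prediction))

# per-class precision, recall, f1, plus macro/weighted averages
print(classification_report(actual, prediction, target_names=["dog", "cat"]))
```

For these labels the matrix is [[3, 2], [3, 5]]: 3 dogs correct, 2 dogs predicted as cats, 3 cats predicted as dogs, 5 cats correct.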
Confusion Matrix for Multi-Class Classification
The true positive, true negative, false positive and false negative for
each class are calculated by adding the cell values as follows.
How to calculate FN, FP, TN, TP for multi-class:
Class Setosa
TP: The actual value and predicted value should be the same. So for the Setosa
class, the value of cell 1 is the TP value.
FN: The sum of the values of the corresponding row except the TP value.
FN = (cell 2 + cell 3)
= (0 + 0)
= 0
FP: The sum of the values of the corresponding column except the TP value.
FP = (cell 4 + cell 7)
= (0 + 0)
= 0
TN: The sum of the values of all columns and rows except those of the class
we are calculating for.
TN = (cell 5 + cell 6 + cell 8 + cell 9)
= 17 + 1 + 0 + 11
= 29
Similarly, for the Versicolor class the metrics are calculated as below:
TP: 17 (cell 5)
FN: 0 + 1 = 1 (cell 4 + cell 6)
FP: 0 + 0 = 0 (cell 2 + cell 8)
TN: 16 + 0 + 0 + 11 = 27 (cell 1 + cell 3 + cell 7 + cell 9)
You can try the same calculation for the Virginica class.
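The per-class bookkeeping above generalizes to any multi-class confusion matrix; the 3×3 matrix below reproduces the Iris-style cell values from the slide (rows = actual, columns = predicted):

```python
# Per-class TP/FN/FP/TN derived from a multi-class confusion matrix.
# Cell values follow the slide's Iris example: rows = actual class,
# columns = predicted class (Setosa, Versicolor, Virginica).

cm = [[16,  0, 0],    # Setosa
      [ 0, 17, 1],    # Versicolor
      [ 0,  0, 11]]   # Virginica

def per_class_counts(cm, k):
    tp = cm[k][k]
    fn = sum(cm[k]) - tp                     # rest of the row
    fp = sum(row[k] for row in cm) - tp      # rest of the column
    tn = sum(map(sum, cm)) - tp - fn - fp    # everything else
    return tp, fn, fp, tn

print(per_class_counts(cm, 0))   # Setosa:     (16, 0, 0, 29)
print(per_class_counts(cm, 1))   # Versicolor: (17, 1, 0, 27)
print(per_class_counts(cm, 2))   # Virginica:  (11, 0, 1, 33)
```

The Setosa and Versicolor results match the hand calculations on the slide (TN = 29 and TN = 27 respectively).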
ROC (Receiver Operating Characteristic)
• TruePositiveRate = TruePositive / (TruePositive + FalseNegative)
• FalsePositiveRate = FalsePositive / (FalsePositive + TrueNegative)
The ROC curve shows the trade-off between sensitivity (or TPR) and
specificity (1 – FPR).
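Both rates follow directly from the four counts; here they are computed for the cat/dog numbers worked out earlier (TP = 5, FN = 3, FP = 2, TN = 3):

```python
# TPR and FPR from the four confusion-matrix counts, using the
# cat/dog numbers from the earlier example.

tp, fn, fp, tn = 5, 3, 2, 3

tpr = tp / (tp + fn)    # sensitivity / recall
fpr = fp / (fp + tn)    # 1 - specificity

print(tpr, fpr)   # 0.625 0.4
```

Sweeping the classifier's decision threshold and plotting (FPR, TPR) at each setting traces out the ROC curve.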
Multi Class Classification
• Multi-class classification refers to those
classification tasks that have more than two class
labels.
• Examples include:
– Face classification.
– Plant species classification.
– Optical character recognition.
• Unlike binary classification, multi-class
classification does not have the notion of normal
and abnormal outcomes. Instead, examples are
classified as belonging to one among a range of
known classes.
Multi Class Classification
• The number of class labels may be very large on some
problems. For example, a model may predict a photo as
belonging to one among thousands or tens of
thousands of faces in a face recognition system.
• Problems that involve predicting a sequence of words,
such as text translation models, may also be considered
a special type of multi-class classification.
• Each word in the sequence of words to be predicted
involves a multi-class classification where the size of
the vocabulary defines the number of possible classes
that may be predicted and could be tens or hundreds
of thousands of words in size.
Multi Class Classification –
Algorithms
• Many algorithms used for binary
classification can be used for multi-class
classification.
• Popular algorithms that can be used for
multi-class classification include:
– k-Nearest Neighbors.
– Decision Trees.
– Naive Bayes.
– Random Forest.
– Gradient Boosting.
Multi Class Classification
• This involves using a strategy of fitting multiple binary
classification models for each class vs. all other classes
(called one-vs-rest) or one model for each pair of
classes (called one-vs-one).
– One-vs-Rest: Fit one binary classification model for
each class vs. all other classes.
– One-vs-One: Fit one binary classification model for
each pair of classes.
• Binary classification algorithms that can use
these strategies for multi-class classification
include:
– Logistic Regression.
– Support Vector Machine.
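The two strategies differ in how many binary models they fit; a small arithmetic sketch of the counts for n classes:

```python
# Number of binary models each decomposition strategy needs
# for an n-class problem.

def ovr_models(n):
    return n                    # one-vs-rest: one model per class

def ovo_models(n):
    return n * (n - 1) // 2     # one-vs-one: one model per pair of classes

print(ovr_models(10), ovo_models(10))   # 10 45
```

One-vs-one grows quadratically with the number of classes, but each of its models is trained on only the two classes involved, so the individual training sets are much smaller.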
Multi-Label Classification
• Multi-label classification refers to those classification
tasks that have two or more class labels, where one
or more class labels may be predicted for each
example.
• Consider the example of photo classification, where
a given photo may have multiple objects in the
scene and a model may predict the presence of
multiple known objects in the photo, such as
“bicycle,” “apple,” “person,” etc.
• This is unlike binary classification and multi-class
classification, where a single class label is
predicted for each example.
Imbalanced Classification
• Imbalanced classification refers to classification
tasks where the number of examples in each class
is unequally distributed.
• Typically, imbalanced classification tasks are binary
classification tasks where the majority of examples
in the training dataset belong to the normal class
and a minority of examples belong to the abnormal
class.
• Examples include:
– Fraud detection.
– Outlier detection.
– Medical diagnostic tests.
Imbalanced Classification
• These problems are modeled as binary
classification tasks, although they may
require specialized techniques.
• Specialized techniques may be used to
change the composition of samples in the
training dataset by undersampling the
majority class or oversampling the minority
class.
• Examples include:
– Random Undersampling.
– SMOTE Oversampling.
Imbalanced Classification
• Specialized modeling algorithms may be used
that pay more attention to the minority class
when fitting the model on the training
dataset, such as cost-sensitive machine
learning algorithms.
• Examples include:
– Cost-sensitive Logistic Regression.
– Cost-sensitive Decision Trees.
– Cost-sensitive Support Vector Machines.
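One common cost-sensitive recipe is to weight each class inversely to its frequency; the "balanced" heuristic used by several libraries is weight = n_samples / (n_classes × class_count). A hand computation on an invented imbalanced label vector:

```python
# "Balanced" class weights computed by hand for an imbalanced
# label vector (90 normal vs 10 abnormal examples, invented data).
from collections import Counter

y = [0] * 90 + [1] * 10

counts = Counter(y)
n, k = len(y), len(counts)

# weight = n_samples / (n_classes * count_of_class)
weights = {cls: n / (k * c) for cls, c in counts.items()}
print(weights)   # the minority class gets the larger weight
```

Passing such weights to a cost-sensitive learner makes each minority-class mistake cost proportionally more during training.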
Introduction to Support
Vector Machines
History of SVM
• SVM is related to statistical learning theory
• SVM was first introduced in 1992
• SVM becomes popular because of its success in handwritten digit
recognition
• 1.1% test error rate for SVM. This is the same as the error rates of a carefully
constructed neural network, LeNet 4.
• See Section 5.11 in [2] or the discussion in [3] for details
• SVM is now regarded as an important example of “kernel methods”,
one of the key areas in machine learning
• Note: the meaning of “kernel” is different from the “kernel” function for
Parzen windows
07/30/2025 39
Linear Classifiers
x → f(x, w, b) → y_est
f(x, w, b) = sign(w · x − b)
• w: weight vector
• x: data vector
[Figure: a 2-D dataset with one marker denoting +1 and another denoting −1,
and several candidate separating lines drawn through it]
How would you classify this data?
Any of these lines would be fine...
...but which is best?
Classifier Margin
f(x, w, b) = sign(w · x − b)
Define the margin of a linear classifier as the width
that the boundary could be increased by before
hitting a datapoint.
Maximum Margin
f(x, w, b) = sign(w · x − b)
The maximum margin linear classifier is the linear
classifier with the, um, maximum margin.
This is the simplest kind of SVM (called a Linear
SVM, LSVM).
Maximum Margin
f(x, w, b) = sign(w · x − b)
Support vectors are those datapoints that the
margin pushes up against.
The maximum margin linear classifier is the linear
classifier with the maximum margin; this is the
simplest kind of SVM (called a Linear SVM, LSVM).
Why Maximum Margin?
f(x, w, b) = sign(w · x − b)
• The maximum margin linear classifier is the linear classifier
with the maximum margin; the support vectors are the
datapoints that the margin pushes up against.
• Intuitively, the widest margin leaves the most room for error:
small perturbations of the boundary or of the datapoints are
least likely to cause misclassification.
How to calculate the distance from a point to a line?
[Figure: a point x and the line w · x + b = 0, with normal vector w]
• x – the data vector
• w – the normal vector of the line
• b – the scale (offset) value
In our case, w1*x1 + w2*x2 + b = 0,
thus w = (w1, w2), x = (x1, x2)
Estimate the Margin
[Figure: a point x and the hyperplane w · x + b = 0, with normal vector w]
• What is the distance expression for a point x to the line wx + b = 0?

    d(x) = |x · w + b| / ||w||₂ = |x · w + b| / sqrt( Σ_{i=1}^{d} w_i² )
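The distance formula can be checked numerically; the point and hyperplane below are made up for illustration:

```python
# Distance from a point x to the hyperplane w . x + b = 0,
# following d(x) = |x . w + b| / ||w||_2.
# The point, weights, and offset are invented for illustration.
import math

def distance(x, w, b):
    dot = sum(wi * xi for wi, xi in zip(w, x))
    norm = math.sqrt(sum(wi * wi for wi in w))
    return abs(dot + b) / norm

# ||(0.6, 0.8)||_2 = 1, so the distance is just |0.6*3 + 0.8*4 - 1| = 4
print(distance([3.0, 4.0], [0.6, 0.8], -1.0))   # 4.0
```

For a maximum-margin classifier, this is the quantity being maximized over the support vectors on either side of the boundary.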
Soft-Margin Classification
Transforming the Data
[Figure: points mapped by f(·) from the input space to the feature space]
Input space → Feature space
Note: the feature space is of higher dimension
than the input space in practice.
• Computation in the feature space can be costly because it is high
dimensional
• The feature space is typically infinite-dimensional!
• The kernel trick comes to the rescue
Non-linear SVMs
Datasets that are linearly separable with noise work
out great:
[Figure: 1-D points along the x-axis, separable by a single threshold]
But what are we going to do if the dataset is just
too hard?
[Figure: 1-D points along the x-axis that no single threshold can separate]
Kernel Trick!!!
SVM = Linear SVM + Kernel Trick
This slide is courtesy of [Link]/~pift6080/documents/papers/svm_tutorial.ppt
Kernel Trick Motivation
• Linear classifiers are well understood, widely used,
and efficient.
• How can we use linear classifiers to build non-linear
ones?
• Neural networks: construct non-linear classifiers by
using a network of linear classifiers (perceptrons).
• Kernels:
o Map the problem from the input space to a new higher-dimensional
space (called the feature space) by doing a non-linear
transformation using a special function called the kernel.
o Then use a linear model in this new high-dimensional feature space.
The linear model in the feature space corresponds to a non-linear
model in the input space.
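The kernel idea in miniature: for the degree-2 polynomial kernel K(x, z) = (x · z)² on 2-D inputs, the explicit feature map is φ(x) = (x1², √2·x1·x2, x2²), and the inner product in feature space equals the kernel evaluated in the original input space. The test points below are invented.

```python
# The kernel trick in miniature: computing K(x, z) directly in input
# space gives the same number as an inner product after the explicit
# degree-2 feature map. Test points are invented for illustration.
import math

def phi(x):
    x1, x2 = x
    return (x1 * x1, math.sqrt(2) * x1 * x2, x2 * x2)

def kernel(x, z):
    return (x[0] * z[0] + x[1] * z[1]) ** 2

x, z = (1.0, 2.0), (3.0, 0.5)

lhs = sum(a * b for a, b in zip(phi(x), phi(z)))   # feature-space dot product
print(lhs, kernel(x, z))   # both equal 16.0
```

The kernel evaluates a 3-dimensional inner product without ever constructing the mapped vectors; for richer kernels (e.g. RBF) the implicit feature space is infinite-dimensional, which is exactly why the trick matters.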
Non-linear SVMs: Feature Space
General idea: the original input space can be
mapped to some higher-dimensional feature
space where the training set is separable:
Φ: x → φ(x)
SVM Limitations
• Uses a binary (yes/no) decision rule
• Generates a distance from the hyperplane, but this distance is often not a good
measure of our “confidence” in the classification
• Can produce a “probability” as a function of the distance (e.g. using sigmoid fits),
but these are often inadequate
• The number of support vectors grows linearly with the size of the data set
• Requires the estimation of the trade-off parameter, C, via held-out sets
[Figure: training-set error and open-loop (test) error plotted against model
complexity, with the optimum at intermediate complexity]
Logistic Regression
Logistic Regression is a machine learning algorithm
used for classification problems. It is a
predictive analysis algorithm based on the concept
of probability.
Logistic Regression and cost function
• The cost function represents the optimization objective:
we create a cost function and minimize the error.
• The cost function of linear regression:
    J(θ) = (1/2m) Σ_{i=1}^{m} ( h_θ(x⁽ⁱ⁾) − y⁽ⁱ⁾ )²
• The cost function of logistic regression (per example):
    Cost(h_θ(x), y) = −log(h_θ(x))        if y = 1
    Cost(h_θ(x), y) = −log(1 − h_θ(x))    if y = 0
  where h_θ(x) = 1 / (1 + e^(−θᵀx)); its graph is the S-shaped sigmoid
  curve, which squashes any input into the range (0, 1).
• The above two cases can be compressed into a single function:
    J(θ) = −(1/m) Σ_{i=1}^{m} [ y⁽ⁱ⁾ log(h_θ(x⁽ⁱ⁾)) + (1 − y⁽ⁱ⁾) log(1 − h_θ(x⁽ⁱ⁾)) ]
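The compressed cost can be evaluated by hand on a tiny invented example of three predicted probabilities h and their true labels y:

```python
# Evaluating the compressed logistic-regression cost
# J = -(1/m) * sum( y*log(h) + (1-y)*log(1-h) )
# on a tiny invented example.
import math

def logistic_cost(h, y):
    m = len(y)
    return -sum(yi * math.log(hi) + (1 - yi) * math.log(1 - hi)
                for hi, yi in zip(h, y)) / m

h = [0.9, 0.2, 0.8]   # predicted probabilities (invented)
y = [1,   0,   1]     # true labels

print(round(logistic_cost(h, y), 4))
```

Note how each example contributes only one of the two terms, since either y or (1 − y) is zero; confident wrong predictions (h near 0 when y = 1) are punished with a very large cost.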