0% found this document useful (0 votes)

77 views35 pages

Deep Learning Forward Pass Explained

Uploaded by

Javier Gonzalez

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

77 views35 pages

Deep Learning Forward Pass Explained

Uploaded by

Javier Gonzalez

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Running a forward

pass
INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

Senior Data Science Content Developer
What is a forward pass?
Input data is passed forward or Some possible outputs:
propagated through a network
Binary classification
Computations performed at each layer Single probability between 0 and 1
Outputs of each layer passed to each
Multiclass classification
subsequent layer
Distribution of probabilities summing to 1
Output of final layer: "prediction"
Regression values
Used for both training and prediction Continuous numerical predictions

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Is there also a backward pass?
Backward pass, or backpropagation is used to update weights and biases during training

In the "training loop", we:

1. Propagate data forward
2. Compare outputs to true values (ground-truth)

3. Backpropagate to update model weights and biases

4. Repeat until weights and biases are tuned to produce useful outputs

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Binary classification: forward pass
# Create input data of shape 5x6
input_data = [Link](
[[-0.4421, 1.5207, 2.0607, -0.3647, 0.4691, 0.0946],
[-0.9155, -0.0475, -1.3645, 0.6336, -1.9520, -0.3398],
[ 0.7406, 1.6763, -0.8511, 0.2432, 0.1123, -0.0633],
[-1.6630, -0.0718, -0.1285, 0.5396, -0.0288, -0.8622],
[-0.7413, 1.7920, -0.0883, -0.6685, 0.4745, -0.4245]])

# Create binary classification model

model = [Link](
[Link](6, 4), # First linear layer
[Link](4, 1), # Second linear layer
[Link]() # Sigmoid activation function
)

# Pass input data through model

output = model(input_data)

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Binary classification: forward pass
print(output)

tensor([[0.5188], [0.3761], [0.5015], [0.3718], [0.4663]],

grad_fn=<SigmoidBackward0>)

Outputs:
five probabilities between zero and one

one value for each sample (row) in data

Classification:
Class = 1 for first and third values: 0.5188 , 0.5015

Class = 0 for second, fourth and fifth values: 0.3761 , 0.3718 , 0.4633

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Multi-class classification: forward pass
# Specify model has three classes
n_classes = 3

# Create multiclass classification model

model = [Link](
[Link](6, 4), # First linear layer
[Link](4, n_classes), # Second linear layer
[Link](dim=-1) # Softmax activation
)

# Pass input data through model

output = model(input_data)
print([Link])

[Link]([5, 3])

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Multi-class classification: forward pass
print(output)

tensor([[0.4969, 0.3606, 0.1425],

[0.5105, 0.3262, 0.1633],
[0.3253, 0.3174, 0.3572],
[0.5499, 0.3361, 0.1141],
[0.4117, 0.3366, 0.2517]], grad_fn=<SoftmaxBackward0>)

Outputs:
The output dimension is 5 × 3

Each row sums to one

Value with highest probability is assigned predicted label in each row

Row 1 = class 1 (mammal), row 2 = class 1 (mammal), row 3 = class 3 (reptile)

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Regression: forward pass
# Create regression model tensor([[0.3818],
model = [Link]( [0.0712],
[Link](6, 4), # First linear layer [0.3376],
[Link](4, 1) # Second linear layer [0.0231],
) [0.0757]],
grad_fn=<AddmmBackward0>)
# Pass input data through model
output = model(input_data)

# Return output
print(output)

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Let's practice!
INTRODUCTION TO DEEP LEARNING WITH PYTORCH
Using loss functions
to assess model
predictions
INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

Senior Data Science Content Developer
Why do we need a loss function?
Loss function:

Gives feedback to model during training

Takes in model prediction y^ and ground truth y

Outputs a float

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Why do we need a loss function?
hair feathers eggs milk airborne aquatic predator toothed backbone breathes venomous fins legs tail domestic catsize class
1 0 0 1 0 0 1 1 1 1 0 0 4 0 0 1 0

Predicted class = 0 -> correct = low loss

Predicted class = 1 -> wrong = high loss

Predicted class = 2 -> wrong = high loss

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

One-hot encoding concepts
loss = F (y, y^)
y is a single integer (class label)
e.g. y = 0 when y is a mammal

y^ is a tensor (output of softmax)

If N is the number of classes, e.g. N = 3

y^ is a tensor with N dimensions,

e.g. y^ = [0.57492, 0.034961, 0.15669]

How do we compare an integer with a tensor?

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

One-hot encoding concepts
Transforming true label to tensor of zeros and ones

one_hot_numpy = [Link]([1, 0, 0])

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Transforming labels with one-hot encoding
import [Link] as F

F.one_hot([Link](0), num_classes = 3)

tensor([1, 0, 0])

F.one_hot([Link](1), num_classes = 3)

tensor([0, 1, 0])

F.one_hot([Link](2), num_classes = 3)

tensor([0, 0, 1])

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Cross entropy loss in PyTorch
from [Link] import CrossEntropyLoss

scores = tensor([[-0.1211, 0.1059]])

one_hot_target = tensor([[1, 0]])

criterion = CrossEntropyLoss()
criterion([Link](), one_hot_target.double())

tensor(0.8131, dtype=torch.float64)

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Bringing it all together
Loss function takes

scores
model predictions before the final softmax function

one_hot_target
one hot encoded ground truth label

and outputs

loss
a single float.

Our training goal is to minimize loss.

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Let's practice!
INTRODUCTION TO DEEP LEARNING WITH PYTORCH
Using derivatives to
update model
parameters
INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

Senior Data Science Content Developer
Minimizing the loss
We need to minimize loss

High loss: model prediction is wrong

Low loss: model prediction is correct

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

An analogy for derivatives
Hiking down a mountain to the valley floor:

steep slopes:
a step makes us lose a lot of elevation =
derivative is high (red arrows)

gentler slopes:
a step makes us lose a little bit of
elevation = derivative is low (green
arrows)

valley floor:
not losing elevation by taking a step =
derivative is null (blue arrow)

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Connecting derivatives and model training
Model training: updating a model's parameters to minimize the loss.

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Backpropagation concepts
Consider a network made of three layers,
L0, L1 and L2
we calculate local gradients for L0, L1
and L2 using backpropagation
we calculate loss gradients with respect
to L2, then use L2 gradients to calculate
L1 gradients, and so on

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Backpropagation in PyTorch
# Create the model and run a forward pass
model = [Link]([Link](16, 8),
[Link](8, 4),
[Link](4, 2))
prediction = model(sample)

# Calculate the loss and compute the gradients

criterion = CrossEntropyLoss()
loss = criterion(prediction, target)
[Link]()

# Access each layer's gradients

model[0].[Link], model[0].[Link]
model[1].[Link], model[1].[Link]
model[2].[Link], model[2].[Link]

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Updating model parameters
Update the weights by subtracting local # Learning rate is typically small
gradients scaled by the learning rate lr = 0.001

# Update the weights

weight = model[0].weight
weight_grad = model[0].[Link]
weight = weight - lr * weight_grad

# Update the biases

bias = model[0].bias
bias_grad = model[0].[Link]
bias = bias - lr * bias_grad

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Convex and non-convex functions
This is a convex function. This is a non-convex function.

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Gradient descent
For non-convex functions, we will use an iterative process such as gradient descent
In PyTorch, an optimizer takes care of weight updates

The most common optimizer is stochastic gradient descent (SGD)

import [Link] as optim

# Create the optimizer

optimizer = [Link]([Link](), lr=0.001)

Optimizer handles updating model parameters (or weights) after calculation of local
gradients

[Link]()

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Let's practice!
INTRODUCTION TO DEEP LEARNING WITH PYTORCH
Writing our first
training loop
INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

Senior Data Science Content Developer
Training a neural network
1. Create a model
2. Choose a loss function

3. Create a dataset

4. Define an optimizer

5. Run a training loop, where for each sample of the dataset, we repeat:
Calculating loss (forward pass)

Calculating local gradients

Updating model parameters

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Introducing the Data Science Salary dataset
This dataset contains salary data for data science-related jobs.
The features are: experience_level , employment_type , remote_ratio and company_size .
They were turned into categories.

experience_level employment_type remote_ratio company_size salary_in_usd

0 0 0.5 1 0.036
1 0 1.0 2 0.133
2 0 0.0 1 0.234
1 0 1.0 0 0.076
2 0 1.0 1 0.170

The target is salary in US dollars; it is not a category but a continuous quantity

For regression problems, we cannot use softmax or sigmoid as last activation function

We need a different loss function than cross-entropy

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Introducing the Mean Squared Error Loss
The mean squared error loss (MSE loss) is the squared difference between the prediction
and the ground truth.

def mean_squared_loss(prediction, target):

return [Link]((prediction - target)**2)

in PyTorch

criterion = [Link]()
# Prediction and target are float tensors
loss = criterion(prediction, target)

This loss is used for regression problems (e.g., when trying to fit a linear regression model).

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Before the training loop
# Create the dataset and the dataloader
dataset = TensorDataset([Link](features).float(), [Link](target).float())
dataloader = DataLoader(dataset, batch_size=4, shuffle=True)

# Create the model

model = [Link]([Link](4, 2),
[Link](2, 1))

# Create the loss and optimizer

criterion = [Link]()
optimizer = [Link]([Link](), lr=0.001)

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

The training loop
# Loop through the dataset multiple times
for epoch in range(num_epochs):
for data in dataloader:
# Set the gradients to zero
optimizer.zero_grad()
# Get feature and target from the data loader
feature, target = data
# Run a forward pass
pred = model(feature)
# Compute loss and gradients
loss = criterion(pred, target)
[Link]()
# Update the parameters
[Link]()

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Let's practice!
INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Forward Pass in PyTorch Explained
No ratings yet
Forward Pass in PyTorch Explained
35 pages
Deep Learning Forward Pass in PyTorch
No ratings yet
Deep Learning Forward Pass in PyTorch
35 pages
Training a Neural Network With PyTorch (Chapter3)
No ratings yet
Training a Neural Network With PyTorch (Chapter3)
31 pages
Deep Learning Basics with PyTorch
No ratings yet
Deep Learning Basics with PyTorch
35 pages
Deep Learning Basics with PyTorch
No ratings yet
Deep Learning Basics with PyTorch
50 pages
Kaiming Initialization in PyTorch
No ratings yet
Kaiming Initialization in PyTorch
37 pages
Deep Learning with PyTorch Guide
No ratings yet
Deep Learning with PyTorch Guide
34 pages
Deep Learning with PyTorch Guide
No ratings yet
Deep Learning with PyTorch Guide
34 pages
Deep Learning with PyTorch Overview
No ratings yet
Deep Learning with PyTorch Overview
30 pages
Introduction to PyTorch for ML Modeling
No ratings yet
Introduction to PyTorch for ML Modeling
45 pages
Deep Learning Activation Functions
No ratings yet
Deep Learning Activation Functions
26 pages
Deep Learning Activation Functions
No ratings yet
Deep Learning Activation Functions
26 pages
PyTorch Deep Learning Course Overview
No ratings yet
PyTorch Deep Learning Course Overview
6 pages
Deep Learning with PyTorch Basics
No ratings yet
Deep Learning with PyTorch Basics
39 pages
Introduction to PyTorch Basics
No ratings yet
Introduction to PyTorch Basics
25 pages
PyTorch Deep Learning Basics Guide
No ratings yet
PyTorch Deep Learning Basics Guide
8 pages
02 Pytorch Classification
No ratings yet
02 Pytorch Classification
22 pages
PyTorch Crash Course Overview
No ratings yet
PyTorch Crash Course Overview
15 pages
PyTorch 101 for Deep Learning PhD
No ratings yet
PyTorch 101 for Deep Learning PhD
19 pages
PyTorch Tensors and Gradients Guide
No ratings yet
PyTorch Tensors and Gradients Guide
10 pages
PyTorch Feedforward Neural Network Guide
No ratings yet
PyTorch Feedforward Neural Network Guide
13 pages
CSE 251B Deep Learning Announcement
No ratings yet
CSE 251B Deep Learning Announcement
37 pages
Deep Learning with PyTorch Guide
No ratings yet
Deep Learning with PyTorch Guide
1 page
PyTorch Crash Course: Tensors & Autograd
No ratings yet
PyTorch Crash Course: Tensors & Autograd
16 pages
Fundamentals of Deep Learning Overview
No ratings yet
Fundamentals of Deep Learning Overview
195 pages
Deep Learning with Torch Cheat Sheet
No ratings yet
Deep Learning with Torch Cheat Sheet
2 pages
Deep Learning with PyTorch Overview
No ratings yet
Deep Learning with PyTorch Overview
72 pages
Multiple Linear Regression in PyTorch
No ratings yet
Multiple Linear Regression in PyTorch
13 pages
Esrgan PDF
No ratings yet
Esrgan PDF
14 pages
Introduction to Machine Learning Course
No ratings yet
Introduction to Machine Learning Course
11 pages
Gradient Descent for Linear Models Lab
No ratings yet
Gradient Descent for Linear Models Lab
7 pages
Build Your First Deep Neural Network
No ratings yet
Build Your First Deep Neural Network
27 pages
Machine Learning with TensorFlow and NumPy
No ratings yet
Machine Learning with TensorFlow and NumPy
35 pages
PyTorch Feedforward Network Guide
No ratings yet
PyTorch Feedforward Network Guide
18 pages
Deep Learning Fundamentals with PyTorch
No ratings yet
Deep Learning Fundamentals with PyTorch
108 pages
Beginner's Guide to Deep Learning with PyTorch
No ratings yet
Beginner's Guide to Deep Learning with PyTorch
1,309 pages
Batch Normalization in MLP Training
No ratings yet
Batch Normalization in MLP Training
37 pages
Train Your First Neural Network in PyTorch
No ratings yet
Train Your First Neural Network in PyTorch
68 pages
PyTorch Neural Network Guide for Beginners
No ratings yet
PyTorch Neural Network Guide for Beginners
17 pages
ESRGAN
No ratings yet
ESRGAN
21 pages
PyTorch: Bridging Research and Production
No ratings yet
PyTorch: Bridging Research and Production
108 pages
PyTorch Neural Network Training Guide
No ratings yet
PyTorch Neural Network Training Guide
48 pages
Deep Learning for Image Classification
No ratings yet
Deep Learning for Image Classification
123 pages
Deep Learning: Gradient Descent Explained
No ratings yet
Deep Learning: Gradient Descent Explained
41 pages
DL lab 3 - Neural Network with PyTorch
No ratings yet
DL lab 3 - Neural Network with PyTorch
6 pages
PyTorch Image Classification Guide
No ratings yet
PyTorch Image Classification Guide
40 pages
PyTorch Image Classification Guide
No ratings yet
PyTorch Image Classification Guide
40 pages
Lec9 PyTorchExample
No ratings yet
Lec9 PyTorchExample
31 pages
Dive Into Deep Learning
No ratings yet
Dive Into Deep Learning
105 pages
Introduction to PyTorch Tensors
No ratings yet
Introduction to PyTorch Tensors
8 pages
PyTorch Neural Network Tutorial
No ratings yet
PyTorch Neural Network Tutorial
64 pages
Deep Learning
No ratings yet
Deep Learning
35 pages
Deep Learning with Neural Networks
No ratings yet
Deep Learning with Neural Networks
104 pages
Neural Networks and Gradient Learning
No ratings yet
Neural Networks and Gradient Learning
72 pages
Deep Learning Fundamentals and Techniques
No ratings yet
Deep Learning Fundamentals and Techniques
42 pages
CIFAR-10 Classification with MLP
No ratings yet
CIFAR-10 Classification with MLP
15 pages
TensorFlow One-Hot Encoding Guide
No ratings yet
TensorFlow One-Hot Encoding Guide
8 pages
Tesco's AI-Driven Marketing Insights
No ratings yet
Tesco's AI-Driven Marketing Insights
3 pages
AI Applications in Marketing: A Review
100% (1)
AI Applications in Marketing: A Review
8 pages
Beginner's Data Science Course Outline
No ratings yet
Beginner's Data Science Course Outline
3 pages
Python Colab Data Analysis Techniques
No ratings yet
Python Colab Data Analysis Techniques
71 pages
Deep Learning in Breast Cancer Chemotherapy
No ratings yet
Deep Learning in Breast Cancer Chemotherapy
37 pages
Multi-Source Transfer Learning for TSF
No ratings yet
Multi-Source Transfer Learning for TSF
25 pages
UPES Computer Science Student List
No ratings yet
UPES Computer Science Student List
28 pages
Analyzing IoT Data for Business Insights
No ratings yet
Analyzing IoT Data for Business Insights
35 pages
LSTM Predictions for VGF-GaAs Growth
No ratings yet
LSTM Predictions for VGF-GaAs Growth
13 pages
Brochure YP - AI Machine Learning Bootcamp
No ratings yet
Brochure YP - AI Machine Learning Bootcamp
7 pages
Crime Analysis and Prediction Using Optimized K-Means Algorithm
No ratings yet
Crime Analysis and Prediction Using Optimized K-Means Algorithm
4 pages
Fall 2024 Exam Schedule for CCE
No ratings yet
Fall 2024 Exam Schedule for CCE
1 page
COMPAS 2025 Conference Program Overview
No ratings yet
COMPAS 2025 Conference Program Overview
16 pages
Understanding Automated Systems and AI
No ratings yet
Understanding Automated Systems and AI
22 pages
Interview Prep for Developers & Data Scientists
No ratings yet
Interview Prep for Developers & Data Scientists
2 pages
AI Data Security Challenges and Solutions
No ratings yet
AI Data Security Challenges and Solutions
5 pages
Logistic Regression Interview Insights
100% (1)
Logistic Regression Interview Insights
39 pages
AI Engineer with Machine Learning Expertise
No ratings yet
AI Engineer with Machine Learning Expertise
3 pages
Data Science Certification Course Overview
No ratings yet
Data Science Certification Course Overview
36 pages
Multilayer Perceptron in Neural Networks
No ratings yet
Multilayer Perceptron in Neural Networks
71 pages
Smart Event Management App Development
No ratings yet
Smart Event Management App Development
8 pages
AI & Data Science Curriculum: Year 3
No ratings yet
AI & Data Science Curriculum: Year 3
3 pages
Flood Susceptibility Mapping in Nigeria
No ratings yet
Flood Susceptibility Mapping in Nigeria
24 pages
Deep Learning for NLP: A Guide
No ratings yet
Deep Learning for NLP: A Guide
373 pages
12-Week AI Job-Getting Plan
No ratings yet
12-Week AI Job-Getting Plan
3 pages
AI Tools for Efficient Literature Review
No ratings yet
AI Tools for Efficient Literature Review
14 pages
AI Business Opportunities Unveiled
100% (1)
AI Business Opportunities Unveiled
99 pages
Enhancing CRM with Hyper-Personalization
No ratings yet
Enhancing CRM with Hyper-Personalization
23 pages
Introduction to Data Science Concepts
No ratings yet
Introduction to Data Science Concepts
25 pages
Driver Drowsiness Detection Proposal
No ratings yet
Driver Drowsiness Detection Proposal
44 pages