PyTorch Neural Network Training Guide

Uploaded by Da HUANG

Machine Learning

PyTorch Tutorial
TA: 曾元 (Yuan Tseng)
2022.02.18
Outline
● Background: Prerequisites & What is PyTorch?
● Training & Testing Neural Networks in PyTorch
● Dataset & Dataloader
● Tensors
● torch.nn: Models, Loss Functions
● torch.optim: Optimization
● Save/load models
Prerequisites
● We assume you are already familiar with…
1. Python3
■ if-else, loop, function, file IO, class, ...
■ refs: link1, link2, link3
2. Deep Learning Basics
■ Prof. Lee’s 1st & 2nd lecture videos from last year
■ ref: link1, link2

Some knowledge of NumPy will also be useful!


What is PyTorch?
● A machine learning framework in Python.
● Two main features:
○ N-dimensional Tensor computation (like NumPy) on GPUs
○ Automatic differentiation for training deep neural networks
Training Neural Networks

Define Neural Network → Loss Function → Optimization Algorithm → Training

More info about the training process in last year's lecture video.
Training & Testing Neural Networks

Training → Validation → Testing

Guide for training/validation/testing can be found here.

Training & Testing Neural Networks – in PyTorch
Step 1. Load Data (Dataset & DataLoader)

Training → Validation → Testing

Dataset & Dataloader
● Dataset: stores data samples and expected values
● Dataloader: groups data into batches, enables multiprocessing

dataset = MyDataset(file)
dataloader = DataLoader(dataset, batch_size, shuffle=True)

shuffle=True for training, shuffle=False for testing.

More info about batches and shuffling here.


Dataset & Dataloader
from torch.utils.data import Dataset, DataLoader

class MyDataset(Dataset):
    def __init__(self, file):
        self.data = ...              # read data & preprocess

    def __getitem__(self, index):
        return self.data[index]      # returns one sample at a time

    def __len__(self):
        return len(self.data)        # returns the size of the dataset
Dataset & Dataloader
dataset = MyDataset(file)
dataloader = DataLoader(dataset, batch_size=5, shuffle=False)

With batch_size=5, the DataLoader calls __getitem__(0) through __getitem__(4)
on the Dataset and groups the five returned samples into one mini-batch.
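The pattern above can be sketched end to end. ToyDataset and its in-memory list of ten scalars are hypothetical stand-ins for reading a real file:

```python
import torch
from torch.utils.data import Dataset, DataLoader

class ToyDataset(Dataset):
    """A hypothetical in-memory dataset of ten scalar samples."""
    def __init__(self):
        self.data = [float(i) for i in range(10)]

    def __getitem__(self, index):
        # Return one sample at a time.
        return torch.tensor(self.data[index])

    def __len__(self):
        # Return the size of the dataset.
        return len(self.data)

dataset = ToyDataset()
dataloader = DataLoader(dataset, batch_size=5, shuffle=False)

batches = list(dataloader)
print(len(batches))      # 10 samples / batch_size 5 = 2 mini-batches
print(batches[0].shape)  # default collation stacks samples: torch.Size([5])
```

With shuffle=False the samples appear in dataset order, which is what you usually want for testing; training typically uses shuffle=True.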
Tensors
● High-dimensional matrices (arrays)

1-D tensor: e.g. audio
2-D tensor: e.g. black & white images
3-D tensor: e.g. RGB images
Tensors – Shape of Tensors
● Check with .shape

(5,)       1-D tensor with 5 elements (dim 0)
(3, 5)     2-D tensor, 3 × 5 (dim 0, dim 1)
(4, 5, 3)  3-D tensor, 4 × 5 × 3 (dim 0, dim 1, dim 2)

Note: dim in PyTorch == axis in NumPy


Tensors – Creating Tensors
● Directly from data (list or np.ndarray)

x = torch.tensor([[1., -1.], [-1., 1.]])
x = torch.from_numpy(np.array([[1., -1.], [-1., 1.]]))
# tensor([[ 1., -1.],
#         [-1.,  1.]])

● Tensor of constant zeros & ones

x = torch.zeros([2, 2])
# tensor([[0., 0.],
#         [0., 0.]])

x = torch.ones([1, 2, 5])    # argument is the shape
# tensor([[[1., 1., 1., 1., 1.],
#          [1., 1., 1., 1., 1.]]])
Tensors – Common Operations
Common arithmetic functions are supported, such as:

● Addition:     z = x + y
● Subtraction:  z = x - y
● Power:        y = x.pow(2)
● Summation:    y = x.sum()
● Mean:         y = x.mean()
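The operations above can be tried directly; the values here are illustrative:

```python
import torch

x = torch.tensor([[1., 2.], [3., 4.]])
y = torch.ones(2, 2)

z1 = x + y     # elementwise addition
z2 = x - y     # elementwise subtraction
s = x.sum()    # sum of all elements -> tensor(10.)
m = x.mean()   # mean of all elements -> tensor(2.5)
p = x.pow(2)   # elementwise square
```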
Tensors – Common Operations
● Transpose: swap two specified dimensions

>>> x = torch.zeros([2, 3])
>>> x.shape
torch.Size([2, 3])
>>> x = x.transpose(0, 1)
>>> x.shape
torch.Size([3, 2])
Tensors – Common Operations
● Squeeze: remove the specified dimension with length = 1

>>> x = torch.zeros([1, 2, 3])
>>> x.shape
torch.Size([1, 2, 3])
>>> x = x.squeeze(0)        # dim = 0
>>> x.shape
torch.Size([2, 3])
Tensors – Common Operations
● Unsqueeze: expand a new dimension

>>> x = torch.zeros([2, 3])
>>> x.shape
torch.Size([2, 3])
>>> x = x.unsqueeze(1)      # dim = 1
>>> x.shape
torch.Size([2, 1, 3])
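Transpose, squeeze, and unsqueeze can be chained; a quick sketch that checks each resulting shape:

```python
import torch

x = torch.zeros([1, 2, 3])
x = x.squeeze(0)       # remove length-1 dim 0: (1, 2, 3) -> (2, 3)
assert x.shape == torch.Size([2, 3])

x = x.transpose(0, 1)  # swap dims 0 and 1: (2, 3) -> (3, 2)
assert x.shape == torch.Size([3, 2])

x = x.unsqueeze(1)     # insert a new dim at position 1: (3, 2) -> (3, 1, 2)
assert x.shape == torch.Size([3, 1, 2])
```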
Tensors – Common Operations
● Cat: concatenate multiple tensors along a dimension

>>> x = torch.zeros([2, 1, 3])
>>> y = torch.zeros([2, 3, 3])
>>> z = torch.zeros([2, 2, 3])
>>> w = torch.cat([x, y, z], dim=1)
>>> w.shape
torch.Size([2, 6, 3])

more operators: [Link]
Tensors – Data Type
● Using different data types for the model and the data will cause errors.

Data type               dtype          tensor
32-bit floating point   torch.float    torch.FloatTensor
64-bit integer (signed) torch.long     torch.LongTensor

see official documentation for more information on data types.
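A minimal sketch of the kind of error a dtype mismatch produces, and the usual fix of casting the data; the layer sizes are arbitrary:

```python
import torch
import torch.nn as nn

layer = nn.Linear(4, 2)                      # parameters are torch.float
x_long = torch.ones(3, 4, dtype=torch.long)  # 64-bit integer input

try:
    layer(x_long)                            # dtype mismatch -> RuntimeError
except RuntimeError as e:
    print("dtype error:", e)

y = layer(x_long.float())                    # cast the data to torch.float first
print(y.dtype)                               # torch.float32
```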


Tensors – PyTorch v.s. NumPy
● Similar attributes

PyTorch     NumPy
x.shape     x.shape
x.dtype     x.dtype

ref: [Link]
Tensors – PyTorch v.s. NumPy
● Many functions have the same names as well

PyTorch               NumPy
x.reshape / x.view    x.reshape
x.squeeze()           x.squeeze()
x.unsqueeze(1)        np.expand_dims(x, 1)

ref: [Link]
Tensors – Device
● Tensors & modules are computed on the CPU by default

Use .to() to move tensors to the appropriate device.

● CPU
x = x.to('cpu')
● GPU
x = x.to('cuda')
Tensors – Device (GPU)
● Check if your computer has an NVIDIA GPU

torch.cuda.is_available()

● Multiple GPUs: specify 'cuda:0', 'cuda:1', 'cuda:2', ...

● Why use GPUs?
○ Parallel computing with more cores for arithmetic calculations
○ See What is a GPU and do you need one in deep learning?
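A common pattern is to pick the device once and move everything with .to(); this sketch falls back to the CPU when no GPU is present:

```python
import torch

# Use the GPU when one is available, otherwise fall back to the CPU.
device = 'cuda' if torch.cuda.is_available() else 'cpu'

x = torch.ones(2, 3)
x = x.to(device)   # move the tensor to the chosen device
print(x.device)
```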
Tensors – Gradient Calculation
>>> x = torch.tensor([[1., 0.], [-1., 1.]], requires_grad=True)
>>> z = x.pow(2).sum()
>>> z.backward()
>>> x.grad
tensor([[ 2.,  0.],
        [-2.,  2.]])

See here to learn about gradient calculation.


Training & Testing Neural Networks – in PyTorch
Step 2. Define Neural Network (torch.nn)

Load Data → Define Neural Network → Loss Function → Optimization Algorithm

Training → Validation → Testing
torch.nn – Network Layers
● Linear Layer (Fully-connected Layer)

nn.Linear(in_features, out_features)

nn.Linear(32, 64): input tensor (*, 32) → output tensor (*, 64)

* can be any shape, but the last dimension must be 32,
e.g. (10, 32), (10, 5, 32), (1, 1, 3, 32), ...
torch.nn – Network Layers
● Linear Layer (Fully-connected Layer)

ref: last year's lecture video


torch.nn – Neural Network Layers
● Linear Layer (Fully-connected Layer)

Maps inputs x = (x1, ..., x32) to outputs y = (y1, ..., y64): W x + b = y, with W of shape (64 × 32).
torch.nn – Network Parameters
● Linear Layer (Fully-connected Layer): W x + b = y, W of shape (64 × 32)

>>> layer = torch.nn.Linear(32, 64)
>>> layer.weight.shape
torch.Size([64, 32])
>>> layer.bias.shape
torch.Size([64])
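The weight/bias shapes and the "any leading shape" rule can be verified directly; the example shapes below are the ones from the earlier slide:

```python
import torch
import torch.nn as nn

layer = nn.Linear(32, 64)
assert layer.weight.shape == torch.Size([64, 32])  # W
assert layer.bias.shape == torch.Size([64])        # b

# Any leading shape works as long as the last dimension is in_features (32).
for shape in [(10, 32), (10, 5, 32), (1, 1, 3, 32)]:
    y = layer(torch.zeros(shape))
    assert tuple(y.shape) == (*shape[:-1], 64)     # last dim becomes 64
```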
torch.nn – Non-Linear Activation Functions
● Sigmoid Activation

nn.Sigmoid()

● ReLU Activation

nn.ReLU()

See here to learn about why we need activation functions.


torch.nn – Build your own neural network
import torch.nn as nn

class MyModel(nn.Module):
    def __init__(self):
        # Initialize your model & define layers
        super(MyModel, self).__init__()
        self.net = nn.Sequential(
            nn.Linear(10, 32),
            nn.Sigmoid(),
            nn.Linear(32, 1)
        )

    def forward(self, x):
        # Compute output of your NN
        return self.net(x)
torch.nn – Build your own neural network
The nn.Sequential version above is equivalent to defining each layer separately:

import torch.nn as nn

class MyModel(nn.Module):
    def __init__(self):
        super(MyModel, self).__init__()
        self.layer1 = nn.Linear(10, 32)
        self.layer2 = nn.Sigmoid()
        self.layer3 = nn.Linear(32, 1)

    def forward(self, x):
        out = self.layer1(x)
        out = self.layer2(out)
        out = self.layer3(out)
        return out
Training & Testing Neural Networks – in PyTorch
Step 3. Loss Function (torch.nn.MSELoss, torch.nn.CrossEntropyLoss, etc.)

Load Data → Define Neural Network → Loss Function → Optimization Algorithm

Training → Validation → Testing
torch.nn – Loss Functions
● Mean Squared Error (for regression tasks)

criterion = nn.MSELoss()

● Cross Entropy (for classification tasks)

criterion = nn.CrossEntropyLoss()

● loss = criterion(model_output, expected_value)
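A small sketch of both loss functions on hand-made values (the numbers are illustrative):

```python
import torch
import torch.nn as nn

# Regression: mean squared error.
criterion = nn.MSELoss()
pred = torch.tensor([1., 2., 3.])
target = torch.tensor([1., 2., 5.])
loss = criterion(pred, target)            # mean of (0^2, 0^2, 2^2) = 4/3

# Classification: cross entropy takes raw logits and integer class labels.
criterion_cls = nn.CrossEntropyLoss()
logits = torch.tensor([[2.0, 0.5, 0.1]])  # one sample, three classes
label = torch.tensor([0])                 # index of the correct class
loss_cls = criterion_cls(logits, label)
```

Note that nn.CrossEntropyLoss applies softmax internally, so the model output should be raw scores, not probabilities.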


Training & Testing Neural Networks – in PyTorch
Step 4. Optimization Algorithm (torch.optim)

Load Data → Define Neural Network → Loss Function → Optimization Algorithm

Training → Validation → Testing
torch.optim
● Gradient-based optimization algorithms that adjust network parameters to reduce error. (See Adaptive Learning Rate lecture video.)

● E.g. Stochastic Gradient Descent (SGD)

torch.optim.SGD(model.parameters(), lr, momentum=0)

torch.optim
optimizer = torch.optim.SGD(model.parameters(), lr, momentum=0)

● For every batch of data:
1. Call optimizer.zero_grad() to reset gradients of model parameters.
2. Call loss.backward() to backpropagate gradients of prediction loss.
3. Call optimizer.step() to adjust model parameters.

See official documentation for more optimization algorithms.


Training & Testing Neural Networks – in PyTorch
Step 5. Entire Procedure

Load Data → Define Neural Network → Loss Function → Optimization Algorithm

Training → Validation → Testing
Neural Network Training Setup

dataset = MyDataset(file)                              # read data via MyDataset
tr_set = DataLoader(dataset, 16, shuffle=True)         # put dataset into Dataloader
model = MyModel().to(device)                           # construct model and move to device (cpu/cuda)
criterion = nn.MSELoss()                               # set loss function
optimizer = torch.optim.SGD(model.parameters(), 0.1)   # set optimizer

Neural Network Training Loop

for epoch in range(n_epochs):              # iterate n_epochs
    model.train()                          # set model to train mode
    for x, y in tr_set:                    # iterate through the dataloader
        optimizer.zero_grad()              # set gradients to zero
        x, y = x.to(device), y.to(device)  # move data to device (cpu/cuda)
        pred = model(x)                    # forward pass (compute output)
        loss = criterion(pred, y)          # compute loss
        loss.backward()                    # compute gradients (backpropagation)
        optimizer.step()                   # update model with optimizer

Neural Network Validation Loop

model.eval()                                   # set model to evaluation mode
total_loss = 0
for x, y in dv_set:                            # iterate through the dataloader
    x, y = x.to(device), y.to(device)          # move data to device (cpu/cuda)
    with torch.no_grad():                      # disable gradient calculation
        pred = model(x)                        # forward pass (compute output)
        loss = criterion(pred, y)              # compute loss
    total_loss += loss.cpu().item() * len(x)   # accumulate loss
avg_loss = total_loss / len(dv_set.dataset)    # compute averaged loss

Neural Network Testing Loop

model.eval()                       # set model to evaluation mode
preds = []
for x in tt_set:                   # iterate through the dataloader
    x = x.to(device)               # move data to device (cpu/cuda)
    with torch.no_grad():          # disable gradient calculation
        pred = model(x)            # forward pass (compute output)
        preds.append(pred.cpu())   # collect predictions
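The setup, training, and validation steps above can be combined into one self-contained sketch; the synthetic linear-regression data, layer sizes, and hyperparameters are illustrative assumptions, not values from the tutorial:

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

torch.manual_seed(0)

# Synthetic regression data (hypothetical): y = sum of the 10 input features.
x_all = torch.randn(100, 10)
y_all = x_all.sum(dim=1, keepdim=True)
tr_set = DataLoader(TensorDataset(x_all[:80], y_all[:80]), batch_size=16, shuffle=True)
dv_set = DataLoader(TensorDataset(x_all[80:], y_all[80:]), batch_size=16)

device = 'cuda' if torch.cuda.is_available() else 'cpu'
model = nn.Linear(10, 1).to(device)              # a one-layer stand-in for MyModel
criterion = nn.MSELoss()                         # loss function
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

for epoch in range(20):                          # iterate n_epochs
    model.train()                                # set model to train mode
    for x, y in tr_set:
        optimizer.zero_grad()                    # reset gradients
        x, y = x.to(device), y.to(device)
        pred = model(x)                          # forward pass
        loss = criterion(pred, y)
        loss.backward()                          # backpropagation
        optimizer.step()                         # update parameters

model.eval()                                     # set model to evaluation mode
total_loss = 0
for x, y in dv_set:
    x, y = x.to(device), y.to(device)
    with torch.no_grad():                        # disable gradient calculation
        pred = model(x)
        loss = criterion(pred, y)
    total_loss += loss.cpu().item() * len(x)     # accumulate loss
avg_loss = total_loss / len(dv_set.dataset)      # averaged validation loss
print(avg_loss)
```

Because the target is an exact linear function of the inputs, the validation loss should shrink to near zero after a few epochs.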


Notice – model.eval(), torch.no_grad()
● model.eval()

Changes the behaviour of some model layers, such as dropout and batch normalization.

● with torch.no_grad()

Prevents calculations from being added to the gradient computation graph. Usually used to prevent accidental training on validation/testing data.
Save/Load Trained Models
● Save

torch.save(model.state_dict(), path)

● Load

ckpt = torch.load(path)
model.load_state_dict(ckpt)
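A quick round-trip sketch; the temporary checkpoint path is hypothetical:

```python
import os
import tempfile
import torch
import torch.nn as nn

model = nn.Linear(4, 2)
path = os.path.join(tempfile.mkdtemp(), 'model.ckpt')  # hypothetical path

torch.save(model.state_dict(), path)  # save only the parameters, not the class

model2 = nn.Linear(4, 2)              # re-create the same architecture
ckpt = torch.load(path)
model2.load_state_dict(ckpt)          # restore the parameters

assert torch.equal(model.weight, model2.weight)  # identical weights after loading
```

Saving the state_dict rather than the whole model keeps the checkpoint independent of the class definition's file location.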
More About PyTorch
● torchaudio
○ speech/audio processing
● torchtext
○ natural language processing
● torchvision
○ computer vision
● skorch
○ scikit-learn + PyTorch
More About PyTorch
● Useful github repositories using PyTorch
○ Huggingface Transformers (transformer models: BERT, GPT, ...)
○ Fairseq (sequence modeling for NLP & speech)
○ ESPnet (speech recognition, translation, synthesis, ...)
○ Most implementations of recent deep learning papers
○ ...
References
● Machine Learning 2021 Spring Pytorch Tutorial
● Official Pytorch Tutorials
● [Link]
Any questions?
