Introduction to PyTorch
CS236 Section, Autumn 2024
Honglin Chen
What is PyTorch?
A machine learning framework that accelerates the path from research prototyping to production deployment.
Machine learning framework: deep learning primitives such as data loading, NN layer types, activations, loss functions, and optimizers
Hardware acceleration on NVIDIA GPUs
Libraries for vision, NLP, and audio applications
Research prototyping: models are Python code, automatic differentiation, eager mode
Production deployment: TorchScript, TorchServe, quantization
Motivations
Python
NumPy
Building Blocks
Tensors
Operations
Overview
Modules
Examples
MNIST
Beyond PyTorch
Tools
High Level Libraries
Domain Specific Libraries
Motivations
Python vs. NumPy
# Pure Python
X = [1] * 10000
Y = [0.5] * 10000
Z = [None] * 10000
for i in range(10000):
    Z[i] = X[i] * Y[i]
# 2.772092819213867 ms
# Interpreter overhead, 64-bit boxed Python objects

# NumPy
X = np.full((10000,), 1)
Y = np.full((10000,), 0.5)
Z = X * Y
# 0.08273124694824219 ms
# Low-level implementation, vectorization
Motivations
NumPy vs. PyTorch
# NumPy
X = np.full((10000,), 1)
Y = np.full((10000,), 0.5)
Z = X * Y
# 0.08273124694824219 ms
# Low-level implementation, vectorization

# PyTorch
X = torch.full((10000,), 1., device="cuda", requires_grad=True)
Y = torch.full((10000,), 0.5, device="cuda")
Z = X * Y
# 0.3185272216796875 ms
# GPU acceleration

Z.sum().backward()
dX = X.grad
# Automatic differentiation
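These timings are illustrative and depend on hardware. A rough sketch of how they could be measured (assuming a CUDA device is available; GPU timing needs an explicit synchronize):

import time
import numpy as np
import torch

# NumPy
X_np = np.full((10000,), 1.0)
Y_np = np.full((10000,), 0.5)
t0 = time.perf_counter()
Z_np = X_np * Y_np
print((time.perf_counter() - t0) * 1000, "ms (NumPy)")

# PyTorch on GPU
X = torch.full((10000,), 1., device="cuda")
Y = torch.full((10000,), 0.5, device="cuda")
torch.cuda.synchronize()                 # let pending GPU work finish before timing
t0 = time.perf_counter()
Z = X * Y
torch.cuda.synchronize()                 # wait for the multiply kernel to complete
print((time.perf_counter() - t0) * 1000, "ms (PyTorch, GPU)")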
Building Blocks
TENSORS
Building Blocks
Tensors / Initialization
torch.tensor([5., 3.])
tensor([5., 3.])  # defaults to torch.float32

torch.from_numpy(np.array([5., 3.]))
tensor([5., 3.], dtype=torch.float64)  # because NumPy defaults to 64-bit floats

torch.tensor([5., 3.]).numpy()
array([5., 3.], dtype=float32)
Building Blocks
Tensors / Initialization
torch.ones(5, 3)
tensor([[1., 1., 1.],
        [1., 1., 1.],
        [1., 1., 1.],
        [1., 1., 1.],
        [1., 1., 1.]])
Building Blocks
Tensors / Initialization
torch.randn(5, 3)
tensor([[ 0.2349, -0.0427, -0.5053],
        [ 0.6455,  0.1199,  0.4239],
        [ 0.1279,  0.1105,  1.4637],
        [ 0.4259, -0.0763, -0.9671],
        [ 0.6856,  0.5047,  0.4250]])
Building Blocks
Tensors / Initialization
torch.ones_like(tensor)
Input:  tensor([[ 0.2349, -0.0427, -0.5053],
                [ 0.6455,  0.1199,  0.4239]])
Output: tensor([[1., 1., 1.],
                [1., 1., 1.]])
Building Blocks
Tensors / Initialization
torch.empty(5, 3)
tensor([[ 0.0000e+00, 2.5244e-29, 0.0000e+00],
        [ 2.5244e-29, 1.4569e-19, 2.7517e+12],
        [ 7.5338e+28, 3.0313e+32, 6.3828e+28],
        [ 1.4603e-19, 1.0899e+27, 6.8943e+34],
        [ 1.1835e+22, 7.0976e+22, 1.8515e+28]])
# The values are not initialized
Building Blocks
Tensors / Indexing & Reshaping
torch.tensor([[5., 3.]])[0, :]
tensor([5., 3.])

torch.tensor([[5., 3.]]).view(-1)  # -1 infers the dimension size
torch.tensor([[5., 3.]]).view(2)
tensor([5., 3.])

torch.tensor([[5., 3.]]).size()
torch.Size([1, 2])
Building Blocks
Tensors / Broadcasting
X = torch.ones((3, 3, 3))
Y = torch.ones((1, 1, 3))
Z = X * Y
Z.size()
torch.Size([3, 3, 3])
# https://pytorch.org/docs/stable/notes/broadcasting.html
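A common use of broadcasting is applying a per-channel scale or bias without explicit tiling; a small illustrative sketch (shapes chosen arbitrarily):

import torch

imgs = torch.randn(64, 3, 32, 32)                              # batch of RGB images
channel_scale = torch.tensor([0.5, 1.0, 2.0]).view(1, 3, 1, 1)
scaled = imgs * channel_scale                                  # (64, 3, 32, 32) * (1, 3, 1, 1)
print(scaled.shape)                                            # torch.Size([64, 3, 32, 32])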
Building Blocks
Tensors / Devices
if torch.cuda.is_available():
    device = torch.device("cuda")      # a CUDA device object
    x = torch.ones(2, device=device)   # directly create a tensor on GPU
    y = torch.ones(2).to(device)       # or just use strings: .to("cuda")
    z = x + y
    print(z)                           # z is on GPU
    print(z.to("cpu", torch.double))   # .to("cpu") moves the tensor back to the CPU
# `x.cuda()` and `x.cpu()` also work
Building Blocks
Operations / Primitives
torch.tensor([5., 3.]) + torch.tensor([3., 5.])
tensor([8., 8.])

z = torch.add(x, y)
torch.add(x, y, out=z)
y.add_(x)            # in-place: y += x
torch.tanh(y)
torch.stack([x, y])

# https://pytorch.org/docs/stable/torch.html
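A minimal, self-contained version of these primitives (x and y here are just the example tensors from above):

import torch

x = torch.tensor([5., 3.])
y = torch.tensor([3., 5.])

z = torch.add(x, y)          # same as x + y -> tensor([8., 8.])
torch.add(x, y, out=z)       # writes the result into an existing tensor
y.add_(x)                    # in-place: y is now tensor([8., 8.])
print(torch.tanh(y))         # elementwise tanh, roughly tensor([1.0000, 1.0000])
print(torch.stack([x, y]))   # stacks along a new leading dim -> shape (2, 2)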
Building Blocks
Operations / Functional
import torch.nn.functional as F
X = torch.randn((64, 3, 256, 256))
W = torch.randn((8, 3, 3, 3))
out = F.conv2d(X, W, stride=1, padding=1)
# Like SciPy
# https://pytorch.org/docs/stable/nn.functional.html
Building Blocks
Operations / Automatic Differentiation
Computation as a graph built at runtime
[Figure: computation graph; leaf nodes x and 2 feed an add node that produces y]

x = torch.ones(2, 2, requires_grad=True)
tensor([[1., 1.],
        [1., 1.]], requires_grad=True)

y = x + 2
tensor([[3., 3.],
        [3., 3.]], grad_fn=<AddBackward0>)
Building Blocks
Operations / Automatic Differentiation
[Figure: the graph extended; y and 3 feed a multiply node, whose output feeds a mean node that produces out]

z = y * 3
out = z.mean()
tensor(9., grad_fn=<MeanBackward1>)

out.backward()     # must be called on a scalar
print(x.grad)      # only leaf nodes have .grad

The gradient w.r.t. the input tensors is computed step by step, in reverse from the loss back through the graph.
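The result can be checked by hand: out = mean(3 * (x + 2)), so d(out)/d(x_ij) = 3/4 = 0.75 for a 2x2 input. A self-contained sketch with the same values as the slide:

import torch

x = torch.ones(2, 2, requires_grad=True)
y = x + 2
z = y * 3
out = z.mean()      # mean(3 * (x + 2)) = 9.0 when x is all ones
out.backward()
print(x.grad)       # tensor([[0.7500, 0.7500],
                    #         [0.7500, 0.7500]])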
Building Blocks
Operations / Automatic Differentiation
x.requires_grad # True
(x ** 2).requires_grad # True
# Keeping track of activations is expensive
with torch.no_grad():
    (x ** 2).requires_grad       # False

(x.detach() ** 2).requires_grad  # False
Building Blocks
Operations / nn
# Module API
import torch.nn as nn

X = torch.ones((64, 3, 256, 256))
conv = nn.Conv2d(in_channels=3, out_channels=8,
                 kernel_size=3, stride=1, padding=1)
out = conv(X)
# Inherits from nn.Module, implemented using the functional API,
# stores internal state (the weights)

# Functional API
import torch.nn.functional as F

X = torch.randn((64, 3, 256, 256))
W = torch.randn((8, 3, 3, 3))
out = F.conv2d(X, W, stride=1, padding=1)
Building Blocks
Operations / Module
import torch.nn as nn

X = torch.ones((64, 3, 256, 256))
conv = nn.Conv2d(in_channels=3, out_channels=8,
                 kernel_size=3, stride=1, padding=1)

# Move the module to the GPU
conv.cuda()

# All state (weights and buffers), e.g. for checkpointing
conv.state_dict()

# Trainable parameters, e.g. to hand to an optimizer
conv.parameters()

# Recursively visit child modules, e.g. for weight initialization
conv.apply(weight_init)
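The state dict is what gets saved and restored; a minimal checkpointing sketch (the file name is arbitrary):

import torch
import torch.nn as nn

conv = nn.Conv2d(3, 8, kernel_size=3, stride=1, padding=1)

torch.save(conv.state_dict(), "conv_checkpoint.pt")       # save weights and buffers
conv.load_state_dict(torch.load("conv_checkpoint.pt"))    # restore into a matching module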
Examples
MNIST
Preprocessing
Dataloader
Example Network
Optimizer
Training
Examples
MNIST / Preprocessing
import torchvision.transforms as transforms
transform = transforms.Compose(
    [transforms.ToTensor(),
     transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5))])
# Convert to Torch Tensor and perform normalization
# https://pytorch.org/vision/stable/transforms.html
# e.g. ColorJitter, FiveCrop
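To see what the pipeline does, it can be applied to a single dummy image (a random uint8 array standing in for a real PIL image; `transform` is the Compose pipeline above):

import numpy as np

img = np.random.randint(0, 256, (32, 32, 3), dtype=np.uint8)  # HWC uint8 image
x = transform(img)   # ToTensor -> CHW float in [0, 1], Normalize -> roughly [-1, 1]
print(x.shape)       # torch.Size([3, 32, 32])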
Examples
MNIST / Dataloader
import torch
import torchvision
trainset = torchvision.datasets.CIFAR10(
    root='./data', train=True,
    download=True, transform=transform)

# Dataloaders are Python iterators
trainloader = torch.utils.data.DataLoader(
    trainset, batch_size=8,
    shuffle=True, num_workers=2)
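Since dataloaders are iterators, a single batch can be pulled out to inspect shapes (these match the CIFAR-10 setup above):

inputs, labels = next(iter(trainloader))
print(inputs.shape)   # torch.Size([8, 3, 32, 32]) -- batch of 8 RGB 32x32 images
print(labels.shape)   # torch.Size([8])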
Examples
MNIST / Network
import torch.nn as nn
class Net(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(3, 6, 5)
        self.pool = nn.MaxPool2d(2, 2)
        self.conv2 = nn.Conv2d(6, 16, 5)
        self.fc1 = nn.Linear(16 * 5 * 5, 120)
        self.fc2 = nn.Linear(120, 84)
        self.fc3 = nn.Linear(84, 10)
Examples
MNIST / Network
import torch.nn.functional as F
class Net(nn.Module):
    def __init__(self):
        ...

    def forward(self, x):
        x = self.pool(F.relu(self.conv1(x)))
        x = torch.flatten(self.pool(F.relu(self.conv2(x))), 1)  # flatten all dims except batch
        x = F.relu(self.fc1(x))
        x = F.relu(self.fc2(x))
        return self.fc3(x)
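A quick shape check, assuming CIFAR-10-sized inputs (3x32x32), which is what makes the 16 * 5 * 5 input size of fc1 work out:

net = Net()
x = torch.randn(8, 3, 32, 32)   # batch of 8 RGB 32x32 images
# conv1 (5x5) -> 28x28, pool -> 14x14, conv2 (5x5) -> 10x10, pool -> 5x5
print(net(x).shape)             # torch.Size([8, 10]) -- one logit per class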
Examples
MNIST / Optimizer
import torch.optim as optim
# Instantiate nn.Module (Use default weights)
net = Net().to("cuda")
# Define loss function
criterion = nn.CrossEntropyLoss()
# Create optimizer: https://pytorch.org/docs/stable/optim.html
optimizer = optim.SGD(net.parameters(), lr=0.001, momentum=0.9)
Examples
MNIST / Training
net.train() # Set to training mode (there is also `net.eval()`)
for epoch in range(2):
    for inputs, labels in trainloader:
        # zero the parameter gradients
        optimizer.zero_grad()

        # forward + backward + optimize
        outputs = net(inputs.to("cuda"))
        loss = criterion(outputs, labels.to("cuda"))
        loss.backward()
        optimizer.step()
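After training, a matching evaluation loop might look like the sketch below (`testloader` is a hypothetical DataLoader built like trainloader but with train=False):

net.eval()                                 # switch to evaluation mode
correct, total = 0, 0
with torch.no_grad():                      # no gradients needed for evaluation
    for inputs, labels in testloader:
        outputs = net(inputs.to("cuda"))
        preds = outputs.argmax(dim=1)      # predicted class per example
        correct += (preds == labels.to("cuda")).sum().item()
        total += labels.size(0)
print(f"Accuracy: {correct / total:.3f}")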
Examples
MNIST / Recap
... transforms.Compose( ... # Define preprocessing transforms
... torch.utils.data.DataLoader( ... # Create Dataloader
... class Net(nn.Module): ... # Define Network
... criterion = nn.CrossEntropyLoss() ... # Define loss function
... optim.SGD(net.parameters(), ... # Create Optimizer
... for inputs, labels in trainloader: ... # Iterate over Dataloader
... outputs = net(inputs) # Forward Pass
... criterion(outputs, labels) ... # Compute Loss
... optimizer.zero_grad() ... # Zero out gradients
... loss.backward() ... # Back Propagate
... optimizer.step() ... # Update weights
Beyond PyTorch
Tools / Keep Track of experiments, artifacts
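As one widely used example of experiment tracking, PyTorch ships a TensorBoard writer (requires the tensorboard package); a minimal logging sketch, where the loss values and log directory are stand-ins:

from torch.utils.tensorboard import SummaryWriter

writer = SummaryWriter(log_dir="./runs/example")       # hypothetical log directory
for step, loss_value in enumerate([0.9, 0.7, 0.5]):    # stand-ins for real training losses
    writer.add_scalar("train/loss", loss_value, step)
writer.close()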
Beyond PyTorch
High Level Libraries / Distributed & Mixed Precision Training
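For mixed precision specifically, core PyTorch already provides the underlying tooling (torch.cuda.amp), which the high-level libraries wrap. A hedged sketch reusing net, criterion, optimizer, and trainloader from the MNIST example:

from torch.cuda.amp import autocast, GradScaler

scaler = GradScaler()                            # scales the loss to avoid fp16 gradient underflow
for inputs, labels in trainloader:
    optimizer.zero_grad()
    with autocast():                             # run the forward pass in mixed precision
        outputs = net(inputs.to("cuda"))
        loss = criterion(outputs, labels.to("cuda"))
    scaler.scale(loss).backward()                # backward on the scaled loss
    scaler.step(optimizer)                       # unscales gradients, then calls optimizer.step()
    scaler.update()                              # adjusts the scale factor for the next iteration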
Beyond PyTorch
Domain Specific Libraries / Graph, RL, Probabilistic Programming