Exercise: Handwritten Digit Classification using ANN (MNIST Dataset)
1. Objective
To build, train, and evaluate a feedforward artificial neural network (ANN) that classifies
handwritten digits from the MNIST dataset, using both a manual training loop (GradientTape)
and Keras's high-level API.
2. Tools Required
Python 3.x
TensorFlow
NumPy
Matplotlib
3. Dataset Description
The MNIST dataset contains 70,000 grayscale images of handwritten digits (0 to 9), each of size
28x28 pixels. It is divided into:
Training Set: 60,000 images
Test Set: 10,000 images
4. Summary of Key Concepts
Concept | Description | Role in Project
MNIST Dataset | Handwritten digit images and labels | Provides input images and expected outputs
ANN | Fully connected neural network | Learns patterns in image data to classify digits
Flatten Layer | Reshapes the 28x28 image into a 784-element vector | Prepares image data for dense layers
Dense Layer | Fully connected neural layer | Learns features through weighted connections
ReLU Activation | Applies ReLU non-linearity | Allows the network to learn complex functions
Loss Function (CrossEntropy) | Measures difference between predicted and actual labels | Guides learning by minimizing classification error
Optimizer (Adam) | Optimizes weights using gradients | Adjusts model weights during training
GradientTape | Manual training method | Records operations for backpropagation
Epoch | One full pass over the training data | Repeated passes help refine learning
Accuracy | Performance metric for classification | Measures correct predictions on the test set
Softmax Layer | Converts logits to probabilities | Used during prediction for interpretation
5. Model Building Steps
Step 1: Import Libraries
Explanation: These libraries are needed for building the ANN, processing data, and
visualization.
Question: Why do we import models from keras?
Answer: To use the Sequential model for stacking layers.
Question: What is the purpose of tensorflow.keras in this code?
Answer: tensorflow.keras is a high-level API that allows us to build, train, and evaluate deep
learning models easily. It includes layers, optimizers, and tools for loading datasets like MNIST.
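A minimal import block consistent with the steps below might look like this (the exact aliases are an assumption):

    import tensorflow as tf
    from tensorflow.keras import layers, models   # Dense/Flatten layers and the Sequential builder
    import numpy as np                            # numerical operations on arrays
    import matplotlib.pyplot as plt               # visualizing images and predictions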
Step 2: Load and Normalize the Data
Explanation: Pixel values are scaled from [0, 255] to [0, 1] for faster learning.
Question: What is normalization?
Answer: Scaling features to a standard range, here [0, 1].
Question: Why do we divide the pixel values by 255?
Answer: Pixel values range from 0 to 255. Dividing by 255 normalizes them to a range of 0 to 1,
which speeds up training and helps the model learn better.
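A sketch of this step, assuming the built-in Keras MNIST loader:

    # Load MNIST: 60,000 training and 10,000 test images with integer labels 0-9
    (x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()

    # Scale pixel values from [0, 255] to [0, 1]
    x_train = x_train.astype("float32") / 255.0
    x_test = x_test.astype("float32") / 255.0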
Step 3: Create Training Batches
Explanation: Batches help in efficient training. Shuffling ensures varied input order per epoch.
Question: Why use batching?
Answer: For computational efficiency and stable gradient estimates.
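One possible batching pipeline using tf.data (the batch size of 32 and the shuffle buffer size are assumptions):

    # Wrap the arrays in a tf.data pipeline; reshuffle each epoch, then batch
    train_dataset = tf.data.Dataset.from_tensor_slices((x_train, y_train))
    train_dataset = train_dataset.shuffle(buffer_size=60000).batch(32)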
Step 4: Build the Neural Network Model
Explanation: Sequential layers stack transformations on the input to produce logits.
Question: What is the purpose of the Flatten layer?
Answer: The Flatten layer converts the 2D image (28x28) into a 1D vector (784) so that it can be
passed to the Dense layers.
Question: Why is the last Dense layer's output 10?
Answer: Because there are 10 digit classes (0 to 9), we need 10 output neurons, one producing a
score (logit) for each class.
Question: Why no softmax in the last layer?
Answer: We use raw logits with from_logits=True in the loss function; softmax is applied only at
prediction time.
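A sketch of the network described above; the hidden layer width of 128 is an assumption:

    model = models.Sequential([
        layers.Flatten(input_shape=(28, 28)),   # 28x28 image -> 784-element vector
        layers.Dense(128, activation="relu"),   # hidden layer with ReLU non-linearity
        layers.Dense(10)                        # 10 raw logits, one per digit class (no softmax)
    ])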
Step 5: Define Loss and Optimizer
Explanation: Cross-entropy is suitable for classification; Adam adapts learning rates.
Question: Why SparseCategoricalCrossentropy?
Answer: Because labels are integers (not one-hot encoded).
Question: What does an optimizer do during training?
Answer: The optimizer updates the model's weights using the gradients to reduce the loss and
improve accuracy.
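The corresponding definitions, assuming Adam with its default learning rate:

    # from_logits=True because the model outputs raw logits, not probabilities
    loss_fn = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)
    optimizer = tf.keras.optimizers.Adam()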
Section A: Manual Training Using GradientTape
Step 6: Manual Training Loop
Explanation: Custom loop for educational clarity. Shows step-by-step learning and loss updates.
Question: What does GradientTape() do?
Answer: Records operations to compute gradients.
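A minimal loop matching the summary table below (5 epochs, reporting the average loss per epoch):

    epochs = 5
    for epoch in range(epochs):
        total_loss = 0.0                       # reset so each epoch reports a fresh average
        for x_batch, y_batch in train_dataset:
            with tf.GradientTape() as tape:    # record the forward pass for backpropagation
                logits = model(x_batch, training=True)
                loss = loss_fn(y_batch, logits)
            grads = tape.gradient(loss, model.trainable_variables)
            optimizer.apply_gradients(zip(grads, model.trainable_variables))
            total_loss += loss.numpy()         # convert to a Python float for accumulation
        print(f"Epoch {epoch + 1}, Loss: {total_loss / len(train_dataset):.4f}")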
OUTPUT:
What Do These Values Indicate?
The loss decreases steadily with each epoch:
Epoch 1 (0.2277) → relatively high, as the model starts with random weights.
Epoch 5 (0.0332) → much lower, indicating the model has learned meaningful patterns from the
training data.
This suggests:
The model is training correctly
The optimizer is working
Gradient descent is minimizing the loss
The ANN is learning useful representations of the data
NOTE: After every epoch, the model:
1. Makes predictions.
2. Compares predictions with actual labels.
3. Computes loss (error).
4. Adjusts weights using gradients to minimize the error.
This process, called backpropagation, helps the model improve its accuracy over time.
Summary Table of Questions
Code Line | Question | Answer
epochs = 5 | What is an epoch? | One complete pass over the entire training dataset.
total_loss = 0 | Why reset total_loss each epoch? | To calculate a fresh average loss for the new epoch.
(x_batch, y_batch) | What is a batch? | A subset of training data processed in one step.
GradientTape() | What does it do? | Tracks operations to compute gradients.
training=True | Why set this flag? | To enable training behaviors like dropout.
loss_fn() | What does the loss function do? | Measures prediction error.
tape.gradient() | What are gradients? | Slopes that guide weight updates.
apply_gradients() | Why apply gradients? | To improve model accuracy by updating weights.
loss.numpy() | Why convert loss to NumPy? | To use it in Python arithmetic.
len(train_dataset) | Why divide by this? | To calculate the average loss across all batches.
Step 7: Evaluate the Model Accuracy
Explanation: We compute accuracy by comparing predicted and true labels; argmax picks the class
with the highest logit value.
Question: Why set training=False during evaluation?
Answer: To disable layers like dropout or batch normalization.
Question: How is accuracy calculated?
Answer: Number of correct predictions divided by total test samples.
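A sketch of the accuracy computation over the full test set:

    logits = model(x_test, training=False)           # inference mode: dropout etc. disabled
    preds = tf.argmax(logits, axis=1).numpy()        # predicted class per image
    accuracy = np.mean(preds == y_test)              # correct predictions / total test samples
    print(f"Test accuracy: {accuracy:.4f}")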
Step 8: Make Predictions
Question: Why use argmax?
Answer: To select the predicted class with the highest score.
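Since the model outputs logits, a softmax can be applied at prediction time to obtain interpretable probabilities, e.g.:

    probs = tf.nn.softmax(model(x_test, training=False))   # logits -> class probabilities
    predicted_classes = tf.argmax(probs, axis=1)            # most probable digit per image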
Step 9: Evaluate Individual Predictions
Question: What does argmax(logits, axis=1) do?
Answer: Picks the index of the highest score (predicted class).
Question: Why use plt.imshow(images[i], cmap='gray')?
Answer: To display grayscale MNIST images.
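A sketch that displays one test image with its predicted and true labels (the index 0 is arbitrary):

    i = 0                                             # any test-set index
    logits = model(x_test[i:i + 1], training=False)
    pred = tf.argmax(logits, axis=1).numpy()[0]
    plt.imshow(x_test[i], cmap='gray')                # grayscale MNIST image
    plt.title(f"Predicted: {pred}, Actual: {y_test[i]}")
    plt.show()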
Section B: Training Using High-Level Keras API
Compile and Train the Model
Explanation: A short, readable training method using Keras's high-level API.
Question: When is model.fit() preferred?
Answer: When you don’t need custom training steps.
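The equivalent high-level workflow might look like this (the epoch count and batch size mirror Section A and are assumptions):

    # Assumes a freshly built model as in Step 4, since Section A already trained the other one
    model.compile(optimizer='adam',
                  loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
                  metrics=['accuracy'])
    model.fit(x_train, y_train, epochs=5, batch_size=32)
    model.evaluate(x_test, y_test)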
6. Visualizing Predictions
Explanation: Useful for visual verification of model predictions.
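One possible sketch that plots the first nine test images in a grid with their predicted labels:

    preds = tf.argmax(model(x_test[:9], training=False), axis=1).numpy()
    plt.figure(figsize=(6, 6))
    for i in range(9):
        plt.subplot(3, 3, i + 1)
        plt.imshow(x_test[i], cmap='gray')
        plt.title(f"Pred: {preds[i]}")
        plt.axis('off')
    plt.show()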
7. Real-world Applications
Digit recognition on postal codes (OCR)
Bank cheque processing
Touchscreen handwriting input
8. Conclusion
This exercise demonstrated building, training, and evaluating a simple ANN for digit
classification using both a manual GradientTape training loop and the high-level Keras API.