0% found this document useful (0 votes)

4 views2 pages

Project Documentation

The project documentation outlines the approach taken for data collection, labeling, and model architecture in a Match Prediction project, utilizing synthetic data and unsupervised learning. Key challenges included a low response count and overfitting in the model, which was addressed through techniques like L2 regularization and dropout layers. The documentation also details the process for saving the model and measuring inference time using an inference script with Python's argparse module.

Uploaded by

dharavathsridharnayak745

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views2 pages

Project Documentation

Uploaded by

dharavathsridharnayak745

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Project Documentation

1. Data Collection:
Approach Used:

For our project, Match Prediction, we chose to collect synthetic data. This means we created our own dataset
instead of using existing real-world data. Even though it's synthetic, we designed it to reflect real scenarios as closely
as possible.

To do this, we created a Google Form with multiple-choice questions related to match outcomes. The questions were
based on what users might actually input in a real system, and the answer options matched the kind of labels we
wanted the model to learn from.

Challenges Faced:

• Low Response Count:

We were expecting around 300 to 500 responses, which would have given us more data to train the model.
However, we only received about 140 responses.

• Less Training Data:

Because of the smaller dataset, we had to be more careful with how we used the data. We focused on
keeping the questions clear and the labels balanced so that the model could still learn effectively.

2. Data Labelling:
For labelling the data in our Match Prediction project, we used an unsupervised learning approach.

Why Unsupervised Learning?

We chose unsupervised learning because it allowed us to automatically find patterns or groupings in the data
without needing manually written rules. In our case, a rule-based method wouldn’t be effective since it depends on
fixed, human-defined logic—which can be limiting and may not work well with the kind of data we collected.

3. Label Encoding:
To convert our labels into numbers that the model can understand, we used Label Encoder.

Why Label Encoder?

Since we had a small number of data points, using Label Encoder was the simplest and most efficient choice. It
assigns a unique number to each label, which works well when the dataset is small and the labels are not too
complex.

If we had a larger dataset (like nominal or ordinal data), we would have considered using other methods like One-Hot
Encoding or Ordinal Encoding depending on the label type.

4. Model Architecture:
We used an Artificial Neural Network (ANN) for our model architecture.

Initial Design

• The model started with Input layer, 2 hidden layers and output layer

• Activation functions and dense layers were used for basic learning.
• However, due to our limited and unbalanced dataset, we quickly faced overfitting.

How We Improved It

To fix these issues, we made several improvements:

• L2 Regularization: To reduce overfitting by penalizing large weights.

• Kernel Initializer: Helped in better weight initialization to stabilize learning.

• Batch Normalization: Improved training speed and stability.

• Dropout Layers: Randomly dropped neurons during training to prevent the model from becoming too
dependent on specific paths.

5. Saving the Model:

After training the model, it was important to save the best version for future use—especially for making predictions
later.

We used callbacks during training, the Model Checkpoint callback from Keras. This helped us automatically save the
model whenever it performed better on the validation data.

Instead of just saving the last model (which might not be the best), we set the callback to monitor validation
accuracy—so the model with the highest validation accuracy was saved.

6. Inference Script:
The main goal of inference script is to measure how much time a pre-trained model takes to make predictions on
new input data.

• We used Python’s argparse module to allow users to pass arguments from the terminal. First, we created
the parser object (parser = argparse.ArgumentParser() ).
• Then we added the arguments –weigths_path (path to the saved model file), --data_path (path to the data
file) and –num_preds (number of predictions to make).
• For each argument includes required = True (Makes the argument mandatory), type (Specifies the data type
(e.g., str, int)), default is given for only the data path and help (Describes what the argument is for).
• We used the following commands to load the saved model (model =
tensorflow.keras.models.load_model(weights_path) ).
• To measure how long the model takes to generate predictions, we used python ‘s built in time module
(import time
start = time.time()
predictions = model.predict(data)
end = time.time()
print(f"Prediction time: {end - start:.4f} seconds") ).

NN From Scratch PDF 1735495327
No ratings yet
NN From Scratch PDF 1735495327
19 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
13 pages
NNProject t2
No ratings yet
NNProject t2
9 pages
Deep Learning Workshop Session 2
No ratings yet
Deep Learning Workshop Session 2
4 pages
Report
No ratings yet
Report
14 pages
Project Documentation
No ratings yet
Project Documentation
24 pages
Exercise Classification
No ratings yet
Exercise Classification
8 pages
Lab 4
No ratings yet
Lab 4
4 pages
Weekly Activity 6
No ratings yet
Weekly Activity 6
5 pages
Deep Learning For Vision Lab Manual 2024
100% (1)
Deep Learning For Vision Lab Manual 2024
25 pages
Deep Learning Model Management Guide
No ratings yet
Deep Learning Model Management Guide
8 pages
Deep Learning Nanodegree Syllabus: Project: Find Donors For Charityml
No ratings yet
Deep Learning Nanodegree Syllabus: Project: Find Donors For Charityml
13 pages
This Python Script Implements A Single
No ratings yet
This Python Script Implements A Single
6 pages
Assignment 3
No ratings yet
Assignment 3
6 pages
Coding Neural Networks-Classification & Regression
No ratings yet
Coding Neural Networks-Classification & Regression
39 pages
Chapter04 - Getting Started With Neural Networks
No ratings yet
Chapter04 - Getting Started With Neural Networks
9 pages
Ex No:1 Implementing A Perceptron Algorithm For Binary Classification Date: Aim
No ratings yet
Ex No:1 Implementing A Perceptron Algorithm For Binary Classification Date: Aim
41 pages
DLT Experiment 2
No ratings yet
DLT Experiment 2
7 pages
Lab 12
No ratings yet
Lab 12
6 pages
A3 Classification and Feature Engineering
No ratings yet
A3 Classification and Feature Engineering
2 pages
Deep Learning Lab Practicals
No ratings yet
Deep Learning Lab Practicals
24 pages
ML Guide: MNIST Digit Classification
No ratings yet
ML Guide: MNIST Digit Classification
98 pages
CS335 Lab6
No ratings yet
CS335 Lab6
7 pages
DL Lab - Merged
No ratings yet
DL Lab - Merged
60 pages
Final DL
No ratings yet
Final DL
26 pages
DLP Lab
No ratings yet
DLP Lab
81 pages
"I C U N N ": Mage Lassification Sing Eural Etworks
No ratings yet
"I C U N N ": Mage Lassification Sing Eural Etworks
15 pages
Assignment 3 DS5620
No ratings yet
Assignment 3 DS5620
11 pages
Week 4 - Lab
No ratings yet
Week 4 - Lab
7 pages
Multi Layer Perceptron Tf2 Code Description
No ratings yet
Multi Layer Perceptron Tf2 Code Description
10 pages
CS 461 - Fall 2021 - Neural Networks - Machine Learning
No ratings yet
CS 461 - Fall 2021 - Neural Networks - Machine Learning
5 pages
Machine Learning HW3 - Image Classification
No ratings yet
Machine Learning HW3 - Image Classification
48 pages
Deep Learning Lab Manual - 23-24
No ratings yet
Deep Learning Lab Manual - 23-24
41 pages
AIML 7 To 11
No ratings yet
AIML 7 To 11
7 pages
Deep Learning and Machine Learning: Lab Explanation
No ratings yet
Deep Learning and Machine Learning: Lab Explanation
34 pages
Report Sentiment Analysis Marcos Matheus
No ratings yet
Report Sentiment Analysis Marcos Matheus
12 pages
Practical: Build and Train A Feedforward Neural Network (MLP)
No ratings yet
Practical: Build and Train A Feedforward Neural Network (MLP)
4 pages
09 Milestone Project 2 Skimlit
No ratings yet
09 Milestone Project 2 Skimlit
32 pages
Deep Learning Assignments
No ratings yet
Deep Learning Assignments
6 pages
DL 3
No ratings yet
DL 3
6 pages
Text Classification - Movie Review - News Wires
No ratings yet
Text Classification - Movie Review - News Wires
5 pages
ASNM Program Explain
No ratings yet
ASNM Program Explain
4 pages
DL Mannual For Reference
No ratings yet
DL Mannual For Reference
58 pages
Classifying Hand-Written Digits Using Neural Network
No ratings yet
Classifying Hand-Written Digits Using Neural Network
21 pages
Machine Learning Guide
No ratings yet
Machine Learning Guide
10 pages
R20!63!20ITC27 Deep Learning Lab Manual (Minor Proj 2) Dr.K.ramu
No ratings yet
R20!63!20ITC27 Deep Learning Lab Manual (Minor Proj 2) Dr.K.ramu
47 pages
Report Week 1 and 2
No ratings yet
Report Week 1 and 2
12 pages
DL Lab1
No ratings yet
DL Lab1
15 pages
DL Lab-Final
No ratings yet
DL Lab-Final
22 pages
Tutorial 4
No ratings yet
Tutorial 4
6 pages
Kirkvik Acit2022
No ratings yet
Kirkvik Acit2022
155 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
10 pages
Neural Networks for Python Beginners
No ratings yet
Neural Networks for Python Beginners
2 pages
Deep Learning
No ratings yet
Deep Learning
25 pages
DL & AI - Lab Manual
No ratings yet
DL & AI - Lab Manual
33 pages
CCC
No ratings yet
CCC
25 pages
Deep Learning
No ratings yet
Deep Learning
30 pages
Machine Learning for Professionals
No ratings yet
Machine Learning for Professionals
26 pages
Shaurya DL File
No ratings yet
Shaurya DL File
75 pages
Smart Prop Presentation
No ratings yet
Smart Prop Presentation
1 page
78 - Qpa 2024 MSC Data Science E-Library Telangana by DR - Durgaprasad
No ratings yet
78 - Qpa 2024 MSC Data Science E-Library Telangana by DR - Durgaprasad
45 pages
Cover Letter
No ratings yet
Cover Letter
1 page
Hand - Gestures Controlled Virtual Mouse by Using Python PPT. SRIDHAR NAYAK
No ratings yet
Hand - Gestures Controlled Virtual Mouse by Using Python PPT. SRIDHAR NAYAK
15 pages
In 7252055
No ratings yet
In 7252055
3 pages
Time Management Sample
No ratings yet
Time Management Sample
1 page
3628527-Data Cleaning
No ratings yet
3628527-Data Cleaning
1 page
3602441-Optimizing Inventory & Pricing For An Electronics Retailer
No ratings yet
3602441-Optimizing Inventory & Pricing For An Electronics Retailer
1 page
Unit II Chap - 2 Notes
No ratings yet
Unit II Chap - 2 Notes
3 pages
50Cc Scooter Ac Ignition System: B G/Y G Y/R BR BR/W B Y BL/W
100% (1)
50Cc Scooter Ac Ignition System: B G/Y G Y/R BR BR/W B Y BL/W
1 page
ST93C46 Data Sheets
No ratings yet
ST93C46 Data Sheets
14 pages
Metal Casting 3
No ratings yet
Metal Casting 3
23 pages
PNB vs. CA 217 Scra 347
100% (1)
PNB vs. CA 217 Scra 347
2 pages
Spray Booth Design English
No ratings yet
Spray Booth Design English
7 pages
Crime Mapping for Police Planning
No ratings yet
Crime Mapping for Police Planning
7 pages
Oracle Exadata Training Extended
No ratings yet
Oracle Exadata Training Extended
3 pages
Management MCQ - Merged (1) - 1
No ratings yet
Management MCQ - Merged (1) - 1
1 page
Lab 3
No ratings yet
Lab 3
16 pages
Quantitative Methods in Procurement
No ratings yet
Quantitative Methods in Procurement
15 pages
LiFePO4 Battery Specs HP-50160282
No ratings yet
LiFePO4 Battery Specs HP-50160282
14 pages
Women's Day - Famous Space Women
No ratings yet
Women's Day - Famous Space Women
2 pages
Labour Welfare Scheme
No ratings yet
Labour Welfare Scheme
20 pages
ACADEMIC CALENDAR 2025 Approved
No ratings yet
ACADEMIC CALENDAR 2025 Approved
2 pages
Working at Heights Verification of Competency RIIWHS204E OHS - Com.au
No ratings yet
Working at Heights Verification of Competency RIIWHS204E OHS - Com.au
4 pages
Geogia Hotel Ghana LTD Vrs Silver Star Auto LTD (J4 34 of 2012) 2012 GHASC 54 (4 December 2012)
No ratings yet
Geogia Hotel Ghana LTD Vrs Silver Star Auto LTD (J4 34 of 2012) 2012 GHASC 54 (4 December 2012)
26 pages
Bucket Bag
100% (1)
Bucket Bag
8 pages
Pari 1
No ratings yet
Pari 1
35 pages
Victoria Adaugo Onyekwere - 8109678605 - 20250102202313
No ratings yet
Victoria Adaugo Onyekwere - 8109678605 - 20250102202313
43 pages
Mos Word 2016 - Core Practice Exam 3 Training
No ratings yet
Mos Word 2016 - Core Practice Exam 3 Training
9 pages
SP 3 D Upgrade Guide
No ratings yet
SP 3 D Upgrade Guide
37 pages
Nature and Scope of Rural Development
No ratings yet
Nature and Scope of Rural Development
59 pages
North Indian Restaurant Financials
No ratings yet
North Indian Restaurant Financials
9 pages
Industrial Ventilation A Manual of Recommended Practice For Operation and Maintenance 2nd Edition Acgih Download
100% (2)
Industrial Ventilation A Manual of Recommended Practice For Operation and Maintenance 2nd Edition Acgih Download
58 pages
Import Java - Util.Scanner Import Java - Text.Decimalformat Public Class Javaapplication4 (
No ratings yet
Import Java - Util.Scanner Import Java - Text.Decimalformat Public Class Javaapplication4 (
1 page
Comer Letter To NARA
No ratings yet
Comer Letter To NARA
3 pages
Consumer Perception Towards Online Grocery Stores, Chennai
No ratings yet
Consumer Perception Towards Online Grocery Stores, Chennai
14 pages
How To Use Nmap - Commands and Tutorial Guide
No ratings yet
How To Use Nmap - Commands and Tutorial Guide
18 pages
SIGVERIF
No ratings yet
SIGVERIF
7 pages

Project Documentation

Uploaded by

Project Documentation

Uploaded by

Project Documentation

• Low Response Count:

• Less Training Data:

Why Unsupervised Learning?

Why Label Encoder?

To fix these issues, we made several improvements:

• L2 Regularization: To reduce overfitting by penalizing large weights.

• Kernel Initializer: Helped in better weight initialization to stabilize learning.

• Batch Normalization: Improved training speed and stability.

5. Saving the Model:

You might also like