Problem Statement:
The objective of this project is to identify the correct audio device from a set of audio devices. The dataset
consists of train, development, and evaluation samples. Each sample can have between 2 and 5 possible device options.
The task is to develop a multi-class classifier (or any other suitable model) that can predict the correct device
selection.
Dataset:
The dataset is divided into train, development (dev), and evaluation (eval) sets. The input data comprises audio
recordings from all the devices. To simplify the processing, I will provide pre-extracted features instead of audio
files. There are two types of features available:
1. Single Feature Vector: Each device in each training sample is associated with a 640-dimensional feature
vector. The exact dimensionality may differ from 640, but it is the same for all samples.
2. Time-Series Feature: Alternatively, we can provide a time-series feature matrix for each device in each
training sample. This matrix has dimensions of (feature dimension x time stamps).
The features for all samples will be of the same dimension. To handle scenarios where the number of devices is
less than the maximum of 5, non-existing devices will be represented as zeros or very small values (e.g., 1e-8) in
the feature representation.
Target Device/Class: For each sample in the dataset, the target device for selection will be provided as ground-truth
information. The target device value lies between 0 and 4, representing the device options.
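As a concrete illustration of the single-feature-vector layout and the padding rule described above, here is a minimal sketch. The shapes follow the description (up to 5 devices, 640-dimensional vectors, padding with 1e-8), but the function and variable names are my own and not part of the provided data format:

```python
import numpy as np

MAX_DEVICES = 5   # maximum device options per sample
FEAT_DIM = 640    # per-device feature dimension (from the description)
PAD_VALUE = 1e-8  # value used for non-existing device slots

def pad_sample(device_feats):
    """Stack per-device feature vectors into a fixed (5, 640) array,
    filling missing device slots with a very small constant, and
    return a boolean mask marking which slots hold real devices."""
    n = len(device_feats)
    out = np.full((MAX_DEVICES, FEAT_DIM), PAD_VALUE, dtype=np.float32)
    out[:n] = np.asarray(device_feats, dtype=np.float32)
    mask = np.zeros(MAX_DEVICES, dtype=bool)
    mask[:n] = True  # True where a real device is present
    return out, mask

# Example: a sample with only 3 real devices.
feats, mask = pad_sample([np.random.randn(FEAT_DIM) for _ in range(3)])
print(feats.shape, mask)  # (5, 640) [ True  True  True False False]
```

Keeping the mask alongside the padded features lets a downstream model ignore the non-existing device slots explicitly rather than relying on the near-zero values alone.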
Data Details:
• Train Samples: A dataset containing 120,000+ samples for training the model will be provided.
• Dev Samples: A dataset containing 6,000+ samples will be available for fine-tuning the models.
• Eval Samples: Two evaluation sets will be shared. The easy set comprises 7,000+ samples, while the
difficult set comprises 2,000+ samples.
PyTorch Model:
To achieve the desired accuracy of at least 70% on the evaluation sets, please employ advanced models in
PyTorch, such as recurrent neural networks (RNNs) and transformer models, which have demonstrated success
in audio and sequence classification tasks. These models offer the potential for improved performance compared
to plain CNNs and DNNs. You are welcome to try whatever works best for the dataset; I will leave this to
your expertise.
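One possible architecture along these lines, sketched as a starting point rather than a required design (the class name and hyperparameters are illustrative): treat the five device feature vectors as a length-5 sequence, encode it with a small transformer encoder, and emit one score per device slot, so the prediction is a 5-way choice over slots 0-4.

```python
import torch
import torch.nn as nn

class DeviceSelector(nn.Module):
    """Scores each of the 5 device slots; the highest-scoring slot
    is the predicted target device (class 0-4)."""
    def __init__(self, feat_dim=640, d_model=128, n_heads=4, n_layers=2):
        super().__init__()
        self.proj = nn.Linear(feat_dim, d_model)  # per-device embedding
        layer = nn.TransformerEncoderLayer(
            d_model, n_heads, dim_feedforward=256, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)
        self.score = nn.Linear(d_model, 1)  # one logit per device slot

    def forward(self, x, pad_mask=None):
        # x: (batch, 5, feat_dim); pad_mask: (batch, 5), True = padded slot
        h = self.encoder(self.proj(x), src_key_padding_mask=pad_mask)
        logits = self.score(h).squeeze(-1)  # (batch, 5)
        if pad_mask is not None:
            # Padded slots can never be the predicted device.
            logits = logits.masked_fill(pad_mask, float("-inf"))
        return logits

model = DeviceSelector()
x = torch.randn(8, 5, 640)          # a dummy batch of 8 samples
logits = model(x)
print(logits.shape)                  # torch.Size([8, 5])
```

Training would then apply `nn.CrossEntropyLoss` to these 5-way logits against the 0-4 target labels; for the time-series feature variant, the per-device matrix would first be pooled or encoded down to a single vector before the same slot-scoring step.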
Deliverables:
Upon completion, the following deliverables are expected:
1. Trained Model: The trained PyTorch model script and weights, capable of audio device selection.
2. Decoding Scripts: Scripts to decode the model's predictions and map them to the corresponding
audio devices.
3. Additional Scripts and Insights: Scripts that generate insights, such as plots, histograms,
correlations, and any other relevant analysis, to support your report and provide a deeper understanding of the
model's performance.
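For the decoding scripts, the core step is an argmax over the per-slot scores followed by a lookup into a class-to-device table. A minimal sketch (the device-name mapping here is made up for illustration; the real mapping would accompany the dataset):

```python
import numpy as np

# Hypothetical mapping from class index (0-4) to a device label.
DEVICE_NAMES = {0: "device_0", 1: "device_1", 2: "device_2",
                3: "device_3", 4: "device_4"}

def decode(logits):
    """Map a (batch, 5) array of model scores to device labels."""
    idx = np.argmax(logits, axis=-1)
    return [DEVICE_NAMES[int(i)] for i in idx]

preds = decode(np.array([[0.1, 2.3, -1.0, 0.0, 0.5],
                         [1.5, 0.2,  0.3, 0.1, 0.0]]))
print(preds)  # ['device_1', 'device_0']
```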