Form No: Prj_S 02
Date: 20.01.2025
QIS College of Engineering and Technology
(Autonomous)
Project Summary Report
Department: CSE - DS Section: 2
Project Domain: Machine Learning & Deep Learning
Functional Domain:
Mentor Name: Dr. Y. Sowjanya Kumari Batch Number: 12
Name: N. Satya Sai Umesh Chandra Roll No: 21491A4490
Finalized Title: Speech Emotion Recognition System
Abstract/Summary: In the realm of human-machine interface applications, emotion recognition
from speech signals has been a research focus for several years. Emotions play an essential role in
human communication and expression, making their recognition crucial for applications involving
human-computer interaction. This project explores Speech Emotion Recognition (SER), aiming to
classify emotional states from speech signals. The proposed system uses Mel-frequency cepstral
coefficients (MFCC), Chromagram, Mel-scaled spectrogram, spectral contrast, and tonal centroid
features to analyze audio inputs. A Deep Neural Network (DNN) is employed for classification,
trained using the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS). This
approach aims to address the limitations of traditional methods, improving accuracy while reducing
computational complexity.
Existing Method: Traditional models for emotion recognition primarily relied on machine learning
algorithms such as Support Vector Machines (SVM) and K-Nearest Neighbors (KNN). These models
demonstrated limited accuracy and were computationally intensive. Although deep learning models
have been explored in the past, they typically required extensive datasets and high-performance
hardware, which posed challenges in terms of scalability and practical implementation.
Proposed Method: The proposed approach introduces a deep neural network-based framework for
emotion recognition, beginning with audio pre-processing to remove noise and enhance quality. Key
features such as MFCC, Chromagram, Mel-scaled spectrogram, spectral contrast, and tonal centroid
are extracted to capture essential timbral, tonal, and harmonic information from the audio. A custom-
designed deep neural network with multiple hidden layers and ReLU activation functions is trained
using the Adam optimizer and Categorical Cross-Entropy loss function for multi-class classification.
The RAVDESS dataset ensures balanced representation of emotional classes, and regularization
techniques like dropout prevent overfitting. Model performance is evaluated using metrics such as
accuracy, precision, recall, and F1-score, making the system effective and efficient for real-world
applications.
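As an illustration of the evaluation step described above, the sketch below computes accuracy, precision, recall, and F1-score with scikit-learn; the names model, X_test, and y_test are placeholders for the trained network and a held-out test split, not the project's actual code.

    import numpy as np
    from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

    # model, X_test, y_test are placeholders: a trained classifier and a
    # held-out test split with one-hot encoded emotion labels.
    y_prob = model.predict(X_test)            # class probabilities per sample
    y_pred = np.argmax(y_prob, axis=1)        # predicted emotion indices
    y_true = np.argmax(y_test, axis=1)        # ground-truth emotion indices

    print("Accuracy :", accuracy_score(y_true, y_pred))
    print("Precision:", precision_score(y_true, y_pred, average='macro'))
    print("Recall   :", recall_score(y_true, y_pred, average='macro'))
    print("F1-score :", f1_score(y_true, y_pred, average='macro'))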
Technique(s) Used:
Feature extraction:
Mel-frequency cepstral coefficients (MFCC): Captures the timbral aspects of speech.
Chromagram: Identifies tonal content and harmonic structure.
Mel-scaled spectrogram: Provides a visual representation of frequencies over time.
Spectral contrast: Differentiates between peaks and valleys in the spectrum.
Tonal centroid: Represents tonal information in a compact form.
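A minimal sketch of how these five features could be extracted with Librosa (the audio library listed under Technology Used); the function name and the 40-coefficient MFCC setting are illustrative assumptions, not the project's exact configuration.

    import numpy as np
    import librosa

    def extract_features(path):
        y, sr = librosa.load(path, sr=None)        # keep the native sample rate
        stft = np.abs(librosa.stft(y))
        mfcc = np.mean(librosa.feature.mfcc(y=y, sr=sr, n_mfcc=40).T, axis=0)
        chroma = np.mean(librosa.feature.chroma_stft(S=stft, sr=sr).T, axis=0)
        mel = np.mean(librosa.feature.melspectrogram(y=y, sr=sr).T, axis=0)
        contrast = np.mean(librosa.feature.spectral_contrast(S=stft, sr=sr).T, axis=0)
        tonnetz = np.mean(librosa.feature.tonnetz(y=librosa.effects.harmonic(y), sr=sr).T, axis=0)
        # Time-averaged features concatenated into one vector (40 + 12 + 128 + 7 + 6 = 193 values)
        return np.hstack([mfcc, chroma, mel, contrast, tonnetz])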
Classification:
Architecture: Multi-layer perceptrons with activation functions such as ReLU.
Optimization: Stochastic Gradient Descent (SGD) or Adam optimizer for efficient
training.
Loss Function: Categorical Cross-Entropy for multi-class classification.
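The sketch below shows one way the described network could be defined in Keras with ReLU hidden layers, dropout, the Adam optimizer, and Categorical Cross-Entropy; the layer sizes, dropout rate, and 193-dimensional input are assumptions for illustration, not the project's exact architecture.

    from tensorflow.keras import Input
    from tensorflow.keras.models import Sequential
    from tensorflow.keras.layers import Dense, Dropout

    def build_model(input_dim=193, num_classes=8):
        # num_classes=8 assumes the eight RAVDESS emotion categories (including neutral)
        model = Sequential([
            Input(shape=(input_dim,)),
            Dense(256, activation='relu'),
            Dropout(0.3),                     # regularization against overfitting
            Dense(128, activation='relu'),
            Dropout(0.3),
            Dense(num_classes, activation='softmax'),
        ])
        model.compile(optimizer='adam',
                      loss='categorical_crossentropy',
                      metrics=['accuracy'])
        return model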
Technology Used:
Programming language: Python (v3.6+): For scripting and implementing algorithms.
Deep learning libraries: TensorFlow and Keras: For designing and training the neural network.
Audio Processing Libraries:
Librosa: For extracting audio features and preprocessing.
NumPy and Pandas: For data manipulation and analysis.
Visualization Tools: Matplotlib and Seaborn: For plotting audio features and model
performance.
Development Environment: Jupyter Notebook or PyCharm for code development and testing.
Data Sets used:
Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS):
The RAVDESS dataset is a validated multimodal dataset consisting of 24 professional
actors (12 male, 12 female) vocalizing two lexically-matched statements in a neutral
North American accent.
The dataset includes emotional expressions such as calm, happy, sad, angry, fearful,
surprise, and disgust in both speech and song modalities.
Each expression is available at two intensity levels and is balanced in terms of gender
distribution.
It is widely used in emotion recognition research for its clarity, diversity, and balanced
representation.
Format: WAV files with a 48 kHz sample rate.
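For reference, RAVDESS encodes the emotion as the third hyphen-separated field of each file name (per the dataset's documentation); the helper below is an illustrative sketch of turning a file name into an emotion label, not code from the project itself.

    # Emotion codes as documented for RAVDESS file names, e.g.
    # "03-01-06-01-02-01-12.wav" -> third field "06" -> "fearful".
    EMOTIONS = {
        '01': 'neutral', '02': 'calm', '03': 'happy', '04': 'sad',
        '05': 'angry', '06': 'fearful', '07': 'disgust', '08': 'surprised',
    }

    def emotion_from_filename(filename):
        code = filename.split('-')[2]
        return EMOTIONS[code]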
Mentor HoD CSCD Examiner