Thanks to visit codestin.com
Credit goes to Github.com

Skip to content

A deep learning-based Speech Emotion Recognition (SER) model trained primarily on Indian languages. Designed for applications in call centers, sentiment analysis, and accessibility tools.

License

Notifications You must be signed in to change notification settings

aaivu/KuralNet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ”Š KuralNet: Multilingual Speech Emotion Recognition (SER) Model

Version 1.0 | Trained on 13 Languages & 14 Datasets

Multilingual Speech Emotion Recognition model primarily focused on South Indian languages, designed to detect emotions from speech, enhancing call centers, sentiment analysis, and accessibility tools.

License Version Paper Hugging Face PyPI Docker

πŸš€ Key Features

  • Emotion Detection: Capable of detecting emotions from speech in multiple Indian languages.
  • Use Cases: Call centers, sentiment analysis, and accessibility tools.
  • Optimized Performance: Designed for real-time emotion analysis.

🎯 Purpose

This model aims to enhance user experiences by detecting emotions from speech across multilingual datasets. The focus is to apply it in industries like customer service, where emotional tone plays a crucial role.

🧠 Model Architecture

Model uses a dual-branch approach: a Whisper branch for contextual embeddings and a traditional feature branch (MFCC, etc.) for acoustic details. An AttentionFusion module then dynamically combines these features using cross-attention. Finally, classification layers predict the emotion.

In essence: Raw Audio -> (Whisper + Traditional Features) -> AttentionFusion -> Emotion Prediction.

Model

πŸ“ˆ Performance Overview

Radar Chart by Language

Radar Chart by Language

πŸ“Š Detailed Performance Table

Detailed Table

πŸ“œ Citation

If you are using this model or research findings, please cite the following paper:

@article{placeholder2024,
  author    = {Author(s)},
  title     = {Paper Title},
  journal   = {Conference/Journal},
  year      = {2024},
  volume    = {X},
  number    = {Y},
  pages     = {ZZ-ZZ},
  doi       = {10.XXXX/placeholder},
}

πŸ“¬ Contact

🏷️ Name πŸ“§ Email πŸ”— LinkedIn πŸ“š Google Scholar
Luxshan Thavarasa [email protected] LinkedIn Google Scholar
Jubeerathan Thevakumar [email protected] LinkedIn Google Scholar
Thanikan Sivatheepan [email protected] LinkedIn Google Scholar
Uthayasanker Thayasivam [email protected] LinkedIn Google Scholar

πŸ™ Acknowledgment

We would like to thank Dr. Uthayasanker Thayasivam for his guidance as my supervisor, Braveenan Sritharan for his mentorship, and all the dataset owners for making their datasets available for us through open access or upon request. Your support has been invaluable.

About

A deep learning-based Speech Emotion Recognition (SER) model trained primarily on Indian languages. Designed for applications in call centers, sentiment analysis, and accessibility tools.

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •