
GLOBAL ACADEMY OF TECHNOLOGY
Rajarajeshwari Nagar, Bengaluru - 560098
Department of Electronics and Communication Engineering

Major Project Presentation (21ECP83)
on
"Real-Time Sign Language Translation using CNN-LSTM Hybrid Architecture for Spatial-Temporal Gesture Synthesis and Grammar Refinement"

PRESENTED BY:
C Saanvi        1GA21EC028
Chidaksh Babu   1GA21EC032
Varsha P V      1GA21EC165

Under the Guidance of
Mrs. Shubha G.N
Assistant Professor
Dept. of ECE, GAT
CONTENTS
1. INTRODUCTION
2. LITERATURE SURVEY
3. PROBLEM STATEMENT
4. OBJECTIVES
5. IMPLEMENTATION
6. RESULTS AND DISCUSSION
7. ADVANTAGES/DISADVANTAGES
8. APPLICATIONS
9. CONCLUSION
10. FUTURE SCOPE
11. REFERENCES
INTRODUCTION
Sign Language (SL) serves as a vital communication method for individuals with hearing or speech impairments. However, a major barrier arises when they communicate with non-signers. To bridge this gap, automated Sign Language Translation (SLT) systems have gained research attention in recent years.
These systems aim to translate hand gestures into text, enabling effective
interaction between signers and non-signers. Deep learning models, especially
CNNs and LSTMs, have shown great potential in recognizing both spatial and
temporal aspects of hand movements.
This project adopts a real-time SLT approach using computer vision and deep
learning, focusing on accurate gesture recognition and grammatically correct output
to enhance inclusivity and communication.
LITERATURE SURVEY
The reviewed works on sign language recognition and translation are listed in the References section ([1]-[10]).
PROBLEM STATEMENT
There is a major communication gap between sign language users
and non-signers. Existing solutions struggle with real-time gesture
recognition, lack accuracy in dynamic hand movements, and don’t
offer grammar correction. This project aims to build a real-time sign
language translator using deep learning (CNN-LSTM), MediaPipe for
hand tracking, and grammar correction to bridge this gap and
improve accessibility.
OBJECTIVES
To design a real-time sign language translation system using deep
learning.
To accurately detect and classify dynamic hand gestures using
MediaPipe and LSTM.
To convert recognized gestures into grammatically correct sentences
using NLP tools.
To enhance communication accessibility for the hearing and speech-
impaired community.
IMPLEMENTATION
Video Frame Capture:
Real-time video is captured using a webcam with OpenCV. Frames are
continuously extracted from the video stream to monitor hand movements as
they happen.
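A minimal sketch of this capture loop, assuming the default webcam (index 0) and the standard OpenCV API; the window name and exit key are illustrative choices, not taken from the original code:

import cv2

cap = cv2.VideoCapture(0)                    # open the default webcam
while cap.isOpened():
    ok, frame = cap.read()                   # grab one BGR frame from the stream
    if not ok:
        break
    cv2.imshow("Sign Language Translator", frame)
    if cv2.waitKey(1) & 0xFF == ord('q'):    # press 'q' to stop capturing
        break
cap.release()
cv2.destroyAllWindows()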
Hand Landmark Detection:
MediaPipe Holistic is used to detect 21 key 3D landmarks on the hand in each
frame. These points represent the shape and orientation of the hand, forming
the core input for gesture recognition.
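An illustrative way to pull the 21 (x, y, z) landmarks per hand with MediaPipe Holistic; the function name and the zero-filled fallback for frames without a visible hand are assumptions for this sketch:

import cv2
import mediapipe as mp

holistic = mp.solutions.holistic.Holistic(min_detection_confidence=0.5,
                                          min_tracking_confidence=0.5)

def extract_hand_landmarks(frame_bgr):
    # MediaPipe expects RGB input, while OpenCV frames are BGR
    results = holistic.process(cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2RGB))
    hand = results.right_hand_landmarks       # 21 landmarks when a hand is detected
    if hand is None:
        return [0.0] * 63                     # placeholder frame when no hand is visible
    return [c for lm in hand.landmark for c in (lm.x, lm.y, lm.z)]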
Feature Preprocessing:
The extracted landmarks are normalized to reduce variation due to hand size or
camera angle. Input sequences are padded to a uniform length, and
augmentation techniques like flipping and rotation are used to improve model
robustness.
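A sketch of these preprocessing steps; the sequence length, wrist-relative normalization and mirror-flip augmentation shown here are plausible choices, not necessarily the exact ones used in the project:

import numpy as np

SEQ_LEN, NUM_FEATURES = 30, 63               # assumed frames per gesture and 21 x 3 landmark values

def normalize(seq):
    # shift landmarks so the wrist (landmark 0) is the origin, then scale to unit range
    seq = np.asarray(seq, dtype=np.float32).reshape(-1, 21, 3)
    seq -= seq[:, :1, :]
    return (seq / max(np.abs(seq).max(), 1e-6)).reshape(-1, NUM_FEATURES)

def pad(seq):
    # pad (or truncate) a variable-length sequence to a fixed number of timesteps
    out = np.zeros((SEQ_LEN, NUM_FEATURES), dtype=np.float32)
    out[:min(len(seq), SEQ_LEN)] = seq[:SEQ_LEN]
    return out

def mirror_flip(seq):
    # simple augmentation: flip the x coordinate of every landmark
    seq = np.asarray(seq).reshape(-1, 21, 3).copy()
    seq[..., 0] *= -1.0
    return seq.reshape(-1, NUM_FEATURES)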
Model Training (LSTM):
A deep learning model based on stacked LSTM layers is trained on labeled
gesture sequences. The model captures temporal patterns in the hand
movements and learns to associate them with specific gestures. It is optimized
using categorical cross-entropy loss and the Adam optimizer.
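A minimal Keras sketch of such a stacked-LSTM classifier; the layer widths, dropout rate and number of classes are assumptions, while the loss and optimizer match those named above:

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dense, Dropout

NUM_CLASSES = 10                                            # assumed number of gesture classes

model = Sequential([
    LSTM(64, return_sequences=True, input_shape=(30, 63)),  # 30 frames x 63 landmark features
    LSTM(128, return_sequences=True),
    LSTM(64),                                               # final LSTM layer returns one vector
    Dropout(0.3),
    Dense(64, activation="relu"),
    Dense(NUM_CLASSES, activation="softmax"),
])
model.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])
# model.fit(X_train, y_train, epochs=200, validation_data=(X_val, y_val))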
Gesture Classification:
In real-time, the preprocessed input is passed through the trained LSTM model. It
classifies the gesture by predicting the most likely class based on the sequence
of hand movements.
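An illustrative inference helper built on the model sketched above; the confidence threshold and label list are placeholders:

import numpy as np

def classify(model, sequence, labels, threshold=0.7):
    # sequence: one padded (SEQ_LEN, NUM_FEATURES) array of landmark features
    probs = model.predict(np.expand_dims(sequence, axis=0), verbose=0)[0]
    best = int(np.argmax(probs))
    return labels[best] if probs[best] >= threshold else None   # None = low-confidence prediction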
Grammar Correction:
The output from gesture classification is refined using language_tool_python. This
module corrects grammar, ensuring that the translated sentence is clear, correct,
and easy to understand.
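A small sketch of this refinement step with language_tool_python; the example sentence is illustrative:

import language_tool_python

tool = language_tool_python.LanguageTool("en-US")

raw_text = "me go school tomorrow"        # example word-by-word gesture output
corrected = tool.correct(raw_text)        # apply LanguageTool's suggested corrections
print(corrected)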
Real-Time Display:
The final, grammatically correct output is displayed on a user-friendly interface,
allowing seamless and accessible communication between sign language users
and non-signers.
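One possible way to overlay the corrected sentence on the live feed with OpenCV; the banner size, font and colours are arbitrary choices for this sketch:

import cv2

def draw_translation(frame, sentence):
    # dark banner across the top of the frame, then the translated sentence in white
    cv2.rectangle(frame, (0, 0), (frame.shape[1], 40), (0, 0, 0), -1)
    cv2.putText(frame, sentence, (10, 28), cv2.FONT_HERSHEY_SIMPLEX,
                0.8, (255, 255, 255), 2, cv2.LINE_AA)
    return frame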
RESULTS AND DISCUSSION
Achieved 76.42% accuracy in real-time gesture recognition.
Precision, Recall, F1-Score: All at 0.76, showing balanced
performance.
Outperforms traditional methods by capturing temporal
patterns using LSTM.
Works smoothly on both CPU and GPU.
Handles single-hand gestures with good reliability.
ADVANTAGES/DISADVANTAGES
Advantages:
Real-time sign language translation
High accuracy with spatial-temporal modeling (LSTM)
Grammar correction for meaningful sentences
Compatible with both CPU and GPU setups
User-friendly interface for easy interaction

Disadvantages:
Accuracy depends on lighting and camera quality
Requires a large, diverse dataset for better generalization
Not yet integrated with speech output
May lag on low-resource devices during training
APPLICATIONS
Communication for the Hearing and Speech Impaired: The system
translates sign language into text or speech, enabling real-time
communication for those with hearing or speech impairments. This helps
bridge the gap between them and the general public.
Integration with Smart Devices: Sign language recognition can be
integrated into smart devices like smartphones or wearables. It allows
users to control these devices through hand gestures.
Educational Tools: The system can be used in apps to teach and help
practice sign language. It provides real-time feedback, making learning
more interactive and accessible.
Healthcare Applications: In medical environments, this system helps
healthcare providers communicate with deaf or hard-of-hearing patients. It
ensures accurate communication for proper care and treatment.
Real-Time Translation for Accessibility: Sign language recognition can be
used at public events to translate sign language into text or speech. This
makes events accessible to people who are deaf or hard of hearing.
CONCLUSION
Sign language recognition systems built on deep learning
architectures such as the CNN-LSTM hybrid have immense potential
to enhance communication and accessibility. These systems
can bridge gaps for the hearing and speech impaired, integrate
with smart technologies, and improve educational tools. They
also hold promise in healthcare settings and public events,
ensuring inclusivity and real-time interaction for those who rely
on sign language. As technology advances, such systems will
continue to make significant contributions to society,
empowering individuals and fostering better communication
across various domains.
FUTURE SCOPE
Support for Multiple Sign Languages: Expanding the system to
recognize various sign languages globally, making it more inclusive.
Integration with Wearable Devices and AR: Utilizing smart wearables
or augmented reality for more intuitive sign language
communication.
Sign Language to Speech Translation: Converting sign language into
speech in real-time, allowing seamless communication with non-
signers.
Real-Time Multilingual Translation: Enabling real-time translation of
sign language into multiple spoken languages for broader
accessibility.
Enhanced Accuracy and Speed: Improving recognition accuracy and
processing speed with advanced machine learning, making systems
more reliable and efficient.
REFERENCES
[1] D. Guo, W. Zhou, H. Li, and M. Wang, "Hierarchical LSTM for sign language translation," Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1, pp. 1–8, Apr. 2018.
[2] Q. Xiao, X. Chang, X. Zhang, and X. Liu, "Multi-information spatial–temporal LSTM fusion continuous sign language neural machine translation," IEEE Access, vol. 8, pp. 216718–216728, 2020, doi: 10.1109/ACCESS.2020.3039539.
[3] D. A. Kumar, A. S. C. S. Sastry, P. V. V. Kishore, E. K. Kumar, and M. T. K. Kumar, "S3DRGF: Spatial 3-D relational geometric features for 3-D sign language representation and recognition," IEEE Signal Processing Letters, vol. 26, no. 1, pp. 169–173, Jan. 2019, doi: 10.1109/LSP.2018.2883864.
[4] M. Sultana, J. Thomas, S. Thomas, M. SA, and S. L. S, "Design and development of teaching and learning tool using sign language translator to enhance the learning skills for students with hearing and verbal impairment," in Proc. 2nd Int. Conf. Emerging Trends Inf. Technol. Eng. (ICETITE), Vellore, India, 2024, pp. 1–5, doi: 10.1109/ic-ETITE58242.2024.10493342.
[5] M. Ahmed, M. Idrees, Z. ul Abideen, R. Mumtaz, and S. Khalique, "Deaf talk using 3D animated sign language: A sign language interpreter using Microsoft's Kinect v2," in Proc. 2016 SAI Computing Conf. (SAI), London, UK, 2016, pp. 330–335, doi: 10.1109/SAI.2016.7556002.
[6] O. Koller, N. C. Camgoz, H. Ney, and R. Bowden, "Weakly supervised learning with multi-stream CNN-LSTM-HMMs to discover sequential parallelism in sign language videos," IEEE Trans. Pattern Anal. Mach. Intell., vol. 42, no. 9, pp. 2306–2320, Sept. 2020, doi: 10.1109/TPAMI.2019.2911077.
[7] Y. Liao, P. Xiong, W. Min, W. Min, and J. Lu, "Dynamic sign language recognition based on video sequence with BLSTM-3D residual networks," IEEE Access, vol. 7, pp. 38044–38054, 2019, doi: 10.1109/ACCESS.2019.2904749.
[8] I. Papastratis, K. Dimitropoulos, D. Konstantinidis, and P. Daras, "Continuous sign language recognition through cross-modal alignment of video and text embeddings in a joint-latent space," IEEE Access, vol.
[9] Z. Liu et al., "Improving end-to-end sign language translation with adaptive video representation enhanced transformer," IEEE Transactions on Circuits and Systems for Video Technology, vol. 34, no. 9, pp. 8327–8342, Sept. 2024, doi: 10.1109/TCSVT.2024.3376404.
[10] Z. Huang, W. Xue, Y. Zhou et al., "Dual-stage temporal perception network for continuous sign language recognition," The Visual Computer, vol. 41, pp. 1971–1986, 2025. [Online]. Available: https://doi.org/10.1007/s00371-024-03516-x
THANK YOU
