
Softcom Assignment1

Uploaded by

Yousuf ali Safin


COURSE NO: CSE 4114

Course Title: Pattern Recognition and Machine Learning

A Comprehensive Study to Sentiment Analysis of Bangla Cricket-Related Social Media Comments Using ML and LSTM Models
Research Participants

Adibul Haque (ID: 20200204029)
Yousuf Ali (ID: 20200204037)
Miftahul Sheikh (ID: 20200204038)

Slide 02
Research Paper Presentation
Outline
01 Abstract
02 Introduction
03 LITERATURE REVIEW
04 DATASETS
05 PRE-PROCESSING TECHNIQUES
06 MODELS
07 EVALUATION METRICS
08 RESEARCH GAP
09 CONCLUSION
10 CONTRIBUTION OF GROUP MEMBERS
11 REFERENCES

Slide 03
ABSTRACT
• Sentiment analysis of Bangla cricket-related social media comments.
• Logistic Regression, KNN, and LSTM models applied.
• Text normalization, tokenization, and word embeddings on Facebook and YouTube comments.
• KNN: 72.1%, Logistic Regression: 70.1%, LSTM: 77.6%.
• ML + DL boost Bangla sentiment analysis; future: expand dataset, explore hybrids.
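The normalization and tokenization steps named in the abstract could look roughly like the sketch below. The slides do not say how either step was implemented, so every rule here is an assumption: NFC normalization, URL removal, and treating a token as a run of code points in the Bengali Unicode block (U+0980 to U+09FF). The names `normalize` and `tokenize` are hypothetical.

```python
import re
import unicodedata

# Assumption: a "Bangla token" is a run of code points in the Bengali
# Unicode block (U+0980 to U+09FF). The slides do not define tokenization.
URL_RE = re.compile(r"https?://\S+")
BANGLA_WORD_RE = re.compile(r"[\u0980-\u09FF]+")


def normalize(comment: str) -> str:
    """Apply NFC normalization, drop URLs, and collapse whitespace."""
    text = unicodedata.normalize("NFC", comment)
    text = URL_RE.sub(" ", text)
    return " ".join(text.split())


def tokenize(comment: str) -> list[str]:
    """Return the Bangla word runs found in the normalized comment."""
    return BANGLA_WORD_RE.findall(normalize(comment))


if __name__ == "__main__":
    sample = "বাংলাদেশ   জিতেছে!!! https://example.com"
    print(tokenize(sample))
```

Latin-script words, emoji, and punctuation are discarded by this rule, which may or may not match the treatment used in the study; code-mixed comments would need a wider pattern.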

Slide 04
INTRODUCTION

• Increased social media use boosts cricket discussions in Bangladesh.
• Essential to understand public sentiment on cricket in Bangladeshi culture.
• Lack of Bangla sentiment analysis in cricket context.
• Aims to bridge the gap in Bangla sentiment analysis for cricket-related social media comments.
• Combines traditional and deep learning techniques to enhance sentiment analysis accuracy.

Slide 05
Motivation
• Aims to accurately interpret fan sentiments.
• Employs NLP and ML to handle Bangla language specifics.
• Addresses unique Bangla linguistic challenges.
• Applies advanced methods for deeper analysis.
• Enhances understanding of Bangladeshi cricket fans' opinions.

Slide 04
LITERATURE REVIEW

EVALUATION OF NAÏVE BAYES AND SUPPORT VECTOR MACHINES ON BANGLA TEXTUAL MOVIE REVIEWS.
• The paper analyzes Bangla movie reviews for sentiment.
• It uses Naive Bayes (NB) and Support Vector Machines (SVM) for polarity detection.
• SVM, with stemmed unigram features, achieved a precision of 0.86.

A DEEP LEARNING APPROACH TO DETECT ABUSIVE BENGALI TEXT.
• 82.20% for abusive Bengali text detection.
• Outperformed ANN (81.10%), LinearSVC (75.70%), Logit (75.20%), MNB (73.90%), and RF (70.50%).
• LSTM > other models.
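The unigram features mentioned for the movie-review study can be illustrated with a plain bag-of-words count. This is only a sketch: the stemming step is left out because the slides do not name a stemmer, the function names are hypothetical, and the transliterated tokens in the usage note are placeholders rather than data from any of the reviewed papers.

```python
from collections import Counter


def build_vocabulary(documents: list[list[str]]) -> dict[str, int]:
    """Map each distinct token to a column index, in sorted order."""
    vocab = sorted({tok for doc in documents for tok in doc})
    return {tok: i for i, tok in enumerate(vocab)}


def unigram_vector(tokens: list[str], vocab: dict[str, int]) -> list[int]:
    """Count occurrences of each vocabulary token; unknown tokens are skipped."""
    counts = Counter(tokens)
    vec = [0] * len(vocab)
    for tok, n in counts.items():
        if tok in vocab:
            vec[vocab[tok]] = n
    return vec
```

For example, with the placeholder documents `[["bhalo", "cinema"], ["kharap", "cinema", "kharap"]]`, the vocabulary has three columns and the second document has a count of 2 in the `kharap` column. Vectors of this kind are what a classifier such as NB or SVM would consume.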

Slide 06
LITERATURE REVIEW
A STUDY TOWARDS BANGLA FAKE NEWS DETECTION USING MACHINE LEARNING AND DEEP LEARNING.
• The study used 57,000 Bangla news items to identify fake news.
• Bi-LSTM models with GloVe and FastText achieved up to 96% accuracy.
• GRU model accuracy was 77%.

CRICKET SENTIMENT ANALYSIS FROM BANGLA TEXT USING RECURRENT NEURAL NETWORK WITH LONG SHORT TERM MEMORY MODEL.
• RNN with LSTM for Bangla cricket sentiment analysis.
• The LSTM model achieves an accuracy of 95%.
• LSTM outperforms the Support Vector Machine (SVM), which has an accuracy of 71.03%.

Slide 07
DATASETS
• Paper [1]: Utilized phishing dataset with 11,000 URLs and 30 features.
• Paper [2]: Real-world phishing data used, unspecified source.
• Paper [3]: CIC-Bell-DNS 2021 with 400,000 benign and 13,011 malicious samples; UCI Phishing Domains and 3,000 URLs.
• Paper [4]: Real-world website details, no specific dataset provided.
• Paper [5]: 10,000 URLs from Kaggle, balanced phishing and non-phishing.

Slide 09
PRE-PROCESSING TECHNIQUES

01 DATA CLEANING
02 FEATURE EXTRACTION
03 FEATURE SELECTION
04 NORMALIZATION AND SCALING
05 DATA ENCODING
06 BALANCING DATA
07 DATA SPLITTING
08 ADVANCED TECHNIQUES
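Three of the listed steps (data encoding, normalization and scaling, data splitting) are sketched below in plain Python. The slides give no implementation details, so the specific choices are assumptions: integer label encoding, min-max scaling, and a stratified split with a hypothetical 20% test ratio. All function names are illustrative.

```python
import random
from collections import defaultdict


def encode_labels(labels):
    """Data encoding: map each distinct label to an integer id (sorted order)."""
    mapping = {lab: i for i, lab in enumerate(sorted(set(labels)))}
    return [mapping[lab] for lab in labels], mapping


def min_max_scale(column):
    """Normalization and scaling: rescale one numeric column to [0, 1]."""
    lo, hi = min(column), max(column)
    if hi == lo:  # constant column: avoid dividing by zero
        return [0.0 for _ in column]
    return [(x - lo) / (hi - lo) for x in column]


def stratified_split(labels, test_ratio=0.2, seed=0):
    """Data splitting: return (train_idx, test_idx), keeping class proportions."""
    by_class = defaultdict(list)
    for idx, lab in enumerate(labels):
        by_class[lab].append(idx)
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    train_idx, test_idx = [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_test = round(len(indices) * test_ratio)
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return sorted(train_idx), sorted(test_idx)
```

Splitting per class rather than over the whole list keeps the class ratio of the test set close to that of the full data, which matters when the classes are unbalanced.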

Slide 10
MODELS USED
CLASSIFIER          ACCURACY   PRECISION   RECALL
LINEAR REGRESSION   GFG        STANDARD    PROFESSIONAL
SVM                 0.7214     0.6852      0.7215
RANDOM FOREST       0.7065     0.6754      0.7012
KNN                 0.7114     0.6814      0.7114
XGBOOST             0.7449     0.7350      0.7450

Slide 11
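As an illustration of one classifier from the table, a k-nearest-neighbours prediction can be sketched in a few lines. This is not the configuration behind the reported scores: the Euclidean distance, the default `k=3`, and the function names are all assumptions, since the slides do not state the distance measure or the value of k.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Straight-line distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_predict(train_x, train_y, query, k=3):
    """Label `query` by majority vote among its k nearest training points."""
    neighbours = sorted(
        zip(train_x, train_y), key=lambda pair: euclidean(pair[0], query)
    )[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]
```

With hypothetical 2-D training points clustered around (0, 0) labelled `neg` and around (5, 5) labelled `pos`, a query near the origin receives the label `neg`. An odd k avoids tied votes in the two-class case.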

Evaluation Metrics
• Accuracy
• Precision
• Recall/Sensitivity
• F-measure
• Error Rate (ERR)
• False Positive Rate (FPR)
• Specificity
• Detection Speed (DS)
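Most of the metrics listed above follow directly from the four cells of a binary confusion matrix. The sketch below uses the standard textbook definitions; the function name is hypothetical, the counts in the usage note are made up for illustration, and every denominator is assumed to be non-zero. Detection Speed is a timing measurement and is not covered.

```python
def binary_metrics(tp: int, fp: int, tn: int, fn: int) -> dict[str, float]:
    """Compute confusion-matrix metrics; assumes every denominator is non-zero."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)  # also called sensitivity
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f_measure": 2 * precision * recall / (precision + recall),
        "error_rate": 1 - accuracy,
        "false_positive_rate": fp / (fp + tn),
        "specificity": tn / (tn + fp),
    }
```

For instance, `binary_metrics(tp=40, fp=10, tn=45, fn=5)` describes 100 hypothetical predictions of which 85 are correct. Specificity and the false positive rate always sum to 1, so reporting both is redundant but common.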

Slide 12
RESEARCH GAP

Paper [1]: The HEFS method is slow for real-time detection in resource-limited environments and lacks thorough testing against different phishing types and false alarms.

Paper [2]: The study does not test the model against various phishing types or discuss real-world deployment challenges, and its embedding techniques may not capture all phishing variations.

Paper [3]: The GNN models need better accuracy and adaptation to new phishing tactics, focusing mainly on URL structures and requiring significant computing power.

Paper [4]: The system struggles with new phishing techniques and targeted attacks, has privacy concerns, and requires more research and teamwork to improve accuracy with feedback and context.

Paper [5]: The model's effectiveness depends heavily on data, may miss some phishing threats, and does not show significant benefits of using ANN and AdaBoost together over ANN alone.

Paper [6]: The study relies on outdated data and lacks comprehensive testing against various phishing techniques, potentially limiting its practical applicability and effectiveness.

Slide 13
CONCLUSION

1 Extensive research of ML techniques.
2 Random Forest and Neural Networks are highly accurate.
3 Feature engineering and preprocessing are crucial.
4 Larger datasets and real-world tests are needed.
5 The study suggests future cybersecurity improvements.

Slide 14
Related Papers

01 Nayan Banik and Md Hasan Hafizur Rahman. Evaluation of naïve bayes and support vector machines on bangla textual movie reviews. In 2018 International Conference on Bangla Speech and Language Processing (ICBSLP), pages 1–6. IEEE, 2018.

02 Estiak Ahmed Emon, Shihab Rahman, Joti Banarjee, Amit Kumar Das, and Tanni Mittra. A deep learning approach to detect abusive bengali text. In 2019 7th International Conference on Smart Computing & Communications (ICSCC), pages 1–5. IEEE, 2019.

03 Elias Hossain, Md Nadim Kaysar, Abu Zahid Md Jalal Uddin Joy, Md Mizanur Rahman, and Wahidur Rahman. A study towards bangla fake news detection using machine learning and deep learning. In Sentimental Analysis and Deep Learning: Proceedings of ICSADL 2021, pages 79–95. Springer, 2022.

04 Md Ferdous Wahid, Md Jahid Hasan, and Md Shahin Alom. Cricket sentiment analysis from bangla text using recurrent neural network with long short term memory model. In 2019 International Conference on Bangla Speech and Language Processing (ICBSLP), pages 1–4. IEEE, 2019.

Slide 15
CONTRIBUTION OF GROUP MEMBERS
MEMBER            WRITING REPORT PAPER                                   PREPARING PRESENTATION
Yousuf Ali        Abstract, Introduction, Conclusion.                    Adib
Adibul Haque      Literature Review, Datasets, References                Mifta
Miftahul Sheikh   Pre-processing Techniques, Models, Evaluation Metrics  Nafisa

Slide 16
THANK YOU
