COPS Summer of Code 2025

Intelligence Guild

Club Of Programmers, IIT (BHU) Varanasi

NLP Track: Sequence Modeling


2 – 8 June 2025

Official IG Website: https://cops-iitbhu.github.io/IG-website/

All deadlines are strict. No extensions will be granted.


Introduction
COPS Summer of Code (CSOC) is a flagship initiative under the Club Of Programmers, IIT (BHU) Varanasi, with all verticals contributing through focused tracks. This document continues the deep-learning journey begun with the ANN module and covers the basics of sequence modelling. Modules will be released from time to time. Adhere strictly to deadlines. Submissions will be evaluated on approach, technical correctness, and clarity. The most technically accurate solution may not necessarily be the one chosen; clarity of thought and a well-reasoned approach will be valued more.

Communities
All communication for the programme will be conducted strictly via Discord; do not reach out through other channels. Resources and updates will be posted on GitHub, and all notifications will be announced on Discord.

Final Report
A concise report may be submitted along with your final assignment. While not mandatory, it may strengthen your overall evaluation. Reports must be written in LaTeX and submitted in PDF format only. We are not interested in surface-level descriptions; focus strictly on your analysis, approach, and reasoning. The report itself constitutes the final assignment, and no additional files are to be submitted. Refer to the Assignment section for details.

Contact Details
In case of any doubts, clarifications, or guidance, you can contact one of us. We request that you stick to Discord as the preferred mode of communication, since questions asked there also benefit others. However, you can reach out to us through other means if we fail to respond on Discord.

• Tejbir Panghal - 9034705165

• Sakshi Kumar - 8073247266

Resources
In this module, we’ll dive into the world of Recurrent Neural Networks (RNNs),
Long Short-Term Memory networks (LSTMs), Gated Recurrent Units (GRUs),
and Word Embeddings. These are crucial concepts for processing sequential data in
NLP.

Recurrent Neural Networks (RNNs), LSTMs, and GRUs


This section will guide you through the fundamental concepts of sequence models.

• If you’re looking for clear, intuitive explanations, check out StatQuest with Josh Starmer:

– For RNNs: Recurrent Neural Networks (RNNs), Clearly Explained!!!


– For LSTMs: Long Short-Term Memory (LSTM), Clearly Explained

• For a visual and step-by-step breakdown of both LSTM and GRU: Illustrated Guide
to LSTM’s and GRU’s: A step by step explanation by The AI Hacker.

• To get hands-on with implementing these in PyTorch (a short usage sketch follows this list): PyTorch Tutorial - RNN & LSTM & GRU - Recurrent Neural Nets by Patrick Loeber.

• For a deeper understanding by building an RNN from scratch in Python: RNN From Scratch In Python by Dataquest.
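
As a quick orientation before (or alongside) these tutorials, the snippet below is a minimal sketch of how the three recurrent layers are constructed in PyTorch; the shapes and sizes are illustrative only and not part of any assignment.

import torch
import torch.nn as nn

# A toy batch: 4 sequences, each 10 time steps long, 8 features per step.
x = torch.randn(4, 10, 8)

# The three layers share the same constructor signature; batch_first=True
# keeps the batch dimension first in inputs and outputs.
rnn = nn.RNN(input_size=8, hidden_size=16, batch_first=True)
lstm = nn.LSTM(input_size=8, hidden_size=16, batch_first=True)
gru = nn.GRU(input_size=8, hidden_size=16, batch_first=True)

out, h_n = rnn(x)          # out: (4, 10, 16), h_n: (1, 4, 16)
out, (h_n, c_n) = lstm(x)  # the LSTM additionally returns a cell state c_n
out, h_n = gru(x)          # the GRU, like the plain RNN, has no cell state

In all three cases, out holds the hidden state at every time step, while h_n holds only the final hidden state, which is what a classifier typically consumes.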

If you prefer reading over videos, these resources are excellent:

• A comprehensive article covering RNNs, LSTMs, and GRUs: A Journey Through RNN, LSTM, GRU and beyond.

• Colah’s blog is highly recommended for its in-depth and intuitive explanations,
especially for LSTMs: Understanding LSTMs.

Word Embeddings
Understanding how words are represented numerically is key; a small sketch of the classic representations follows the resource list below.

• For the basics of text preprocessing and early embedding techniques:

– Text preprocessing steps: Understanding the Essentials: NLP Text Preprocessing Steps.
– NLP Zero to Hero (Part 1), covering Bag of Words, TF-IDF, and an introduction to Word2Vec: Introduction, BoW, TF-IDF, Word2Vec.

• Krish Naik offers great conceptual videos:

– Bag of Words intuition: Natural Language Processing | Bag Of Words Intuition.
– TF-IDF intuition: Natural Language Processing | TF-IDF Intuition | Text Preprocessing.
– An introduction to Word Embeddings: Word Embedding - Natural Language Processing | Deep Learning.

• For a deep dive into Word2Vec:

– An excellent breakdown of skip-gram and CBOW: Word2Vec Explained by Lilian Weng.
– A video explanation of Word2Vec: Word Embedding and Word2Vec, Clearly
Explained!!! by StatQuest with Josh Starmer.
– A highly visual and intuitive explanation: The Illustrated Word2Vec – from
Jay Alammar.
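
To make the progression from count-based representations to learned embeddings concrete, here is a minimal sketch using scikit-learn and gensim (neither library is prescribed by this module, and the tiny corpus is purely illustrative).

from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from gensim.models import Word2Vec

corpus = [
    "the battery life is great",
    "the battery died after a week",
    "great sound quality",
]

# Bag of Words: each text becomes a vector of raw word counts.
bow = CountVectorizer()
print(bow.fit_transform(corpus).toarray())

# TF-IDF: counts are re-weighted so that words shared by every text matter less.
tfidf = TfidfVectorizer()
print(tfidf.fit_transform(corpus).toarray())

# Word2Vec (skip-gram): learns a dense vector per word from co-occurrence.
tokenised = [text.split() for text in corpus]
w2v = Word2Vec(tokenised, vector_size=50, window=3, min_count=1, sg=1)
print(w2v.wv["battery"].shape)  # (50,)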

Extras
For those eager to explore further:

• A systematic empirical analysis of LSTM variants: LSTM: A Search Space Odyssey.

• A classic and insightful blog post by Andrej Karpathy: The Unreasonable Effec-
tiveness of Recurrent Neural Networks.

• Stanford’s renowned NLP course material (Week 1 is highly relevant): Stanford 224n course.

• The original Word2Vec paper: Efficient Estimation of Word Representations in Vector Space (Mikolov et al., 2013).

• For understanding more advanced RNN architectures:

– Bidirectional RNNs (BiLSTM, BiGRU): Bidirectional RNN | BiLSTM | Bidirectional LSTM | Bidirectional GRU by CampusX.
– Deep/Stacked RNNs, LSTMs, and GRUs: Deep RNNs | Stacked RNNs | Stacked LSTMs | Stacked GRUs by CampusX.

• A personal favorite NLP course in blog form, covering a wide range of topics: https://lena-voita.github.io/nlp_course.html.

Assignment: Binary Sentiment Classification


Objective:
Build a sequence-based model (RNN/LSTM/GRU) to predict sentiment polarity (positive or negative) from product review text. Go beyond detecting keywords: the model should learn subtle tone, sarcasm, and emotional cues, which encourages thoughtful model design.

Dataset Summary:
• text: Full review written by a user.

• title: Title of the review.

• polarity:

– 1 → Negative
– 2 → Positive

Dataset link: Amazon Reviews Dataset
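
As a starting point, the 1/2 polarity labels can be remapped to the 0/1 targets that most binary loss functions expect. The sketch below assumes the common CSV layout of this dataset (no header row, columns in the order polarity, title, text) and a file named train.csv; verify both against the files you actually download.

import pandas as pd

# Assumed file name and column order; adjust to match the actual download.
cols = ["polarity", "title", "text"]
df = pd.read_csv("train.csv", header=None, names=cols)

# Map the 1/2 polarity labels to 0/1 targets: 0 = negative, 1 = positive.
df["label"] = df["polarity"].map({1: 0, 2: 1})
print(df["label"].value_counts())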

Task Description:
• Preprocess and tokenize the text (cleaning, padding, truncating).

• Use GloVe or Word2Vec embeddings (trainable or pre-trained).

• Train a sequence model (an RNN, LSTM, or GRU, or their variants) for binary classification; a minimal skeleton combining these steps is sketched after this list.

• Evaluate using accuracy, F1-score, and confusion matrix.

• Analyze errors, especially false positives/negatives where the language is ambiguous or sarcastic.
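
One possible skeleton covering these steps is sketched below. Treat it as a sketch rather than a reference solution: the vocabulary size, maximum length, and hyperparameters are placeholders, the tokenizer is a naive whitespace split, and pre-trained GloVe/Word2Vec vectors would normally be loaded into the embedding layer via nn.Embedding.from_pretrained rather than learned from scratch as shown.

import torch
import torch.nn as nn
from sklearn.metrics import accuracy_score, f1_score, confusion_matrix

MAX_LEN = 200       # pad or truncate every review to this many tokens (placeholder)
VOCAB_SIZE = 20000  # placeholder vocabulary size
PAD_IDX, UNK_IDX = 0, 1

def encode(text, vocab):
    # Naive whitespace tokenization, then pad/truncate to a fixed length.
    ids = [vocab.get(tok, UNK_IDX) for tok in text.lower().split()][:MAX_LEN]
    return ids + [PAD_IDX] * (MAX_LEN - len(ids))

class SentimentLSTM(nn.Module):
    def __init__(self, vocab_size, embed_dim=100, hidden_dim=128):
        super().__init__()
        # With pre-trained vectors, build this layer with
        # nn.Embedding.from_pretrained(weights, freeze=False) instead.
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=PAD_IDX)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.fc = nn.Linear(hidden_dim, 1)

    def forward(self, x):
        embedded = self.embedding(x)         # (batch, seq_len, embed_dim)
        _, (h_n, _) = self.lstm(embedded)    # h_n: (1, batch, hidden_dim)
        return self.fc(h_n[-1]).squeeze(-1)  # one raw logit per review

model = SentimentLSTM(VOCAB_SIZE)
criterion = nn.BCEWithLogitsLoss()  # expects raw logits and 0/1 float labels
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

def evaluate(logits, labels):
    # Accuracy, F1-score, and confusion matrix from raw logits.
    preds = (torch.sigmoid(logits) > 0.5).long().numpy()
    labels = labels.numpy()
    return (accuracy_score(labels, preds),
            f1_score(labels, preds),
            confusion_matrix(labels, preds))

A training loop would then iterate over batches of encoded reviews, compute criterion(model(batch), labels.float()), and backpropagate. Swapping nn.LSTM for nn.RNN or nn.GRU (which return only a hidden state, not a cell state, so the forward pass changes slightly) gives the comparison models.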

Bonus: Minimal Clue Challenge:


• Pick 5–10 short, ambiguous reviews that your trained model misclassified and answer:

– Why might your model misclassify this?
– What clues could a human pick up that your model can’t?
– How would you fix this?

Stretch Goals:
• Compare the lengths of the reviews that the RNN model classifies correctly with the lengths of those that the LSTM/GRU models classify correctly.

• Using this comparison, analyze how review length or rare words affect performance (a small analysis sketch follows this list).
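
A minimal sketch of such a length-bucketed comparison is given below; the numbers are synthetic placeholders to be replaced with your actual test-set review lengths and per-model correctness flags.

import numpy as np
import pandas as pd

# Synthetic placeholders: substitute real test-set lengths and per-model
# correctness flags (prediction == true label) from your trained models.
rng = np.random.default_rng(0)
results = pd.DataFrame({
    "length": rng.integers(5, 300, size=1000),  # tokens per review
    "rnn_correct": rng.random(1000) < 0.80,
    "lstm_correct": rng.random(1000) < 0.87,
})

# Bucket reviews by length and compare per-bucket accuracy across models.
results["bucket"] = pd.cut(results["length"], bins=[0, 50, 100, 200, 300])
print(results.groupby("bucket", observed=True)[["rnn_correct", "lstm_correct"]].mean())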

Submission Guidelines
• Create a GitHub repository named <roll number>-CSOC-IG (e.g., 23014019-CSOC-IG)

• Repository organization:

– A folder named "Sequence Modelling Basics" containing all source code implementations
– The final report in PDF format, authored using LaTeX

Everything must be in the GitHub repository itself.

• Submit the repository link via the provided Google Form here

• Note: The report constitutes the primary assignment submission. No additional files are required.

• Deadlines are strict and will not be extended

Final Remarks
Ensure that your submission reflects a clear understanding of the concepts and method-
ologies applied. Focus on the analytical aspects and the rationale behind your implemen-
tations. We look forward to your insightful contributions.

Adios, and keep learning!
