
Experiential Learning -2

NAME: BALAKUMAR M D
REG NO: 927621BAL005
SUBJECT NAME: NATURAL LANGUAGE PROCESSING
SUBJECT CODE: 18AMC306T
1. Text Summarizer
Develop a tool that can automatically generate concise summaries of given
texts, making use of extractive or abstractive summarization techniques.

Introduction

Text summarization involves condensing a longer piece of text into a shorter version,
focusing on retaining the essential points and key information. This process can be classified
into two primary categories:

• Extractive Summarization: This method involves identifying and selecting key sentences, phrases, or segments from the original text and combining them to form a summary.
• Abstractive Summarization: This technique requires understanding the main ideas of the text and generating new sentences to convey these ideas, often utilizing advanced natural language processing (NLP) techniques.

Objectives

The main goal is to create an automated tool that can produce concise summaries of given
texts. This tool should be capable of:

1. Comprehending the context and key points of the input text.
2. Generating a coherent and concise summary.
3. Supporting both extractive and abstractive summarization techniques.

Methodology

Data Collection and Preprocessing

• Data Sources: Gather a diverse range of texts from sources such as news articles, academic papers, blogs, and books.
• Preprocessing Steps:
  o Tokenization: Splitting the text into sentences and words.
  o Stop-word Removal: Removing common words that do not significantly contribute to the meaning.
  o Stemming and Lemmatization: Converting words to their base or root forms (a brief sketch follows this list).
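
To illustrate the stemming and lemmatization step, the following is a minimal sketch using NLTK; the normalize_words helper is hypothetical and assumes the punkt and wordnet resources have already been downloaded with nltk.download().

python
from nltk.stem import PorterStemmer, WordNetLemmatizer
from nltk.tokenize import word_tokenize

stemmer = PorterStemmer()
lemmatizer = WordNetLemmatizer()

def normalize_words(text):
    # Reduce each token to a crude stem and a dictionary lemma
    words = word_tokenize(text)
    stems = [stemmer.stem(w) for w in words]           # e.g. "running" -> "run"
    lemmas = [lemmatizer.lemmatize(w) for w in words]  # e.g. "mice" -> "mouse"
    return stems, lemmas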

Extractive Summarization

• Techniques:
  o Frequency-Based Methods: Identify the most frequently occurring terms in the text and select sentences containing these terms (a simple sketch of this approach follows this list).
  o Graph-Based Methods: Use algorithms like TextRank to score sentences based on their connectivity in a graph representation of the text.
  o Machine Learning-Based Methods: Train models to identify and select key sentences.
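
As an illustration of the frequency-based approach, the sketch below scores each sentence by the frequency of its non-stop-words and keeps the top-scoring sentences in their original order; the frequency_summary helper is hypothetical and assumes the NLTK punkt and stopwords resources are available.

python
from collections import Counter
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize, sent_tokenize

def frequency_summary(text, num_sentences=3):
    stop_words = set(stopwords.words('english'))
    words = [w.lower() for w in word_tokenize(text)
             if w.isalnum() and w.lower() not in stop_words]
    freq = Counter(words)

    sentences = sent_tokenize(text)
    scores = {i: sum(freq.get(w.lower(), 0) for w in word_tokenize(s))
              for i, s in enumerate(sentences)}

    # Keep the highest-scoring sentences, restored to document order
    top = sorted(sorted(scores, key=scores.get, reverse=True)[:num_sentences])
    return ' '.join(sentences[i] for i in top)
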
Abstractive Summarization

• Neural Networks:
  o Recurrent Neural Networks (RNNs): Particularly Long Short-Term Memory (LSTM) networks to handle sequential data.
  o Attention Mechanisms: Focus on different parts of the text to generate more relevant summaries.
  o Transformer Models: Utilize advanced models like BERT, GPT, and T5 to generate summaries.
• Training Data: Use paired datasets of articles and their summaries for training (see the loading sketch after this list).
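
One widely used paired dataset is CNN/DailyMail; the sketch below loads it through the Hugging Face datasets library. This is only an illustration and assumes the datasets package is installed.

python
from datasets import load_dataset

# Each example pairs a news article with a human-written summary
dataset = load_dataset("cnn_dailymail", "3.0.0")
example = dataset["train"][0]
article = example["article"]        # source text
reference = example["highlights"]   # reference summary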

Implementation

Tools and Libraries

• Programming Language: Python
• Libraries:
  o NLTK, spaCy: For text preprocessing.
  o Gensim: For extractive summarization.
  o TensorFlow, PyTorch: For building and training neural network models.
  o Hugging Face Transformers: For implementing transformer models.

Steps

1. Text Preprocessing:

python
import nltk
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize, sent_tokenize

# Download the required NLTK resources once before use
nltk.download('punkt')
nltk.download('stopwords')

def preprocess_text(text):
    # Tokenize the text and drop common English stop-words
    stop_words = set(stopwords.words('english'))
    words = word_tokenize(text)
    filtered_words = [w for w in words if w not in stop_words]
    return ' '.join(filtered_words)

2. Extractive Summarization using TextRank:

python
# Note: gensim.summarization was removed in gensim 4.0, so this requires gensim < 4.0
from gensim.summarization import summarize

def extractive_summary(text):
    # Keep roughly 20% of the original length using gensim's TextRank-based summarizer
    return summarize(text, ratio=0.2)

3. Abstractive Summarization using Transformers:

python
from transformers import pipeline

# Uses the library's default summarization model
summarizer = pipeline("summarization")

def abstractive_summary(text):
    summary = summarizer(text, max_length=150, min_length=30, do_sample=False)
    return summary[0]['summary_text']

Evaluation Metrics

• ROUGE (Recall-Oriented Understudy for Gisting Evaluation): Measures the overlap between the system-generated summary and reference summaries (a scoring sketch follows this list).
• BLEU (Bilingual Evaluation Understudy): Assesses the accuracy of the generated summary against reference summaries.
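
As one way to compute ROUGE in Python, the following sketch uses the rouge-score package; this choice of library is an assumption, and any ROUGE implementation could be substituted.

python
from rouge_score import rouge_scorer

scorer = rouge_scorer.RougeScorer(['rouge1', 'rouge2', 'rougeL'], use_stemmer=True)

def evaluate_summary(reference, generated):
    # Returns precision, recall, and F1 for each ROUGE variant
    return scorer.score(reference, generated)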

Results

• Both extractive and abstractive summarization techniques were implemented and tested.
• Performance was evaluated using ROUGE and BLEU scores.
• It was observed that transformer-based models provided more coherent and contextually accurate summaries compared to traditional methods.

Conclusion

The developed text summarization tool effectively generates concise summaries of given
texts using both extractive and abstractive methods. While transformer-based abstractive
summarization is computationally intensive, it yields superior results in terms of coherence
and relevance. Future enhancements will focus on optimizing models for faster processing
and expanding the training dataset to improve performance across different domains.

Future Work

• Model Optimization: Improve the performance and speed of transformer models.
• Dataset Expansion: Incorporate more diverse texts for training to enhance generalization.
• User Interface: Develop a user-friendly interface for easy access and utilization of the summarization tool.

2. Chatbot for Customer Support
Develop a chatbot that can interact with users, answer frequently asked
questions, and provide support using a knowledge base.

Introduction

Customer support chatbots are automated systems designed to interact with users, answer
frequently asked questions (FAQs), and provide support by leveraging a pre-defined
knowledge base. These chatbots enhance customer service efficiency, reduce wait times, and
offer 24/7 support.

Objectives

The primary objectives of this case study are to develop a chatbot that can:

1. Communicate with users in natural language.
2. Accurately answer FAQs.
3. Provide relevant support using a comprehensive knowledge base.
4. Escalate complex queries to human agents when necessary.

Methodology

Requirements Gathering

• User Requirements: Identify common user needs and frequently asked questions.
• Technical Requirements: Determine the platform (e.g., web, mobile), integration points (e.g., CRM systems), and technical stack.

Data Collection and Preparation

• FAQs Collection: Gather a list of frequently asked questions from existing customer support data.
• Knowledge Base Creation: Compile a detailed knowledge base with answers and support documentation (a simple data-structure sketch follows this list).
• Training Data: Collect conversational data to train the chatbot's natural language understanding (NLU) models.
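
One simple way to represent such a knowledge base is a JSON file of FAQ entries; the structure below is only an illustration, and the file name and field names are assumptions.

python
import json

# Illustrative knowledge base: each entry pairs keywords with a canned answer
knowledge_base = [
    {
        "question": "How do I reset my password?",
        "keywords": ["reset", "password", "forgot"],
        "answer": "Use the 'Forgot password' link on the login page to receive a reset email."
    },
    {
        "question": "What are your support hours?",
        "keywords": ["hours", "support", "open"],
        "answer": "Our support team is available 24/7 through this chatbot and email."
    }
]

with open('data/knowledge_base.json', 'w') as f:
    json.dump(knowledge_base, f, indent=2)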

Chatbot Design

• Architecture: Design a modular architecture with components for NLU, dialogue management, and response generation.
• Conversational Flow: Create conversational flows for various user intents and scenarios.

Implementation

Tools and Libraries


• Programming Language: Python
• Frameworks and Libraries:
  o NLU: Rasa NLU, spaCy
  o Dialogue Management: Rasa Core
  o API Integration: Flask/Django for backend services
  o Database: MongoDB for storing user interactions and the knowledge base
  o Front-end: HTML/CSS/JavaScript for the chat interface

Steps

1. Natural Language Understanding (NLU):
  o Intent Recognition: Identify user intents such as greeting, asking for information, or seeking support.
  o Entity Extraction: Extract relevant entities such as dates, product names, or locations.

python
from rasa.nlu.training_data import load_data
from rasa.nlu.model import Trainer
from rasa.nlu import config

def train_nlu():
    # Load NLU examples, train the pipeline defined in config.yml,
    # and persist the resulting model
    training_data = load_data('data/nlu.md')
    trainer = Trainer(config.load('config.yml'))
    trainer.train(training_data)
    model_directory = trainer.persist('models/', fixed_model_name='nlu')
    return model_directory

2. Dialogue Management:
  o Define Stories: Outline conversation paths using stories.
  o Action Implementation: Define custom actions for fetching information from the knowledge base.

python
from rasa.core.agent import Agent
from rasa.core.policies import MemoizationPolicy, KerasPolicy

def train_dialogue():
    # Train the dialogue model from the stories defined in data/stories.md
    agent = Agent('domain.yml', policies=[MemoizationPolicy(), KerasPolicy()])
    training_data = agent.load_data('data/stories.md')
    agent.train(training_data)
    agent.persist('models/dialogue')

3. Response Generation:
  o Template Responses: Use template responses for common queries.
  o Custom Actions: Implement custom actions to fetch dynamic data.

python
from rasa_sdk import Action

class ActionFetchAnswer(Action):
    def name(self):
        return 'action_fetch_answer'

    def run(self, dispatcher, tracker, domain):
        # Look up an answer for the user's latest message in the knowledge base
        query = tracker.latest_message['text']
        answer = fetch_from_knowledge_base(query)  # custom lookup helper (sketched below)
        dispatcher.utter_message(text=answer)
        return []
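
The fetch_from_knowledge_base helper is not part of the Rasa SDK; one possible implementation, sketched below under the assumption that the knowledge base uses the JSON format shown earlier, matches the query against each entry's keywords.

python
import json

def fetch_from_knowledge_base(query, path='data/knowledge_base.json'):
    # Simple keyword overlap between the user query and each FAQ entry
    with open(path) as f:
        entries = json.load(f)
    query_words = set(query.lower().split())

    best_entry, best_score = None, 0
    for entry in entries:
        score = len(query_words & set(entry['keywords']))
        if score > best_score:
            best_entry, best_score = entry, score

    if best_entry:
        return best_entry['answer']
    return "Sorry, I could not find an answer. Let me connect you to a human agent."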

4. Integration and Deployment:
  o API Development: Develop APIs to connect the chatbot with web or mobile interfaces.
  o Deployment: Deploy the chatbot on a server and integrate it with the customer support system.

python
from flask import Flask, request
from rasa.core.agent import Agent

app = Flask(__name__)
agent = Agent.load('models/dialogue')

@app.route('/webhook', methods=['POST'])
def webhook():
    # Pass the incoming message to the dialogue agent and return its reply
    user_message = request.json['message']
    response = agent.handle_text(user_message)
    return {'response': response[0]['text']}

if __name__ == '__main__':
    app.run(port=5005)
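
A quick way to exercise the deployed endpoint is a small client script; the example below uses the requests library and assumes the Flask app above is running locally on port 5005.

python
import requests

# Send a test message to the chatbot webhook and print its reply
reply = requests.post(
    'http://localhost:5005/webhook',
    json={'message': 'How do I reset my password?'}
)
print(reply.json()['response'])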

Evaluation and Testing

• Testing Scenarios: Test the chatbot with various user queries and scenarios.
• User Feedback: Collect feedback from users to improve the chatbot's performance.
• Performance Metrics: Measure the chatbot's accuracy, response time, and user satisfaction.

Results

• Implemented a chatbot that can handle FAQs and provide support using a knowledge base.
• Achieved high accuracy in intent recognition and entity extraction.
• Reduced average response time and improved customer satisfaction.

Conclusion

The developed customer support chatbot effectively interacts with users, answers FAQs, and
provides support using a pre-defined knowledge base. It enhances the customer service
experience by providing instant responses and reducing the workload on human agents.

Future Work

• Continuous Learning: Implement machine learning models for continuous learning from new interactions.
• Advanced Features: Add features such as sentiment analysis, voice interaction, and multilingual support.
• Integration with Other Systems: Integrate with additional systems like CRM, ERP, and other third-party services for more comprehensive support.

This case study outlines the systematic approach to developing a customer support chatbot,
detailing the necessary steps from requirements gathering to deployment and future
improvements.
