0% found this document useful (0 votes)

7 views7 pages

DNLP ABL Project

ABL

Uploaded by

rajdandwe503

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views7 pages

DNLP ABL Project

ABL

Uploaded by

rajdandwe503

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

A

Project Based Learning Report

For

“Develop text summarization tool by using extractive

summarization techniques.”
is submitted in partial fulfillment of the requirement for the award of degree of
Bachelor of Technology
(VII Semester B. Tech. for the course Data Science for NLP (PECAD703T))
in

Artificial Intelligence & Data Science

Submitted by
Aashay Kale Rohan Chouhan
Rajesh Dandwe Tanu Patil

Under the guidance of

Prof. Kalyani Pendke
Assistant Professor

Department of Emerging Technologies

(Artificial Intelligence & Data Science)

S. B. Jain Institute of Technology,

Management and Research, Nagpur
(An Autonomous Institute Affiliated to R. T.M. Nagpur University)
Academic Session: 2024-2025 (ODD)
Problem Statement:

Develop text summarization tool by using extractive summarization techniques.

Objectives:

1. Develop an Extractive Summarization Model: Design and implement an extractive text

summarization model that selects and ranks the most relevant sentences from a given text based on
predefined criteria such as frequency, position, and importance.

2. Improve Summarization Efficiency: Ensure that the model can handle large volumes of text
while maintaining high performance in terms of speed and accuracy. The tool should be capable of
summarizing content from various sources, including news articles, research papers, and reports.

3. Ensure Relevance and Coherence: Create summaries that are coherent and retain the essential
meaning of the original text. The tool must focus on producing summaries that are both concise and
representative of the main points.

4. Evaluate Model Performance: Establish a framework for evaluating the accuracy, precision, and
readability of the summaries generated by the tool. Use metrics such as ROUGE scores, human
evaluation, or domain-specific criteria to assess the quality of the summaries.

5. Provide a User-Friendly Interface: Develop a simple and intuitive user interface (UI) for end-
users to input text and receive a summary. The UI should cater to both technical and non-technical
users, ensuring accessibility and ease of use.

6. Support Multi-Domain Text: Ensure that the summarization tool can handle different types of
text across various domains, such as technical documents, legal papers, and general news, by
adjusting extraction strategies to suit the text type.

7. Incorporate Customization: Allow users to adjust the summary length or level of detail,
providing flexibility for generating summaries based on the user’s specific needs.
Introduction:

Automatic text summarization refers to a group of methods that employ algorithms to compress a
certain amount of text while preserving the text’s key points. Although it may not receive as much
attention as other machine learning successes, this field of computer automation has witnessed
consistent advancement and improvement. Therefore, systems capable of extracting the key concepts
from the text while maintaining the overall meaning have the potential to revolutionize a variety of
industries, including banking, law, and even healthcare.

• Types of Text Summarization

There are typically two basic methods for automatic text summarization:

1. Extractive summarization
2. Abstractive summarization

➢ Extractive Summarization
Extractive summarization algorithms are employed to generate a summary by selecting and
combining key passages from the source material. Unlike humans, these models emphasize creating
the most essential sentences from the original text rather than generating new ones.
Extractive summarization utilizes the Text Rank algorithm, which is highly suitable for text
summarization tasks. Let’s explore how it functions by considering a sample text summarization
scenario. The process of extractive summarizing involves picking the most relevant sentences from
an article and systematically organizing them. The sentences making up the summary are taken
verbatim from the source material. Extractive summarization systems, as we know them now,
revolve around three fundamental operations:

1) Construction of an intermediate representation of the input text

Topic representation and indicator representation are examples of representation-based methods. To

understand the subject(s) mentioned in the text, topic representation converts the text into an
intermediate representation.
2) Scoring the sentences based on the representation

At the time of the generation of the intermediate representation, each sentence is given a significance
score. When using a method that relies on topic representation, a sentence's score reflects how
effectively it elucidates critical concepts in the text. In indicator representation, the score is
computed by aggregating the evidence from different weighted indicators.

3) Selection of a summary comprising several sentences

To generate a summary, the summarizer software picks the top k sentences. For example, some
methods use greedy algorithms to pick and choose which sentences are most relevant, while others
may transform sentence selection into an optimization problem in which a set of sentences is
selected under the stipulation that it must maximize overall importance and coherence while
minimizing the quantity of redundant information.

➢ Utilizing the TextRank Algorithm for Extractive Text Summarization:

The implementation of TextRank offers a spaCy pipeline as an additional feature. SpaCy is an

excellent Python library for addressing challenges in natural language processing. Additionally, you
need pytextrank, a spaCy extension that effectively implements the TextRank algorithm. It is evident
that the TextRank algorithm can produce reasonably satisfactory results. Nevertheless, extractive
summarization techniques merely provide a modified version of the original text, retaining certain
phrases that were not eliminated, instead of generating new text (new data) to summarize the
information contained in the original text.
Code:

Spacy
To Install the Spacy and Dowload the English Language Dependency run the below code in terminal

!pip install spacy

To install the english laguage dependency

!python3 -m spacy download en_core_web_lg

TextRank
To Install the TextRank

!pip install pytextrank

Text Summarizations
This code uses spaCy and PyTextRank to automatically summarize a given text. It first installs the
required packages, downloads a spaCy language model, and loads the model with the TextRank
summarization pipeline. It then processes a lengthy text and generates a summary of the text’s key
phrases and sentences. The summary is limited to 2 phrases and 2 sentences.

import spacy
import pytextrank

nlp = spacy.load("en_core_web_lg")
nlp.add_pipe("textrank")

example_text = """Deep learning (also known as deep structured learning) is part of a broader family
of machine learning methods based on artificial neural networks with representation learning.
Learning can be supervised, semi-supervised or unsupervised. Deep-learning architectures such as
deep neural networks, deep belief networks, deep reinforcement learning, recurrent neural networks
and convolutional neural networks have been applied to fields including computer vision, speech
recognition, natural language processing, machine translation, bioinformatics, drug design, medical
image analysis, material inspection and board game programs, where they have produced results
comparable to and in some cases surpassing human expert performance. Artificial neural networks
(ANNs) were inspired by information processing and distributed communication nodes in biological
systems. ANNs have various differences from biological brains. Specifically, neural networks tend
to be static and symbolic, while the biological brain of most living organisms is dynamic (plastic)
and analogue. The adjective "deep" in deep learning refers to the use of multiple layers in the
network. Early work showed that a linear perceptron cannot be a universal classifier, but that a
network with a nonpolynomial activation function with one hidden layer of unbounded width
can.Deep learning is a modern variation which is concerned with an unbounded number of layers of
bounded size, which permits practical application and optimized implementation, while retaining
theoretical universality under mild conditions. In deep learning the layers are also permitted to be
heterogeneous and to deviate widely from biologically informed connectionist models, for the sake
of efficiency, trainability and understandability, whence the structured part."""

print('Original Document Size:',len(example_text))

doc = nlp(example_text)

for sent in doc._.textrank.summary(limit_phrases=2, limit_sentences=2):

print(sent)
print('Summary Length:',len(sent))

Output:
Conclusion:

The development of an extractive text summarization tool offers a practical solution to efficiently
process large volumes of text while retaining key information. By selecting the most relevant
sentences, this tool enhances productivity across industries like healthcare, finance, and law. Despite
challenges in maintaining coherence and relevance, extractive techniques provide a scalable
approach to summarization. Ultimately, the tool improves information retrieval and decision-
making, enabling users to quickly access essential insights from extensive data.

Evaluation Parameters
Sr. Roll No. Faculty Submissio Viva Total Signature
No /USN No. Name of Student Assessment n (3M) (3M) (10M)
(4M)
1 Aashay Subhash Kale
AD21061

2 Rohan Chouhan
AD21062

3 Rajesh Umesh Dandwe

AD21063

4 Tanu Prakash Patil

AD22D001

Signature of Course In-Charge

NLP Text Summarization Techniques
100% (1)
NLP Text Summarization Techniques
8 pages
Ann Rec054
No ratings yet
Ann Rec054
1 page
Abstractive Text Summarization Using Transformer Architecture
No ratings yet
Abstractive Text Summarization Using Transformer Architecture
5 pages
NLP Based Automated Text Summarization and Translation A Comprehensive Analysis
No ratings yet
NLP Based Automated Text Summarization and Translation A Comprehensive Analysis
4 pages
Text Summarization Using Word Frequency
No ratings yet
Text Summarization Using Word Frequency
3 pages
Arabic Text Summarization
No ratings yet
Arabic Text Summarization
3 pages
EASESUM: An Online Abstractive and Extractive Text Summarizer Using Deep Learning Technique
No ratings yet
EASESUM: An Online Abstractive and Extractive Text Summarizer Using Deep Learning Technique
12 pages
Text Summarization Using Python NLTK
No ratings yet
Text Summarization Using Python NLTK
8 pages
21 Automatic Text Summarization
No ratings yet
21 Automatic Text Summarization
1 page
Paper 3
No ratings yet
Paper 3
3 pages
Current Trends and Advances in Extractive Text Summarization A Comprehensive Review
No ratings yet
Current Trends and Advances in Extractive Text Summarization A Comprehensive Review
17 pages
Full Report PDF
No ratings yet
Full Report PDF
67 pages
Automatic Text Summarization Using Natural Language Processing PDF
No ratings yet
Automatic Text Summarization Using Natural Language Processing PDF
54 pages
Green Energy
No ratings yet
Green Energy
5 pages
Text Summarization in Python With SpaCy Library
No ratings yet
Text Summarization in Python With SpaCy Library
10 pages
Text Summarization Using NLP
No ratings yet
Text Summarization Using NLP
6 pages
An Overview of Extractive Based Automati
No ratings yet
An Overview of Extractive Based Automati
12 pages
Review of Data-Driven Generative AI Models For Knowledge Extraction From Scientific Literature in Healthcare
No ratings yet
Review of Data-Driven Generative AI Models For Knowledge Extraction From Scientific Literature in Healthcare
20 pages
A Hybrid Approach For Text Summarization Using Semantic Latent Dirichlet Allocation and Sentence Concept Mapping With Transformer
No ratings yet
A Hybrid Approach For Text Summarization Using Semantic Latent Dirichlet Allocation and Sentence Concept Mapping With Transformer
10 pages
AI Text Summarization Report
No ratings yet
AI Text Summarization Report
43 pages
11461-Article Text-20356-1-10-20211106
No ratings yet
11461-Article Text-20356-1-10-20211106
5 pages
Synopsis Creation For Research Paper Using Text Summarization Models
No ratings yet
Synopsis Creation For Research Paper Using Text Summarization Models
5 pages
Comparative Analysis of Modern Text Summarization Techniques
No ratings yet
Comparative Analysis of Modern Text Summarization Techniques
16 pages
NLP Miniproject
No ratings yet
NLP Miniproject
8 pages
Deep Learning Interview Questions - Deep Learning Questions
No ratings yet
Deep Learning Interview Questions - Deep Learning Questions
21 pages
Abstractive Text Summarizer A Comparative Study On Dot Product Attention and Cosine Similarity
No ratings yet
Abstractive Text Summarizer A Comparative Study On Dot Product Attention and Cosine Similarity
8 pages
Module 7
No ratings yet
Module 7
44 pages
Text Summarisation and Document Understanding Report
No ratings yet
Text Summarisation and Document Understanding Report
50 pages
5 LS
No ratings yet
5 LS
6 pages
Data Representation For Deep Learning - Based Arabic Text Summarization Performance Using Python Results
No ratings yet
Data Representation For Deep Learning - Based Arabic Text Summarization Performance Using Python Results
18 pages
Deep Learning Powered Text Summarization Framework For Creating A Highly Accurate Summary
No ratings yet
Deep Learning Powered Text Summarization Framework For Creating A Highly Accurate Summary
19 pages
State of The Art Text - Summarisation
No ratings yet
State of The Art Text - Summarisation
15 pages
A Survey of Advances in Text Summarization Methods
No ratings yet
A Survey of Advances in Text Summarization Methods
5 pages
Text Summarization Using Natural Language Processing
No ratings yet
Text Summarization Using Natural Language Processing
5 pages
Event Driven Programing Lab - Lec
No ratings yet
Event Driven Programing Lab - Lec
21 pages
Implementation of NLP Based Automatic Text Summarization Using Spacy
No ratings yet
Implementation of NLP Based Automatic Text Summarization Using Spacy
15 pages
Automatic Text Recognisation
No ratings yet
Automatic Text Recognisation
4 pages
Natural Language Processing For Automatic Text Summarization
No ratings yet
Natural Language Processing For Automatic Text Summarization
14 pages
Research Paper Summer Izer
No ratings yet
Research Paper Summer Izer
6 pages
Recent Approaches For Text Summarization
No ratings yet
Recent Approaches For Text Summarization
13 pages
Biomedical Text Summarization Using Conditional Generative Adversarial Network (CGAN)
No ratings yet
Biomedical Text Summarization Using Conditional Generative Adversarial Network (CGAN)
12 pages
Project File
No ratings yet
Project File
23 pages
Text Summarization
No ratings yet
Text Summarization
6 pages
Ir Case Study
No ratings yet
Ir Case Study
8 pages
Data Analyst Intern Resume Guide
100% (1)
Data Analyst Intern Resume Guide
4 pages
Applied Sciences: Abstractive vs. Extractive Summarization: An Experimental Review
No ratings yet
Applied Sciences: Abstractive vs. Extractive Summarization: An Experimental Review
20 pages
Irsw Project
No ratings yet
Irsw Project
8 pages
AI PPT Project to-Text-Summarization
No ratings yet
AI PPT Project to-Text-Summarization
10 pages
10 Objective Questions On AI
No ratings yet
10 Objective Questions On AI
2 pages
Abstractive Summarization Insights
No ratings yet
Abstractive Summarization Insights
38 pages
Advanced Text Summarization Techniques: Integrating RNNS, Transformers, and Pca For Enhanced Performance
No ratings yet
Advanced Text Summarization Techniques: Integrating RNNS, Transformers, and Pca For Enhanced Performance
8 pages
Extractive Text Summarization Using Word Frequency
No ratings yet
Extractive Text Summarization Using Word Frequency
6 pages
1331 4786 1 PB
No ratings yet
1331 4786 1 PB
14 pages
Operating
No ratings yet
Operating
3 pages
8921-Article Text-15992-1-10-20210614
No ratings yet
8921-Article Text-15992-1-10-20210614
7 pages
FALLSEM2024-25 BCSE409L TH VL2024250101879 2024-11-14 Reference-Material-I
No ratings yet
FALLSEM2024-25 BCSE409L TH VL2024250101879 2024-11-14 Reference-Material-I
13 pages
ECT386 - Ktu Qbank
No ratings yet
ECT386 - Ktu Qbank
10 pages
Text Summarization with NLP
No ratings yet
Text Summarization with NLP
14 pages
Text Summarization - Articles - Weights & Biases
No ratings yet
Text Summarization - Articles - Weights & Biases
16 pages
TC6 PROJECT SYNOPSIS KrishShetty VedantLandge 231106 101402
No ratings yet
TC6 PROJECT SYNOPSIS KrishShetty VedantLandge 231106 101402
13 pages
p78 Domingos
No ratings yet
p78 Domingos
10 pages
Rane, Govilkar - 2019 - Recent Trends in Deep Learning Based Abstractive Text Summarization-Annotated
No ratings yet
Rane, Govilkar - 2019 - Recent Trends in Deep Learning Based Abstractive Text Summarization-Annotated
8 pages
Paper Work
No ratings yet
Paper Work
12 pages
Concepts and Techniques: Data Mining
No ratings yet
Concepts and Techniques: Data Mining
101 pages
Data Driven Technologies and Artificial Intelligence in Supply Chain (Mahesh Chand, Vineet Jain, Puneeta Ajmera) - 1
No ratings yet
Data Driven Technologies and Artificial Intelligence in Supply Chain (Mahesh Chand, Vineet Jain, Puneeta Ajmera) - 1
291 pages
Mini Project Report
No ratings yet
Mini Project Report
26 pages
Capstone Chapter 1 3
No ratings yet
Capstone Chapter 1 3
21 pages
Automatic Text Summarization Using Natural Language Processing
No ratings yet
Automatic Text Summarization Using Natural Language Processing
54 pages
Machine Learning for Text Summarization
No ratings yet
Machine Learning for Text Summarization
56 pages
A Study On Software Effort Prediction Using Machine Learning Techniques
No ratings yet
A Study On Software Effort Prediction Using Machine Learning Techniques
15 pages
4 Months Nasscom - SuprMentr Internship 2025
No ratings yet
4 Months Nasscom - SuprMentr Internship 2025
8 pages
Unit 3 CRM
No ratings yet
Unit 3 CRM
18 pages
Unit 4 DNLP
No ratings yet
Unit 4 DNLP
52 pages
Machine Learning Based Solar Photovoltaic Power Forecasting A Review and Comparison
No ratings yet
Machine Learning Based Solar Photovoltaic Power Forecasting A Review and Comparison
27 pages
Lec 1
No ratings yet
Lec 1
43 pages
Pract 2
No ratings yet
Pract 2
4 pages
Machine Learning's Role in AI
No ratings yet
Machine Learning's Role in AI
10 pages
Power Plays: Unleashing Machine Learning Magic in Smart Grids
No ratings yet
Power Plays: Unleashing Machine Learning Magic in Smart Grids
16 pages
Algorithmic Human Resource Management: Synthesizing Developments and Cross-Disciplinary Insights On Digital HRM
No ratings yet
Algorithmic Human Resource Management: Synthesizing Developments and Cross-Disciplinary Insights On Digital HRM
19 pages
Retele Neuronale Convolutionale
No ratings yet
Retele Neuronale Convolutionale
60 pages
Syllabus FinTech 21 22 4Y
No ratings yet
Syllabus FinTech 21 22 4Y
14 pages
AI - Human Computer Interaction Quiz - June 2024
No ratings yet
AI - Human Computer Interaction Quiz - June 2024
14 pages
Chiang Mai PM2.5 & PM10 Prediction Models
No ratings yet
Chiang Mai PM2.5 & PM10 Prediction Models
4 pages
Roadmap:: Six Months To Machine Learning
No ratings yet
Roadmap:: Six Months To Machine Learning
22 pages
Unit 2 CRM
No ratings yet
Unit 2 CRM
9 pages
Preprints202405 1285 v1
No ratings yet
Preprints202405 1285 v1
20 pages
Unit 5 DIS
No ratings yet
Unit 5 DIS
8 pages
Unit 4 DIS
No ratings yet
Unit 4 DIS
8 pages
Case Study-AD21063
No ratings yet
Case Study-AD21063
8 pages
Unit 6 DIS
No ratings yet
Unit 6 DIS
7 pages
ML Laboratory Lesson Plan-BISL607
No ratings yet
ML Laboratory Lesson Plan-BISL607
7 pages
(2024 Issue) ARDA - JOURNAL - 17223 - AL
No ratings yet
(2024 Issue) ARDA - JOURNAL - 17223 - AL
6 pages
Group - 11
No ratings yet
Group - 11
2 pages
Tamr: Unifying Hadoop Data Lakes
No ratings yet
Tamr: Unifying Hadoop Data Lakes
3 pages
Speech Emotion Recognition Insights
No ratings yet
Speech Emotion Recognition Insights
4 pages
Multi Label Classification For Emotion Analysis of Autism Spectrum Disorder Children Using Deep Neural Networks
No ratings yet
Multi Label Classification For Emotion Analysis of Autism Spectrum Disorder Children Using Deep Neural Networks
5 pages
Data Science Case Study Options 1.0
No ratings yet
Data Science Case Study Options 1.0
2 pages

DNLP ABL Project

Uploaded by

DNLP ABL Project

Uploaded by

A

Project Based Learning Report

“Develop text summarization tool by using extractive

Artificial Intelligence & Data Science

Under the guidance of

Department of Emerging Technologies

S. B. Jain Institute of Technology,

Develop text summarization tool by using extractive summarization techniques.

1. Develop an Extractive Summarization Model: Design and implement an extractive text

• Types of Text Summarization

1) Construction of an intermediate representation of the input text

Topic representation and indicator representation are examples of representation-based methods. To

3) Selection of a summary comprising several sentences

➢ Utilizing the TextRank Algorithm for Extractive Text Summarization:

The implementation of TextRank offers a spaCy pipeline as an additional feature. SpaCy is an

!pip install spacy

To install the english laguage dependency

!python3 -m spacy download en_core_web_lg

!pip install pytextrank

print('Original Document Size:',len(example_text))

for sent in doc._.textrank.summary(limit_phrases=2, limit_sentences=2):

3 Rajesh Umesh Dandwe

4 Tanu Prakash Patil

Signature of Course In-Charge

You might also like