0% found this document useful (0 votes)

62 views8 pages

Irsw Project

This document describes a project to build a text summarization system using natural language processing and machine learning. It discusses extractive and abstractive summarization approaches and describes implementing the TextRank algorithm for extractive summarization. The dataset contains product descriptions and the task is to summarize them into shorter versions while maintaining context. Finally, the generated summaries are added to a dataframe and converted to a CSV file.

Uploaded by

kartike tiwari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

62 views8 pages

Irsw Project

Uploaded by

kartike tiwari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Text Summarizer

Synopsis

Submitted by:

Shwetank Verma (19103209)

Ishaan Raj Mishra

(19103210)

Varun Mittal (19103266)

Amritansh Gupta
(19103305)

Department of CSE/IT

JAYPEE INSTITUTE OF INFORMATION TECHNOLOGY

Table of Contents

Page No.

Abstract i

Introduction ii

Background Study iii

Flowchart and Dataset Description iv

References v
ABSTRACT

The amount of text data available has increased dramatically in recent years from a variety of sources.
This large volume of literature has a wealth of information and knowledge that must be adequately
summarized to be useful.

One of the most difficult NLP tasks is summarization, which is the process of generating a shorter
version of a piece of text while keeping critical context information.

The goal is to provide a condensed representation of an input text that captures the original text's basic
meaning.

To produce a condensed version, most successful summarizing systems use extractive algorithms that
crop out and stitch together Chunks of the text.

i
INTRODUCTION

Before producing the required summary texts, machine learning algorithms can be trained to interpret
documents and identify the areas that carry key facts and information.

Summarization improves the readability of publications, cuts down on time spent searching for
information, and allows for more information to be crammed into a given space.

We will be working on extraction-based summarization in this project.

The process of extractive text summarising entails extracting essential terms from the original document
and combining them to create a summary.

Extractive summarization is a type of machine learning that includes weighting the most important parts
of sentences and using the findings to construct summaries.

To determine the weights of the phrases, several algorithms and approaches can be employed to rank them
according to their relevance and resemblance to one another, and then link them to create a summary.

Even though the outcomes of extraction-based summarization aren't always grammatically correct, we
nevertheless get a concise and valuable piece of data.

ii
BACKGROUND STUDY

RESEARCH PAPER 1

TITLE: Analytical study of Text Summarization Techniques

AUTHOR: Dr. Pooja Raundale, Himanshu Shekhar

PUBLISHER: IEEE PUBLISHED IN: October 2021

SUMMARY: They implemented and compared the performance of various automatic summarization
methods to gain insight into how long the methods take to implement and how accurate and human-like
the generated summaries are.

Extractive techniques (TF-IDF and TextRank) achieve very high scores for ROUGE evaluation.

Abstractive techniques like Seq2Seq with Attention and Pointer-Generator score a lot lower as compared
to the above two since they generate human-like summaries that appear to be handwritten.

RESEARCH PAPER 2

TITLE: Extractive Text Summarization Using Sentence Ranking

AUTHOR: J.N. Madhuri, Ganesh Kumar R.

PUBLISHER: IEEE PUBLISHED IN: August 2019

SUMMARY: In this work, they proposed extractive-based text summarization using a statistical novel
approach based on the sentences ranking the sentences selected by the summarizer. The sentences which
are extracted are produced as a summarized text.

The sentences are sorted based on their weighted frequency ranks from highest rank to lowest. The
sentences are arranged in descending order. The summarizer will extract the high-weighted frequency
sentences to find a summary of a document.

iii
FLOWCHART REPRESENTATION

DATASET DESCRIPTION

It contains numerous paragraphs describing various types of medications available and how to consumethem
including the benefits and aftereffects of the medication. It also consists of the doctor’s directionson when to
consume them based on various situations and what to avoid while consuming them

iv
DESCRIPTION OF THE PROJECT

In this project, Automatic text summarization is summarizing the given paragraph using natural language
processing and machine learning. There has been an explosion in the amount of text data from a variety
of sources. This volume of text is an invaluable source of information and knowledge which needs to be
effectively summarized to be useful. In this review, the main approaches to automatic text summarization
are described.

The dataset used in this project contains long descriptions of products. The task is to make a text
summarizer that takes these descriptions as input and summarizes them into shorter versions without
losing the context. The length of the summary will also be adjustable by the user.

There are two general approaches to automatic summarization: Extraction and Abstraction.

Extractive Summarization: These methods rely on extracting several parts, such as phrases and sentences,
from a piece of text and stacking them together to create a summary. Therefore, identifying the right
sentences for summarization is of utmost importance in an extractive method.

Abstractive Summarization: These methods use advanced NLP techniques to generate an entirely new
summary. Some parts of this summary may not even appear in the original text. Such a summary might
include verbal innovations. Research has focused primarily on extractive methods, which are appropriate
for image collection and video summarization.

In this Jupyter notebook, the TextRank algorithm for extractive text summarization is implemented using
Google's PageRank search algorithm to generate correlations among sentences.

Finally, all the generated summary for each paragraph is added to the Dataframe and then the Dataframe
is converted to a CSV file.

v
REFERENCES

1. Luís Gonçalves , Automatic Text Summarization with Machine Learning, Apr

12, 2020

https://medium.com/luisfredgs/automatic-text-summarization- with-machine-
learning-an-overview-68ded5717a25

2. Shrivarsheni, Text Summarization Approaches for NLP, Oct 26 2020

https://www.machinelearningplus.com/nlp/text-summarization- approaches-nlp-
example/

3. Aravindpai, Comprehensive Guide to Text Summarization using Deep Learning

in Python, June 10 2019

https://www.analyticsvidhya.com/blog/2019/06/comprehensive

-guide-text-summarization-using-deep-learning-python/

4. Alfrick Opidi, Gentle Introduction to Text Summarization in Machine

Learning, Apr 15 2019

https://blog.floydhub.com/gentle-introduction-to-text- summarization-
in-machine-learning/

Extractive Text Summarization: Motilal Nehru National Institute of Technology Allahabad
No ratings yet
Extractive Text Summarization: Motilal Nehru National Institute of Technology Allahabad
29 pages
TC6 PROJECT SYNOPSIS KrishShetty VedantLandge 231106 101402
No ratings yet
TC6 PROJECT SYNOPSIS KrishShetty VedantLandge 231106 101402
13 pages
Synopsis Creation For Research Paper Using Text Summarization Models
No ratings yet
Synopsis Creation For Research Paper Using Text Summarization Models
5 pages
Implementation of NLP Based Automatic Text Summarization Using Spacy
No ratings yet
Implementation of NLP Based Automatic Text Summarization Using Spacy
15 pages
Research Paper 8
No ratings yet
Research Paper 8
4 pages
Project File
No ratings yet
Project File
23 pages
Automating Document Summarization
No ratings yet
Automating Document Summarization
12 pages
Abstractive Text Summarization Using Transformer Based Approach
No ratings yet
Abstractive Text Summarization Using Transformer Based Approach
10 pages
Automatic Text Summarization Using Natural Language Processing PDF
No ratings yet
Automatic Text Summarization Using Natural Language Processing PDF
54 pages
Automatic Text Summarization Using Natural Language Processing
No ratings yet
Automatic Text Summarization Using Natural Language Processing
54 pages
DNLP ABL Project
No ratings yet
DNLP ABL Project
7 pages
Text Summarization with NLP
No ratings yet
Text Summarization with NLP
14 pages
Research Final
No ratings yet
Research Final
6 pages
(Group-12) NLP Project File
No ratings yet
(Group-12) NLP Project File
23 pages
Research Paper 7
No ratings yet
Research Paper 7
8 pages
IEEE Conference Template 3
No ratings yet
IEEE Conference Template 3
4 pages
Final Year
No ratings yet
Final Year
31 pages
An Extractive Approach For English Text
No ratings yet
An Extractive Approach For English Text
11 pages
Text Summarisation and Document Understanding Report
No ratings yet
Text Summarisation and Document Understanding Report
50 pages
Comparative Analysis of Modern Text Summarization Techniques
No ratings yet
Comparative Analysis of Modern Text Summarization Techniques
16 pages
Ir Case Study
No ratings yet
Ir Case Study
8 pages
IEEE Conference Template 3 PDF
No ratings yet
IEEE Conference Template 3 PDF
4 pages
State of The Art Text - Summarisation
No ratings yet
State of The Art Text - Summarisation
15 pages
FALLSEM2024-25 BCSE409L TH VL2024250101879 2024-11-14 Reference-Material-I
No ratings yet
FALLSEM2024-25 BCSE409L TH VL2024250101879 2024-11-14 Reference-Material-I
13 pages
IEEE Conference Template 1 PDF
No ratings yet
IEEE Conference Template 1 PDF
3 pages
NLP Text Summarization Techniques
100% (1)
NLP Text Summarization Techniques
8 pages
5 LS
No ratings yet
5 LS
6 pages
Unravel News: An Efficient Summarization Approach: Ankan Saha Abdullah Al Shafi
No ratings yet
Unravel News: An Efficient Summarization Approach: Ankan Saha Abdullah Al Shafi
6 pages
Analysis of Abstractive and Extractive Summarizati
No ratings yet
Analysis of Abstractive and Extractive Summarizati
11 pages
For MP
No ratings yet
For MP
13 pages
Green Energy
No ratings yet
Green Energy
5 pages
NLP Case Study
No ratings yet
NLP Case Study
5 pages
AI Text Summarization Report
No ratings yet
AI Text Summarization Report
43 pages
Rane, Govilkar - 2019 - Recent Trends in Deep Learning Based Abstractive Text Summarization-Annotated
No ratings yet
Rane, Govilkar - 2019 - Recent Trends in Deep Learning Based Abstractive Text Summarization-Annotated
8 pages
NLP Text Summarization Survey
No ratings yet
NLP Text Summarization Survey
23 pages
Sample Research
No ratings yet
Sample Research
29 pages
Text Summarization Using NLP
No ratings yet
Text Summarization Using NLP
6 pages
Paper 1
No ratings yet
Paper 1
23 pages
Abstractive Summarization Insights
No ratings yet
Abstractive Summarization Insights
38 pages
A Domain-Specific Automatic Text Summarization Using Fuzzy Logic
No ratings yet
A Domain-Specific Automatic Text Summarization Using Fuzzy Logic
13 pages
Paper 3
No ratings yet
Paper 3
3 pages
Condensed RP
No ratings yet
Condensed RP
5 pages
Paper Work
No ratings yet
Paper Work
12 pages
Technical Seminar Report-6607
No ratings yet
Technical Seminar Report-6607
11 pages
Text Summarization Using Natural Language Processing
No ratings yet
Text Summarization Using Natural Language Processing
5 pages
Research Paper Summarizer Using NLP Techniques
No ratings yet
Research Paper Summarizer Using NLP Techniques
9 pages
Malayalam 2
No ratings yet
Malayalam 2
4 pages
Text Summarizer Using NLP (Natural Language Processing) : © JUL 2022 - IRE Journals - Volume 6 Issue 1 - ISSN: 2456-8880
No ratings yet
Text Summarizer Using NLP (Natural Language Processing) : © JUL 2022 - IRE Journals - Volume 6 Issue 1 - ISSN: 2456-8880
6 pages
NLP Text Summarization Techniques
No ratings yet
NLP Text Summarization Techniques
21 pages
Types of Extractive Methods
No ratings yet
Types of Extractive Methods
22 pages
Automatic Text Summarization Using Text Rank Algorithm
No ratings yet
Automatic Text Summarization Using Text Rank Algorithm
6 pages
Text Summarisation Method in NLP
No ratings yet
Text Summarisation Method in NLP
38 pages
Viswajothi Technologies PR Ivate Limited: "Text Summarization Based On NLP"
67% (3)
Viswajothi Technologies PR Ivate Limited: "Text Summarization Based On NLP"
23 pages
Summerization Presentation
No ratings yet
Summerization Presentation
9 pages
NLP Based Automated Text Summarization and Translation A Comprehensive Analysis
No ratings yet
NLP Based Automated Text Summarization and Translation A Comprehensive Analysis
4 pages
American Uprising The Untold Story of Americas Largest Slave Revolt 1st Edition Daniel Rasmussen Instant Download
100% (2)
American Uprising The Untold Story of Americas Largest Slave Revolt 1st Edition Daniel Rasmussen Instant Download
39 pages
L3 - Substitution Cipher
No ratings yet
L3 - Substitution Cipher
22 pages
Lets Celebrate Diversity!: Actividad Stop Bullying (Día 2)
No ratings yet
Lets Celebrate Diversity!: Actividad Stop Bullying (Día 2)
5 pages
Arduino Motor Shield 2A
No ratings yet
Arduino Motor Shield 2A
6 pages
The Stolen Legacy Student's Name University Affiliation Course Number and Name Instructor Name Assignment Due Date
No ratings yet
The Stolen Legacy Student's Name University Affiliation Course Number and Name Instructor Name Assignment Due Date
6 pages
Advanced NX Meshing Techniques
No ratings yet
Advanced NX Meshing Techniques
22 pages
Phonetics Booklet - Key
No ratings yet
Phonetics Booklet - Key
11 pages
Who Are The Jews
No ratings yet
Who Are The Jews
17 pages
Ansys Fluent Text Command List
No ratings yet
Ansys Fluent Text Command List
582 pages
Understanding The Times
No ratings yet
Understanding The Times
21 pages
Another Side of Life
No ratings yet
Another Side of Life
960 pages
Practical Research 2
No ratings yet
Practical Research 2
13 pages
AI's Impact on Tech and Society
No ratings yet
AI's Impact on Tech and Society
8 pages
Error and Solution Ls Retail
No ratings yet
Error and Solution Ls Retail
10 pages
Grade Six Music Ornaments
No ratings yet
Grade Six Music Ornaments
4 pages
Free Modules 55 PDF
No ratings yet
Free Modules 55 PDF
13 pages
Scattering Theory
No ratings yet
Scattering Theory
1 page
Loading Data in +snowflake
No ratings yet
Loading Data in +snowflake
10 pages
ICT 204 - Lecture 4 Methods
No ratings yet
ICT 204 - Lecture 4 Methods
31 pages
Class 8 Grammar
No ratings yet
Class 8 Grammar
6 pages
Constructivist Pedagogy and Symbolism Vico Cassirer Piaget
No ratings yet
Constructivist Pedagogy and Symbolism Vico Cassirer Piaget
15 pages
REVIEW G Pratico and M V Van Pelt Basics
No ratings yet
REVIEW G Pratico and M V Van Pelt Basics
1 page
Soal UN Bahasa Inggris SMP Kelas IX Latihan 1
No ratings yet
Soal UN Bahasa Inggris SMP Kelas IX Latihan 1
4 pages
Tesla's TTPoE for AI Supercomputers
No ratings yet
Tesla's TTPoE for AI Supercomputers
23 pages
Proof The Quran Never Been Changed
0% (1)
Proof The Quran Never Been Changed
4 pages
English A1.1 Unit 3: World of Work
No ratings yet
English A1.1 Unit 3: World of Work
10 pages
Spreadsheet Evolution for Professionals
100% (6)
Spreadsheet Evolution for Professionals
18 pages
Literatures of South India Notes
No ratings yet
Literatures of South India Notes
3 pages
Cambridge Checkpoint Science Student's Book 1 Riley Peter Download
100% (2)
Cambridge Checkpoint Science Student's Book 1 Riley Peter Download
31 pages
Extra Grammar Exercises (Unit 3, Page 29) LESSON 1 The Simple Present Tense: Review
No ratings yet
Extra Grammar Exercises (Unit 3, Page 29) LESSON 1 The Simple Present Tense: Review
4 pages

Irsw Project

Uploaded by

Irsw Project

Uploaded by

Text Summarizer

Shwetank Verma (19103209)

Ishaan Raj Mishra

Varun Mittal (19103266)

JAYPEE INSTITUTE OF INFORMATION TECHNOLOGY

Background Study iii

Flowchart and Dataset Description iv

We will be working on extraction-based summarization in this project.

TITLE: Analytical study of Text Summarization Techniques

AUTHOR: Dr. Pooja Raundale, Himanshu Shekhar

PUBLISHER: IEEE PUBLISHED IN: October 2021

TITLE: Extractive Text Summarization Using Sentence Ranking

AUTHOR: J.N. Madhuri, Ganesh Kumar R.

PUBLISHER: IEEE PUBLISHED IN: August 2019

1. Luís Gonçalves , Automatic Text Summarization with Machine Learning, Apr

2. Shrivarsheni, Text Summarization Approaches for NLP, Oct 26 2020

3. Aravindpai, Comprehensive Guide to Text Summarization using Deep Learning

4. Alfrick Opidi, Gentle Introduction to Text Summarization in Machine

You might also like