Define three summarization tests: Single, Multiple, Query-Focused
Single Summarization Test:
In a single summarization test, the system is evaluated based on its ability to generate a concise and
coherent summary from a single source document. The goal is to produce a condensed version of the
input document that retains the most important information while minimizing redundancy and
irrelevant details.
Example:
Input Document:
"The COVID-19 pandemic, caused by the novel coronavirus, has had a significant impact on global
health and economies. Governments worldwide implemented various measures such as lockdowns,
social distancing, and mass vaccination campaigns to curb the spread of the virus. Despite these
efforts, the pandemic has led to widespread illness, economic disruption, and loss of life."
Summary Generated by the System:
"The COVID-19 pandemic, caused by the novel coronavirus, has resulted in global health and
economic crises. Governments implemented measures like lockdowns and vaccination campaigns, but
widespread illness and economic disruption persist."
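To make this concrete, a single summarization test can be scored automatically by comparing the system summary against a human-written reference. The sketch below assumes a hypothetical reference summary and uses a simple ROUGE-1-style unigram overlap in plain Python; a real evaluation would typically use an established library such as rouge-score.

from collections import Counter

def rouge1_f1(reference: str, candidate: str) -> float:
    """Simple ROUGE-1 F1 score (unigram overlap) between a reference
    summary and a system-generated summary."""
    ref_tokens = reference.lower().split()
    cand_tokens = candidate.lower().split()
    overlap = sum((Counter(ref_tokens) & Counter(cand_tokens)).values())
    if not ref_tokens or not cand_tokens or overlap == 0:
        return 0.0
    precision = overlap / len(cand_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)

# Hypothetical human reference summary for this test case.
reference_summary = ("The COVID-19 pandemic caused global health and economic crises; "
                     "government measures could not prevent widespread illness and disruption.")
system_summary = ("The COVID-19 pandemic, caused by the novel coronavirus, has resulted in "
                  "global health and economic crises. Governments implemented measures like "
                  "lockdowns and vaccination campaigns, but widespread illness and economic "
                  "disruption persist.")

print(f"ROUGE-1 F1: {rouge1_f1(reference_summary, system_summary):.2f}")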
Multiple Summarization Test:
In a multiple summarization test, the system is evaluated based on its ability to generate summaries
from multiple source documents on the same topic. This type of test assesses the system's ability to
synthesize information from various sources and produce comprehensive summaries that capture
different perspectives or aspects of the topic.
Example:
Source Document 1:
"The COVID-19 pandemic has overwhelmed healthcare systems worldwide, leading to shortages of
medical supplies and personnel. Hospitals are struggling to accommodate the influx of patients, and
frontline workers are facing unprecedented challenges."
Source Document 2:
"Amid the pandemic, scientific research into COVID-19 vaccines has progressed rapidly. Several
vaccines have been developed and distributed globally, offering hope for controlling the spread of the
virus and returning to normalcy."
Summary Generated by the System:
"The COVID-19 pandemic has strained healthcare systems globally, causing shortages of medical
supplies and personnel. Meanwhile, rapid progress in vaccine development offers hope for controlling
the spread of the virus and returning to normalcy."
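One rough way to exercise a multiple summarization test is with a naive extractive baseline that pools word statistics across all source documents and picks the most representative sentence from each one. This is only a sketch to illustrate the test setup, not a production summarizer:

import re
from collections import Counter

def extractive_multi_doc_summary(documents: list[str]) -> str:
    """Naive multi-document baseline: from each source, pick the sentence
    whose words are most frequent across all documents combined."""
    all_words = Counter(re.findall(r"\w+", " ".join(documents).lower()))
    summary_sentences = []
    for doc in documents:
        sentences = re.split(r"(?<=[.!?])\s+", doc.strip())
        best = max(sentences,
                   key=lambda s: sum(all_words[w] for w in re.findall(r"\w+", s.lower())))
        summary_sentences.append(best)
    return " ".join(summary_sentences)

docs = [
    "The COVID-19 pandemic has overwhelmed healthcare systems worldwide, leading to "
    "shortages of medical supplies and personnel.",
    "Amid the pandemic, scientific research into COVID-19 vaccines has progressed rapidly, "
    "offering hope for controlling the spread of the virus.",
]
print(extractive_multi_doc_summary(docs))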
Query Focused Summarization Test:
In a query-focused summarization test, the system is evaluated based on its ability to generate
summaries that specifically address a given query or question. The system must extract relevant
information from the source documents to provide a concise and informative response to the query.
Example Query: "What are the measures taken by governments to combat the COVID-19 pandemic?"
Source Document:
"Governments worldwide have implemented various measures to combat the COVID-19 pandemic,
including lockdowns, social distancing guidelines, mask mandates, and mass vaccination campaigns.
These measures aim to slow the spread of the virus, protect public health, and reduce the burden on
healthcare systems."
Summary Generated by the System:
"To combat the COVID-19 pandemic, governments have implemented measures such as lockdowns,
social distancing guidelines, mask mandates, and mass vaccination campaigns. These efforts aim to
slow the spread of the virus and protect public health."
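A query-focused test can likewise be sketched with a simple baseline that ranks sentences by word overlap with the query and returns the top-ranked ones. Stop-word removal and real retrieval are omitted here for brevity:

import re

def query_focused_summary(query: str, document: str, max_sentences: int = 2) -> str:
    """Rank sentences by how many query words they contain and
    return the top-ranked sentences as the summary."""
    query_words = set(re.findall(r"\w+", query.lower()))
    sentences = re.split(r"(?<=[.!?])\s+", document.strip())
    ranked = sorted(sentences,
                    key=lambda s: len(query_words & set(re.findall(r"\w+", s.lower()))),
                    reverse=True)
    return " ".join(ranked[:max_sentences])

query = "What are the measures taken by governments to combat the COVID-19 pandemic?"
document = ("Governments worldwide have implemented various measures to combat the COVID-19 "
            "pandemic, including lockdowns, social distancing guidelines, mask mandates, and "
            "mass vaccination campaigns. These measures aim to slow the spread of the virus, "
            "protect public health, and reduce the burden on healthcare systems.")
print(query_focused_summary(query, document))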
What are the issues with Machine Translation? (with diagram)
What is Machine Translation?
Machine translation is a sub-field of computational linguistics that focuses on developing systems
capable of automatically translating text or speech from one language to another. In Natural Language
Processing (NLP), the goal of machine translation is to produce translations that are not only
grammatically correct but also convey the meaning of the original content accurately.
Machine Translation Challenges
Despite its advantages, MT has certain problems, many of which can only be fully overcome by
hiring a human translator. Keep them in mind before choosing machine translation, because
unresolved translation problems quickly become business problems:
● Cost vs. quality. Cost can also be a negative factor: you should understand what level of
quality you actually get with a free or cheap option.
● Speed vs. quality. Similarly, if a translation is produced very quickly, there is a reasonable
expectation that it will not be of high quality. Quality work takes more time, care, and
attention.
● Lack of context. An MT system can translate the same term differently when it appears in
different sections of a document. A human translator, by contrast, keeps terminology
consistent throughout a project, which is crucial so the reader is not confused when the same
thing is referred to.
● Data security. How can you be sure that the information you enter into free MT solutions is
secure? Such software is open to everybody and its engines run on external servers, so the
translation system vendor should be chosen very carefully.
● Formatting. Complex formatting can pose a severe problem for MT: text may be segmented in
the middle of sentences, leaving the system without the context it needs.
● Lack of creativity. The art of language involves a great deal of creativity, which matters when
communicating with clients in the global market. Human translators are more creative with
the subject matter at hand and can deliver solutions that resonate better with business partners
or customers.
● Linguistic Complexity: Languages vary greatly in terms of syntax, grammar, idiomatic
expressions, and cultural nuances. Translating between languages with vastly different
structures can lead to errors and loss of meaning.
● Ambiguity: Many words and phrases have multiple meanings depending on context, and
translating them accurately requires understanding the context. Machine Translation systems
often struggle with disambiguation, leading to incorrect translations.
● Domain Specificity: Translating specialized or technical content accurately is challenging
because Machine Translation systems may lack domain-specific knowledge and vocabulary.
● Rare and Low-Resource Languages: Machine Translation performance tends to be lower for
languages with less available training data, fewer resources, and fewer linguistic experts.
● Context Preservation: Translating text often requires preserving the context, tone, and style of
the original content, which can be difficult for Machine Translation systems to achieve
consistently.
● Post-Editing Overhead: Translations generated by Machine Translation systems often require
human post-editing to correct errors and improve quality, increasing the overall time and cost.
● Quality vs. Speed Tradeoff: Balancing translation quality with processing speed is a
challenge, especially for real-time or high-volume translation tasks.
Alternatives for Machine Translation
For most companies, the cost and time required to add just one new language to a product are
substantial, often measured in large sums of money and years, because the addition spans UI
applications, documentation, design assets, SEO localization, and more. For example, a single license
for SDL Trados Studio (one of the most popular CAT tools) can cost thousands of euros; it is useful
for only one individual, and its customizations are limited.
                +-----------------------------+
                |     Issues with Machine     |
                |         Translation         |
                +-----------------------------+
                       |              |
                       v              v
        +------------------+   +------------------+
        |    Linguistic    |   |  Domain Specific |
        |    Complexity    |   |    Challenges    |
        +------------------+   +------------------+
                 |                       |
                 v                       v
        +------------------+   +------------------+
        |     Ambiguity    |   |   Rare and Low-  |
        |                  |   |     Resource     |
        |                  |   |     Languages    |
        +------------------+   +------------------+
                 |                       |
                 v                       v
        +------------------+   +------------------+
        |      Context     |   |   Post-Editing   |
        |   Preservation   |   |     Overhead     |
        +------------------+   +------------------+
                 |
                 v
        +------------------+
        | Quality vs. Speed|
        |     Tradeoff     |
        +------------------+
Source Language --> Machine Translation System --> Target Language
                               |
                               v
     Issues: Inaccuracy * Context Ambiguity * Limited Vocabulary * Cultural Nuances
Define Machine Translation Evaluation
Machine Translation (MT) Evaluation refers to the process of assessing the quality and performance
of Machine Translation systems. It involves comparing the output translations generated by the MT
system against reference translations (i.e., human-generated translations or gold-standard translations)
to measure accuracy, fluency, and adequacy. MT evaluation is crucial for identifying strengths and
weaknesses of MT systems, guiding system improvements, and ensuring translations meet desired
quality standards. There are several evaluation metrics and methodologies used in MT evaluation,
including manual evaluation by human judges, automatic evaluation metrics, and human
judgment-based evaluations.
Manual Evaluation:
Manual evaluation involves human judges assessing the quality of translations generated by MT
systems. Judges compare the MT output against reference translations and assign scores based on
criteria such as accuracy, fluency, and adequacy. This process can be time-consuming and subjective
but provides detailed insights into the translation quality.
Example:
Let's consider an MT system translating a sentence from English to French. The reference translation
by a human translator is "The weather is nice today." The MT system outputs "Le temps est bon
aujourd'hui." Human judges would evaluate this translation based on its accuracy (whether it captures
the meaning of the original sentence), fluency (whether it reads naturally), and adequacy (whether it
conveys the intended message effectively).
Automatic Evaluation Metrics:
Automatic evaluation metrics use computational algorithms to assess the quality of MT output. These
metrics compare the MT output against reference translations and assign scores based on various
criteria such as word overlap, semantic similarity, and syntactic correctness. Common automatic
evaluation metrics include BLEU (Bilingual Evaluation Understudy), METEOR (Metric for
Evaluation of Translation with Explicit ORdering), TER (Translation Edit Rate), and ROUGE
(Recall-Oriented Understudy for Gisting Evaluation).
Example:
Using the BLEU metric, the MT output "Le temps est bon aujourd'hui" is compared against one or
more reference translations in the same target language (for example, "Le temps est beau
aujourd'hui"). BLEU calculates a score based on the n-gram overlap between the MT output and the
reference translations, providing a quantitative measure of translation quality.
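As a rough illustration, sentence-level BLEU can be computed with NLTK, assuming the French reference "Le temps est beau aujourd'hui" as the gold translation (the nltk package must be installed; smoothing is applied because short sentences often lack higher-order n-gram matches):

from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

# Reference translation and MT output, both tokenized in the target language (French).
reference = "le temps est beau aujourd'hui".split()
hypothesis = "le temps est bon aujourd'hui".split()

# BLEU measures n-gram overlap between the hypothesis and the reference(s);
# smoothing prevents a zero score when some n-gram orders have no matches.
score = sentence_bleu([reference], hypothesis,
                      smoothing_function=SmoothingFunction().method1)
print(f"Sentence BLEU: {score:.2f}")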
Human Judgment-Based Evaluation:
Human judgment-based evaluation involves collecting feedback from human evaluators on the quality
of MT translations. Evaluators may be asked to rate translations on a scale (e.g., 1 to 5) based on
criteria such as fluency, adequacy, and overall quality. This approach combines the advantages of both
manual evaluation and automatic evaluation metrics while minimizing subjectivity.
Example:
Human evaluators are presented with several translations of the same sentence produced by different
MT systems and asked to rate each translation on fluency, adequacy, and overall quality. Their ratings
are then aggregated to assess the performance of the MT systems.
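A tiny sketch of how such ratings might be aggregated; the systems, criteria, and 1-5 scores below are made up purely for illustration:

from statistics import mean

# Hypothetical 1-5 ratings from three evaluators for two MT systems,
# on fluency, adequacy, and overall quality.
ratings = {
    "System A": {"fluency": [4, 5, 4], "adequacy": [4, 4, 5], "overall": [4, 5, 4]},
    "System B": {"fluency": [3, 3, 2], "adequacy": [2, 3, 3], "overall": [3, 2, 3]},
}

for system, criteria in ratings.items():
    averages = {criterion: mean(scores) for criterion, scores in criteria.items()}
    print(system, averages)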
Example:
Suppose we have an English sentence: "The cat is sitting on the mat."
And we have two machine translation outputs from different systems:
System A: "The cat is sitting on the carpet."
System B: "A cat sits on the rug."
Now, let's say we have a reference translation by a human translator:
Reference: "The cat is sitting on the mat."
To evaluate the translations generated by System A and System B, we'll use the BLEU metric. Here's
how it works:
● N-gram Matching: BLEU calculates the precision of n-grams (sequences of n words) in the
machine translation output compared to the reference translation. It considers unigrams
(single words), bigrams (pairs of words), trigrams (triplets of words), and so on.
● Brevity Penalty: BLEU penalizes translations that are shorter than the reference translation to
discourage overly concise translations.
Let's calculate the BLEU score (using unigram, bigram, and trigram precisions with equal weights) for
both System A and System B:
For System A:
● Unigram precision: 6/7 (six of the seven words in the translation appear in the reference)
● Bigram precision: 5/6 (five of the six bigrams in the translation appear in the reference)
● Trigram precision: 4/5 (four of the five trigrams in the translation appear in the reference)
● Brevity penalty: 1.00 (the translation is the same length as the reference)
BLEU score for System A: ≈ 0.83 (the geometric mean of the n-gram precisions,
(6/7 × 5/6 × 4/5)^(1/3) ≈ 0.83, multiplied by the brevity penalty of 1.00)
For System B:
● Unigram precision: 3/6 (only "cat", "on", and "the" appear in the reference)
● Bigram precision: 1/5 (only "on the" appears in the reference)
● Trigram precision: 0/4 (no trigram in the translation appears in the reference)
● Brevity penalty: ≈ 0.85 (the translation is six words long against a seven-word reference, so
exp(1 - 7/6) ≈ 0.85)
Because the trigram precision is zero, the unsmoothed BLEU score for System B is 0; even with
standard smoothing it remains very low.
Interpretation:
● System A has a much higher BLEU score (≈ 0.83) than System B (≈ 0), indicating that
System A's translation is far closer to the reference translation in terms of n-gram matching.
● According to the BLEU metric, System A's translation is of higher quality.
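The worked example above can be reproduced with a short script. This is a simplified sentence-level BLEU that uses unigram-to-trigram precisions with equal weights and no smoothing; real implementations (e.g., NLTK or sacrebleu) typically use up to 4-grams and apply smoothing:

import math
from collections import Counter

def ngram_precision(reference: list[str], candidate: list[str], n: int) -> tuple[int, int]:
    """Return (clipped matches, total candidate n-grams) for n-gram order n."""
    ref_ngrams = Counter(tuple(reference[i:i + n]) for i in range(len(reference) - n + 1))
    cand_ngrams = Counter(tuple(candidate[i:i + n]) for i in range(len(candidate) - n + 1))
    matches = sum((ref_ngrams & cand_ngrams).values())
    return matches, sum(cand_ngrams.values())

def simple_bleu(reference: str, candidate: str, max_n: int = 3) -> float:
    # Naive whitespace tokenization; punctuation stays attached to the final word,
    # which does not change the counts in this example.
    ref, cand = reference.lower().split(), candidate.lower().split()
    precisions = []
    for n in range(1, max_n + 1):
        matches, total = ngram_precision(ref, cand, n)
        precisions.append(matches / total if total else 0.0)
    if min(precisions) == 0.0:
        return 0.0  # unsmoothed BLEU is zero if any n-gram precision is zero
    brevity_penalty = 1.0 if len(cand) >= len(ref) else math.exp(1 - len(ref) / len(cand))
    return brevity_penalty * math.exp(sum(math.log(p) for p in precisions) / max_n)

reference = "The cat is sitting on the mat."
print(round(simple_bleu(reference, "The cat is sitting on the carpet."), 2))  # ≈ 0.83
print(round(simple_bleu(reference, "A cat sits on the rug."), 2))             # 0.0 (no trigram matches)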