Translation Theory

MACHINE TRANSLATION

CONCLUSION
The bilinguals showed superior performance on certain cognitive tasks.
Bilingualism resulted in greater "mental flexibility" and "abstract thought".
TYPES OF LEARNING SITUATIONS
Sequential Learning
Simultaneous Learning
Sequential Bilingualism
This occurs when a person
learns a second language
after they have already
begun to acquire their first
language.
Example: A Chinese child learning English after acquiring Chinese at home.
Four typical stages
1. Use the home language even when others don't understand.
2. Be silent, then use gestures.
3. Understand parts of the second language; produce abbreviated utterances.
4. Produce grammatical utterances in appropriate situations.
Simultaneous Learning
Learning two languages at the same
time, usually from birth
Ex: A child who grows up with a mother speaking English and a father speaking Spanish.
The Transfer Effect: Language Relationships
- Language Similarity
- Facilitation Across Different Languages
INTRODUCTION
The presentation on Machine
Translation (MT) begins by defining the
core concept: the use of computer
software to translate text or speech from
one language to another automatically.
MT is a sub-field of computational
linguistics that has evolved significantly
over the decades.
THE HISTORY OF MT
Before the 1940s: Some pioneering studies explored translation
automation.
1940s – mid-1960s: With the advent of the first computers,
several teams built operational MT systems. Approaches were
sometimes naïve but laid the groundwork for later
developments.
1990s: The emergence of Statistical Machine Translation (SMT)
based on large bilingual corpora, pioneered by IBM researchers
in the late 1980s. SMT became the basis of major systems like
Google Translate and Bing Translator.
Mid-2010s onward: Neural Machine Translation (NMT) using
deep learning revolutionized the field. Rising demand for online
translation brought MT back to the forefront of computational
linguistics.
EARLY DEVELOPMENTS IN MT
(STATISTICAL MODELS)
Before SMT: Rule-based MT (RBMT) – handcrafted grammar rules, limited scalability.
Emergence of SMT (late 1980s–1990s):
- Based on probability theory and bilingual corpora.
- Uses Bayes’ Theorem.
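Written out, the Bayes decomposition referred to above (the standard noisy-channel formulation, where f is the source sentence and e a candidate translation) is:

```latex
\hat{e} = \arg\max_{e} P(e \mid f)
        = \arg\max_{e} \frac{P(f \mid e)\, P(e)}{P(f)}
        = \arg\max_{e} \underbrace{P(f \mid e)}_{\text{translation model}} \; \underbrace{P(e)}_{\text{language model}}
```

The denominator P(f) drops out because it is constant over all candidates e.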
The Rise of Neural Machine Translation (NMT)
Limitations of SMT:
- Breaking sentences into small chunks loses long-distance semantic relationships.
- Difficulty handling rare words and low-resource languages.
- Often produces translations that sound unnatural or lack fluency.
Improvements with NMT:
- More fluent translations with fewer grammatical errors.
- Better semantic preservation in long sentences.
- Stronger ability to capture context and nuance compared to SMT.
Case Study – Manning & Schütze’s Foundational Work (1999)
Foundations of Statistical Natural Language Processing (Manning & Schütze, 1999) – While not solely about MT, it is a foundational resource for statistical NLP, with SMT as a key component.
Neural vs. Statistical Machine
Translation
- Overview of statistical machine translation
(SMT)
- Neural networks in translation (Neubig’s
contributions)
- Case studies comparing SMT and NMT
Overview of SMT
Statistical Machine Translation (SMT) is an approach where translations are generated based on statistical models derived from bilingual text corpora.
The fundamental SMT formula, based on Bayes’ theorem:
ê = argmax_e P(f|e) · P(e)
where P(f|e) is the Translation Model and P(e) is the Language Model.
That means:
- Word-based models (e.g., IBM Models 1–5)
- Phrase-based models (more flexible; capture short sequences instead of single words)
- Syntax-based models (incorporate grammatical structures)
Neural Networks in Translation (Neubig’s Contributions)
NMT marks a paradigm shift by using deep learning to directly model the translation process as a single, end-to-end neural network.
Encoder–Decoder Architecture: Source sentences are encoded into continuous vector representations; decoders generate target sentences from these vectors.
Attention Mechanism: Improves over fixed-length encoding by letting the model focus on relevant source words during translation.
Case studies comparing SMT and NMT
The comparison covers four dimensions, each rated for SMT and NMT:
1. Translation Fluency
2. Accuracy
3. Data Requirements
4. Generalization
Source sentence: “The weather is nice today, let’s go for a walk.”
SMT Output: “Thời tiết thì tốt hôm nay, hãy đi cho một đi bộ.”
→ Rigid, word-for-word, unnatural Vietnamese.
NMT Output: “Hôm nay thời tiết đẹp, mình đi dạo nhé.”
→ Fluent, natural, and feels human-like.
Chapter 3 Evaluation and Challenges of MT

1/ We need fast, consistent ways to check MT quality.

2/ Human evaluation is accurate but slow & costly.

3/ Automatic metrics help developers improve systems daily.


Metrics for evaluating MT
- BLEU
- METEOR
BLEU
(Bilingual Evaluation Understudy)
1/ Measures: closeness to human translations.
2/ Method: Modified n-gram precision × Brevity Penalty.
3/ Strength: Simple, language-independent.
4/ Weakness: Focuses on surface match.
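The "modified n-gram precision × brevity penalty" recipe can be sketched in a few lines. This is a simplified sentence-level BLEU with add-epsilon smoothing, not the exact corpus-level metric:

```python
from collections import Counter
import math

def bleu(candidate, reference, max_n=4):
    """Simplified sentence-level BLEU: geometric mean of modified
    n-gram precisions (n = 1..max_n), times a brevity penalty."""
    cand, ref = candidate.lower().split(), reference.lower().split()
    log_prec = 0.0
    for n in range(1, max_n + 1):
        cand_ngrams = Counter(tuple(cand[i:i + n]) for i in range(len(cand) - n + 1))
        ref_ngrams = Counter(tuple(ref[i:i + n]) for i in range(len(ref) - n + 1))
        # "Modified" precision: clip each candidate n-gram count by its count
        # in the reference, so repeating a matched word cannot inflate the score.
        overlap = sum(min(c, ref_ngrams[g]) for g, c in cand_ngrams.items())
        total = max(sum(cand_ngrams.values()), 1)
        # Tiny epsilon avoids log(0) when an n-gram level has no overlap.
        log_prec += math.log(max(overlap, 1e-9) / total)
    # Brevity penalty punishes candidates shorter than the reference.
    bp = 1.0 if len(cand) >= len(ref) else math.exp(1 - len(ref) / max(len(cand), 1))
    return bp * math.exp(log_prec / max_n)
```

On the slide's example below, Candidate 1 shares long n-grams with the reference and scores far higher than Candidate 2, which matches almost nothing beyond scattered single words.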
BLEU
Examples
Ref: “It is a guide to action that ensures the military will heed Party
commands.”
Cand 1: “It is a guide to action which ensures that the military obeys
the party.” ✅ (high BLEU)
Cand 2: “It is to insure the troops forever hearing the activity
guidebook that party direct.” ❌ (low BLEU)
METEOR
(Metric for Evaluation of Translation with Explicit Ordering)
1/ Measures: word matching with semantic awareness.
2/ Matches: exact, stems, synonyms, paraphrases.
3/ Uses: precision + recall + penalty for disordered matches.
4/ Strength: Captures meaning better than BLEU.
5/ Weakness: Slower, needs language tools.
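A toy version of this matching scheme, using unigrams only and a hand-built synonym table standing in for WordNet (stemming, paraphrase tables, and the fragmentation penalty are omitted for brevity):

```python
from collections import Counter

def meteor_sketch(candidate, reference, synonyms=None):
    """Toy METEOR-style score: unigram matching through a synonym table,
    combined with METEOR's recall-weighted harmonic F-mean."""
    synonyms = synonyms or {}

    def tokens(text):
        # Lowercase, strip punctuation, and map each word to a canonical
        # form via the synonym table so "bought" can match "purchased".
        return [synonyms.get(w, w) for w in
                (t.strip(".,!?;:") for t in text.lower().split())]

    cand, ref = tokens(candidate), tokens(reference)
    ref_counts = Counter(ref)
    matches = 0
    for w in cand:
        if ref_counts[w] > 0:          # each reference word matches at most once
            ref_counts[w] -= 1
            matches += 1
    if matches == 0:
        return 0.0
    precision = matches / len(cand)
    recall = matches / len(ref)
    return 10 * precision * recall / (recall + 9 * precision)
```

On the slide's example below, the synonym table lets "bought"/"purchased" and "automobile"/"car" match, giving a score of 0.75; without it, only "he" matches and the score drops to 0.25.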
METEOR
(Examples)
Ref: “He purchased a car.”
Cand: “He bought an automobile.”
BLEU: low score (different words)
METEOR: high score (matches “purchased” ↔ “bought”, “car” ↔ “automobile”)
II. Challenges in MT
- Cultural nuances
- Idiom
- Context
Cultural nuances
Example: Vietnamese “ăn Tết”
→ Correct: celebrate Lunar New Year / Wrong literal: eat Tet
BLEU: Scores low for “eat Tet” because n-grams differ from the reference. Cannot recognize cultural adaptation beyond exact word overlap.
METEOR: Scores low for “eat Tet” because there is no synonym/semantic match with “celebrate Lunar New Year”.
Idiom
Example: Kick the bucket
→ Meaning: die / Literal: kick a bucket
BLEU: The literal “kick a bucket” may still score high if the reference also contains “kick” and “bucket”, even if the meaning is wrong.
METEOR: The literal “kick a bucket” scores low if the reference is “die” because there is no synonym link. If a synonym for “die” is in the database, METEOR can reward the correct figurative meaning.
Context
Example: “He sat on the bank of the river”
→ Meaning: river shore / Wrong: financial bank
BLEU: The wrong “financial bank” translation scores low.
METEOR: Better than BLEU if synonyms like “shore” and “riverbank” are recognized.
BLEU vs. METEOR
BLEU: Measures surface word overlap → quick but blind to meaning errors.
METEOR: Adds synonym, stemming, and paraphrase matching → better for meaning, but still not perfect for culture, idioms, or deep context.
Chapter 4 Ethics and Future Trends in Machine Translation

1/ Ethical concerns in MT

2/ The future of MT

3/ Impact of MT on the translation profession
Ethical concerns in MT
Popovic & Seligman's perspectives

1/ Bias & Translation Shifts


MT may preserve or amplify unnatural translation shifts
that harm target language quality.
Reducing all shifts to match evaluation metrics can make
translations less natural.
Ethical concerns in MT
Popovic & Seligman's perspectives

2/ Transparency
MT systems work like “black boxes” → hard to explain why
errors occur.
3/ Evaluation Reliability
Using only one human reference in evaluation can skew
results due to translator-specific style.
The future of MT

1/ Human-in-the-loop
Example: In legal translation, MT drafts a contract → lawyer-
translator checks terminology to ensure it matches jurisdiction
requirements.
2/ Interactive MT
Example: Translating a live news broadcast — MT suggests terms
instantly, translator edits for accuracy before airing.
The future of MT
3/ Domain Adaptation
Example: MT adapted for finance → understands “bond” in
context as a debt instrument, not a glue or attachment.
4/ Post-editing as a profession
Example: E-commerce platforms use MT for product listings →
post-editors refine descriptions to avoid mistranslation that could
mislead buyers.
Impact of MT on the
translation profession
Shift in skills
Job market changes
Increased productivity
Pressure on time & cost
and more.
1/ The Growing Role of MT in Translation
MT is now a core tool in many translation workflows, from daily
communication to large-scale localization projects.
Advances in neural MT have significantly improved fluency and
accuracy.
MT enables faster turnaround and higher content volume,
especially for global businesses.
2/ Future Prospects & The Need for Human Involvement
MT will continue to evolve with domain adaptation, real-time
feedback, and semantic-aware evaluation.
The most effective model: Human + Machine collaboration — MT
handles speed and scale, humans ensure accuracy, nuance, and
ethical integrity.
Discussion
1. What are the contributions of MT?
2. Identify the main features of NMT.
CONTRIBUTIONS TO MT
- Detailed explanation of probabilistic principles in language processing.
- Development of bilingual translation models and language models.
- Techniques for word alignment – the backbone of SMT.
- Provided formulas, examples, and SMT error analyses, laying the groundwork for later research (e.g., Neubig, 2017 on NMT).
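The word-alignment technique mentioned above can be illustrated with IBM Model 1, the simplest of the IBM models: a short EM loop that learns word-translation probabilities t(f|e) from sentence pairs. This is a teaching sketch on a hypothetical two-sentence toy corpus, with no NULL word and no alignment priors:

```python
from collections import defaultdict

def ibm_model1(pairs, iterations=10):
    """Minimal IBM Model 1: EM estimation of word-translation
    probabilities t(f|e) from (target_words, source_words) pairs."""
    f_vocab = {f for _, fs in pairs for f in fs}
    t = defaultdict(lambda: 1.0 / len(f_vocab))   # uniform start for t(f|e)
    for _ in range(iterations):
        count = defaultdict(float)                # expected counts c(f, e)
        total = defaultdict(float)                # expected counts c(e)
        for es, fs in pairs:                      # E-step
            for f in fs:
                z = sum(t[(f, e)] for e in es)    # normalize over words in es
                for e in es:
                    c = t[(f, e)] / z
                    count[(f, e)] += c
                    total[e] += c
        for (f, e), c in count.items():           # M-step: renormalize
            t[(f, e)] = c / total[e]
    return t

# Hypothetical English-Spanish mini-corpus; the overlap between the two
# pairs is what lets EM pull "house" and "casa" together.
pairs = [(["the", "house"], ["la", "casa"]),
         (["house"], ["casa"])]
t = ibm_model1(pairs)
```

After a few EM iterations, t("casa" | "house") dominates t("la" | "house"), and "the" is left aligned mostly with "la": exactly the co-occurrence-driven alignment that SMT phrase extraction is built on.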
Key Features:
End-to-end modeling: The entire translation process is
learned directly from bilingual data without breaking it into
separate modules.
Encoder–Decoder architecture:
Encoder converts the source sentence into a semantic
vector.
Decoder generates the target sentence from this
vector.
Attention mechanism (Bahdanau et al., 2015): Allows the
model to “focus” on relevant words/phrases during
translation.
