๐ Developed by XI TJKT 2 | 2024/2025 | โ Any commercial use or unauthorized exploitation is prohibited
Release:
โโโ โโโ โโโโโโโโโ โโโโโโโโ
โโโ โโโ โโโโโโโโโ โโโโโโโโ
โโโ โโโ โโโ โโโโโโโโ
โโโ โโโ โโโ โโโโโโโโ
โโโ โโโโโโโโ โโโ โโโโโโโโ
โโโ โโโโโโโโ โโโ โโโโโโโโ
-------------------------------
ILTE - Indigenous Language Translator Engine
The Indigenous Language Translator Engine (ILTE) now offers four distinct versions, each tailored to different translation needs:
- ๐ฑ ILTE-ALT (Optimized for Speed) โ A lightweight, dictionary-based translator optimized for fast, low-resource translations.
- ๐ง ILTE-ZS (Hybrid, Multi-Processing) โ Combines dictionary-based rules, RBMT, FST, semantic matching, and zero-shot translation while efficiently handling large text files.
- ๐ง ILTE-ADV (AI-Powered, Context-Aware) โ An advanced, AI-driven translation engine that integrates context awareness, semantic similarity, and zero-shot learning.
- ๐ฎ ILTE-ATI (Advanced Attention & Iterative Processing) โ The most sophisticated version with hierarchical normalization, iterative refinement, attention-based translation, and multi-level candidate selection.
- โ Dictionary-Based Lookup for direct translations.
- โ Basic Stemming for Indonesian (ID) & English (EN).
- โ Levenshtein Distance Matching for closest word lookup.
- โ Automated Confidence Scoring for accuracy estimation.
- โ Structured DOCX Report Generation.
- โ Low Memory Usage โ Optimized for lower-end machines.
- โก Dictionary + RBMT + FST + Semantic Matching + Zero-Shot Translation.
- โ๏ธ Handles Large Files Efficiently via chunking & batch multi-processing.
- ๐ ๏ธ Optimized Resource Management, cleans memory and GPU after processing.
- ๐ Auto-Parallelized Translation Pipeline.
- โณ Faster Preprocessing, No Unnecessary Computation.
- ๐ง Contextual Translation using IndoBERT & Sentence Transformers.
- ๐ Zero-Shot Learning for Handling Unknown Words.
- ๐ Pattern-Based Learning & Semantic Matching.
- ๐ ๏ธ Enhanced Translation Confidence Metrics.
- โก Leverages GPU Acceleration for Faster Processing.
- โจ Hierarchical Normalization for Better Preprocessing.
- ๐ Iterative Translation for Context Awareness.
- ๐ Attention-Based Translation for Multi-Level Candidate Generation.
- โ๏ธ Refined Confidence Scoring & Adaptive Refinement.
- โ Full Formatting Preservation in DOCX Reports.
- ๐ Optimized for Dynamic, Multi-Stage Translation Processes.
| Feature | Model Used |
|---|---|
| Translation (ID-EN, EN-ID) | Helsinki-NLP/opus-mt-id-en, Helsinki-NLP/opus-mt-en-id |
| Stemming | Sastrawi (Indonesian), SnowballStemmer (English) |
| Fuzzy Matching | Levenshtein Distance |
| Feature | Model Used |
|---|---|
| Dictionary-Based Lookup | JSON-based dictionary |
| Rule-Based Translation (RBMT, FST) | Custom FST Rules |
| Semantic Similarity | sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 |
| Zero-Shot Translation | facebook/mbart-large-50-many-to-many-mmt |
| Feature | Model Used |
|---|---|
| Contextual Embeddings | cahya/bert-base-indonesian-1.5G |
| Semantic Matching | sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 |
| Zero-Shot Classification | typeform/distilbert-base-uncased-mnli |
| Translation (ID-EN, EN-ID) | Helsinki-NLP/opus-mt-id-en, Helsinki-NLP/opus-mt-en-id |
| Feature | Model Used |
|---|---|
| Hierarchical Normalization | Regex + Dynamic Stemming |
| Contextual Translation | sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 |
| Iterative Processing | Multi-Level Candidate Refinement |
| Translation (ID-DYK, DYK-ID) | Enhanced Dictionary Lookup + Semantic Matching |
| Feature | ILTE-ALT | ILTE-ZS | ILTE-ADV | ILTE-ATI |
|---|---|---|---|---|
| Translation Approach | Dictionary | Hybrid | AI-Based | Attention-Based + Iterative |
| Processing Speed | Fast | Moderate | Slower | Balanced |
| Handling Large Files | Struggles | Efficient Chunking | Slower | Optimized Processing |
| Memory Usage | Low | Moderate | High | Optimized |
| Context Awareness | None | Partial | Strong | ๐ฎ Very Strong |
| Idiomatic Expressions | Limited | Rule-Based | AI-Based | AI + Attention |
| Parallelization | Minimal | Yes | DataLoader | Thread + Process Pool |
| Zero-Shot Capability | No | Yes | Yes | Yes |
| Best Use Case | Fast translation | Large text processing | Context-Aware | High-Accuracy, AI-Powered |
python engine_ALT.pypython engine_ZS.pypython engine_ADV.pypython engine_ATI.pyChoose the version that best suits your needs and contribute to indigenous language preservation. ๐
- โ ALT: For lightweight, dictionary-based translations.
- โ ZS: For handling large files efficiently with hybrid translation techniques.
- โ ADV: For AI-powered, context-aware translations.
- โ ATI For advanced attention towards content and context.