EuLLM — Open-source sovereign LLM platform
Description
EuLLM is an open-source platform for creating, distributing, and running sovereign EU-compliant Large Language Models, designed for verticalization across domains, languages, and brands while ensuring AI Act compliance.
The platform consists of three components: Engine — a Rust-based inference runtime built on top of llama.cpp, exposing OpenAI-compatible and Ollama-compatible APIs on the same port, with TurboQuant KV cache compression for up to 4× context length on consumer GPUs and a continuous batching scheduler for parallel decode of multiple concurrent requests; Forge — a verticalization pipeline written in Python that compresses 14B-30B foundation models down to specialized 7B domain experts via structural pruning, knowledge distillation, quantization, and identity LoRA fine-tuning; Hub — an EU-hosted model registry providing AI Act compliance cards, provenance documentation, and verifiable model lineage tracking, distributed via European cloud infrastructure to ensure data sovereignty.
Use cases include sovereign EU LLM deployment for regulated industries (legal, healthcare, finance), domain-specific verticalized models, and EU AI Act compliant inference infrastructure.
Notes
Files
eullm/eullm-EuLLM-v0.4.4.zip
Files
(1.2 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:518c1a770f8745e42588c17bcb434b27
|
1.2 MB | Preview Download |
Additional details
Related works
- Is documented by
- Other: https://eullm.eu (URL)
- Is supplement to
- Software: https://github.com/eullm/eullm (URL)
Software
- Repository URL
- https://github.com/eullm/eullm