Stars
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
simulstream is a Python library for simultaneous/streaming speech recognition and translation. It enables both the simulation with existing files to score systems, like in the SimulEval project, an…
A unified evaluation suite for speech-to-text translation, covering SpeechLLMs, SFMs, and cascaded systems across diverse real-world speech phenomena.
Platform for Evaluating and Reviewing of Multilingual Tasks
Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University
LanguageTechnologiesUnit' s repository for tokenizer training and evaluation scripts.
[EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling
[EMNLP'25] Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"
Code for "QE4PE: Word-level Quality Estimation for Human Post-Editing" ✍️
A framework for evaluating Machine Translation models.
Everything about federated learning, including research papers, books, codes, tutorials, videos and beyond
Everything you want about DP-Based Federated Learning, including Papers and Code. (Mechanism: Laplace or Gaussian, Dataset: femnist, shakespeare, mnist, cifar-10 and fashion-mnist. )
Training Sparse Autoencoders on Language Models
Code for the paper "Investigating the translation capabilities of Large Language Models trained on parallel data only"
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
The geometry of multilingual language model representations (EMNLP 2022).
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Implementation of Hinton's forward-forward (FF) algorithm - an alternative to back-propagation
The code for the Gravitational Dimensionality Reduction (GDR) algorithm
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
A corpus of the Christmas speeches delivered by the head of state of Spain from 1937 to 2021
Examples of how to create colorful, annotated equations in Latex using Tikz.
Time series forecasting with PyTorch
Multivariate Time Series Forecasting with efficient Transformers. Code for the paper "Long-Range Transformers for Dynamic Spatiotemporal Forecasting."