-
University of Tübingen
- Tübingen
- https://sebastiandziadzio.com
- @sbdzdz
- in/sebastiandziadzio
Stars
Our library for RL environments + evals
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025
[ACL'25] The official code for "ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities"
The simplest, fastest repository for training/finetuning small-sized VLMs.
An extremely fast Python type checker and language server, written in Rust.
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
This repository serves as a collection of research notes and resources on training large language models (LLMs) and Reinforcement Learning from Human Feedback (RLHF). It focuses on the latest resea…
2026 AI/ML internship & new graduate job list updated daily
An extremely fast Python package and project manager, written in Rust.
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
🎉 PILOT: A Pre-trained Model-Based Continual Learning Toolbox
llama3 implementation one matrix multiplication at a time
Astro template to help you build an interactive project page for your research paper
Building blocks for foundation models.
A repository of links with advice related to grad school applications, research, phd etc
Ongoing research training transformer models at scale
Tools for merging pretrained large language models.
DSPy: The framework for programming—not prompting—language models
A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP
PyCIL: A Python Toolbox for Class-Incremental Learning
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Official PyTorch implementation for "Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels"
Extended LaTeX template for CVPR/ICCV papers
Utility code from STAI (https://scalabletrustworthyai.github.io/)