tokenbender

tokenbender tokenbender

pretrain/RL/distributed training • eXperiments labs

218 followers · 0 following

language modelling specialisation
https://tokenbender.com/
@tokenbender

Achievements

x2 x2

Achievements

x2 x2

Starred repositories

furlat / Abstractions

A Collection of Pydantic Models to Abstract IRL

Python 35 2 Updated Dec 10, 2025

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,750 271 Updated Jul 18, 2025

saran-gangster / avatarl-lightning

lightning implementation of avatarl

Python 6 Updated Aug 29, 2025

NVIDIA / accelerated-computing-hub

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 1,036 188 Updated Dec 12, 2025

SWE-agent / mini-swe-agent

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Python 2,403 309 Updated Dec 29, 2025

Royaltyprogram / Crux1

The State Of The Art, intelligence

Python 157 25 Updated Aug 12, 2025

VKCOM / YouTokenToMe

Unsupervised text tokenizer focused on computational efficiency

C++ 975 109 Updated Mar 29, 2024

SakanaAI / treequest

A Tree Search Library with Flexible API for LLM Inference-Time Scaling

Python 509 65 Updated Dec 9, 2025

anhvth / opensloth

Python 242 36 Updated Sep 30, 2025

ScalingIntelligence / tokasaurus

Python 461 34 Updated Nov 25, 2025

mdy666 / mdy_triton

Jupyter Notebook 150 13 Updated Jul 4, 2025

ChinmayK0607 / komorebi

LM/VLM implementations

Python 9 2 Updated Oct 25, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 6,623 764 Updated Jun 25, 2025

sail-sg / oat

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 605 55 Updated Dec 23, 2025

Scale3-Labs / dspy-examples

A collection of example AI programs built using DSPy and maitained by the Langtrace AI team.

Python 48 6 Updated Nov 20, 2024

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 36,407 4,303 Updated Dec 31, 2025

MLSysOps / MLE-agent

🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Gemini, Ollam…

Python 1,481 97 Updated Jul 27, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,169 1,807 Updated Feb 26, 2025

microsoft / markitdown

Python tool for converting files and office documents to Markdown.

Python 84,829 4,884 Updated Dec 1, 2025

facebookresearch / blt

Code for BLT research paper

Python 2,018 189 Updated Nov 3, 2025

davanstrien / awesome-synthetic-datasets

awesome synthetic (text) datasets

Jupyter Notebook 315 15 Updated Nov 19, 2025

wasiahmad / Awesome-LLM-Synthetic-Data

A reading list on LLM based Synthetic Data Generation 🔥

1,498 90 Updated Jun 5, 2025

datadreamer-dev / DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Python 1,086 55 Updated Feb 2, 2025

richards199999 / Thinking-Claude

Let your Claude able to think

TypeScript 16,642 1,964 Updated Nov 4, 2025

zhangfaen / finetune-Qwen2-VL

Python 384 44 Updated Feb 8, 2025

JH-LEE-KR / l2p-pytorch

PyTorch Implementation of Learning to Prompt (L2P) for Continual Learning @ CVPR22

Python 201 27 Updated Oct 14, 2023

Christina200 / Online-LoRA-official

[WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong Li and Radu Marculescu

Python 53 2 Updated Aug 26, 2025

openai / swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,751 2,220 Updated Mar 11, 2025

JoshuaPurtell / SmallBench

Small, simple agent task environments for training and evaluation

Python 19 Updated Nov 1, 2024

THUDM / LongCite

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Python 516 32 Updated Dec 31, 2024

Natural language processing

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tokenbender tokenbender

Achievements

Achievements

Block or report tokenbender

Starred repositories

furlat / Abstractions

facebookresearch / lingua

saran-gangster / avatarl-lightning

NVIDIA / accelerated-computing-hub

SWE-agent / mini-swe-agent

Royaltyprogram / Crux1

VKCOM / YouTokenToMe

SakanaAI / treequest

anhvth / opensloth

ScalingIntelligence / tokasaurus

mdy666 / mdy_triton

ChinmayK0607 / komorebi

simplescaling / s1

sail-sg / oat

Scale3-Labs / dspy-examples

jingyaogong / minimind

MLSysOps / MLE-agent

deepseek-ai / DeepSeek-VL2

microsoft / markitdown

facebookresearch / blt

davanstrien / awesome-synthetic-datasets

wasiahmad / Awesome-LLM-Synthetic-Data

datadreamer-dev / DataDreamer

richards199999 / Thinking-Claude

zhangfaen / finetune-Qwen2-VL

JH-LEE-KR / l2p-pytorch

Christina200 / Online-LoRA-official

openai / swarm

JoshuaPurtell / SmallBench

THUDM / LongCite

Starred topics

Natural language processing