willccbb

will brown willccbb

838 followers · 3 following

Achievements

x2 x2 x3

Achievements

x2 x2 x3

Organizations

Lists (1)

Sort

Gyms

Stars

run-house / kubetorch

Distribute and run AI workloads magically in Python, like PyTorch for ML infra.

Python 1,062 42 Updated Oct 26, 2025

alexzhang13 / rlm

Super basic implementation (gist-like) of RLMs with REPL environments.

Python 210 26 Updated Oct 17, 2025

firstbatchxyz / mem-agent

Memory Agent monorepo

Python 65 6 Updated Oct 9, 2025

meta-pytorch / torchforge

PyTorch-native post-training at scale

Python 418 40 Updated Oct 26, 2025

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 1,123 79 Updated Oct 27, 2025

thinking-machines-lab / tinker

Training API

Python 177 13 Updated Oct 17, 2025

openai / frontier-evals

OpenAI Frontier Evals

Python 924 107 Updated Oct 21, 2025

firecrawl / firecrawl

🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data

TypeScript 64,912 5,131 Updated Oct 26, 2025

Keen-Technologies / physical_atari

Platform for evaluating reinforcement learning (RL) algorithms on a physical Atari system.

Python 125 2 Updated Aug 28, 2025

arcee-ai / NeMo-RL

Scalable toolkit for efficient model reinforcement

Python 10 1 Updated Oct 27, 2025

thinking-machines-lab / batch_invariant_ops

Python 854 60 Updated Oct 14, 2025

wootzapp / wootz-browser

A mobile browser & a first-of-its-kind app store. battery optimised background agents, optimized extensions, zk private identity, private ads. Optimized for RL

C++ 100 50 Updated Oct 5, 2025

run-llama / semtools

Semantic search and document parsing tools for the command line

Rust 1,262 99 Updated Oct 3, 2025

arcprize / ARC-AGI-3-Agents

Python 92 42 Updated Sep 30, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,101 148 Updated Oct 27, 2025

PrimeIntellect-ai / pccl

PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP

C++ 133 8 Updated Sep 12, 2025

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 2,391 187 Updated Oct 27, 2025

quotient-ai / judges

A small library of LLM judges

Python 295 30 Updated Jul 31, 2025

jacobphillips99 / open-rubric

Forked from PrimeIntellect-ai/verifiers

Fork of verifiers focused on multi-step rubric evaluation complete with multi-step environments and synthetic data generators

Python 4 Updated Aug 18, 2025

facebookresearch / moodist

moodist

C 22 5 Updated Oct 1, 2025

voice-from-the-outer-world / lisan-bench

LisanBench is a lightweight benchmark for LLMs that stresses forward planning, vocabulary depth, constraint adherence, attention, and long-context "stamina" all at once.

Python 11 2 Updated Jun 1, 2025