Stars
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Streamline on-policy/off-policy distillation workflows in a few lines of code
My learning notes for machine learning systems (MLSys).
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
A simple plug-in framework that corrects bias and computes confidence intervals when reporting LLM-as-a-judge evaluations, and an adaptive algorithm that efficiently allocates calibration samples to r…
A calm, CLI-native way to semantically grep everything: code, images, PDFs, and more.
A non-saturating, open-ended environment for evaluating LLMs in Factorio
[Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
A challenging aggregation benchmark for long-context models
Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature convergence and unlock greater RL potential.
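For context on the limitation mentioned here, a minimal statement of the textbook PPO clipped surrogate objective that ASPO is described as improving on (standard notation, not taken from the Archer2.0 repository):

```latex
L^{\mathrm{CLIP}}(\theta)
  = \hat{\mathbb{E}}_t\!\left[
      \min\!\Big( r_t(\theta)\,\hat{A}_t,\;
                  \operatorname{clip}\!\big(r_t(\theta),\, 1-\epsilon,\, 1+\epsilon\big)\,\hat{A}_t \Big)
    \right],
\qquad
r_t(\theta) = \frac{\pi_\theta(a_t \mid s_t)}{\pi_{\theta_{\mathrm{old}}}(a_t \mid s_t)}
```

Clipping the probability ratio bounds each policy update, which stabilizes training but can also suppress useful gradient signal; that trade-off is the kind of PPO-Clip limitation the repository says ASPO addresses.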
Harbor is a framework for running agent evaluations and creating and using RL environments.
Easy, safe evaluation of arbitrary Python code
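As a rough illustration of what "safe evaluation" involves (a minimal sketch using only the Python standard library; this is not the repository's actual API), untrusted code can be run in a separate interpreter process with a hard timeout:

```python
import subprocess
import sys

def run_untrusted(code: str, timeout_s: float = 5.0) -> str:
    """Execute a Python snippet in a fresh subprocess with a wall-clock timeout.

    This provides process isolation and a timeout only; real sandboxes add
    filesystem, network, and memory restrictions on top of this boundary.
    """
    result = subprocess.run(
        [sys.executable, "-I", "-c", code],  # -I: isolated mode, ignores env vars and site dirs
        capture_output=True,
        text=True,
        timeout=timeout_s,  # raises subprocess.TimeoutExpired if exceeded
    )
    if result.returncode != 0:
        raise RuntimeError(result.stderr.strip())
    return result.stdout

# Example: evaluate a snippet and capture its stdout.
print(run_untrusted("print(sum(range(10)))"))  # -> 45
```

Production sandboxes layer stronger isolation (containers, seccomp filters, or microVMs) on top of this basic process boundary.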
Daytona is a secure and elastic infrastructure for running AI-generated code
Ultrafast serverless GPU inference, sandboxes, and background jobs
A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support for backtracking, a simple REST API and Python SDK, automat…
Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app
This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"
Content of the On-Line Encyclopedia of Integer Sequences (OEIS)
Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention
A framework for the evaluation of autoregressive code generation language models.
The 100-line AI agent that solves GitHub issues or helps you on your command line. Radically simple: no huge configs, no giant monorepo, yet it scores >74% on SWE-bench Verified!
Official code of "StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs".
Code for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping" by Zhiheng Xi et al.