thomlake

thom lake thomlake

29 followers · 1 following

Indeed
Austin, TX, USA
http://thomlake.github.io/

Achievements

Highlights

Stars

ChenmienTan / RL2

Python 962 101 Updated Dec 21, 2025

MIT-MI / MEM1

Python 202 15 Updated Oct 27, 2025

policy-gradient / GRPO-Zero

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,716 81 Updated Apr 18, 2025

sail-sg / oat

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 583 50 Updated Oct 31, 2025

mingyin0312 / RLFromScratch

Python 465 37 Updated Aug 28, 2025

aypan17 / machiavelli

Python 143 33 Updated Jul 23, 2025

TsinghuaC3I / MARTI

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 377 42 Updated Nov 20, 2025

lsdefine / lsrl

Low ReSource Reinforcement Learning with CPU Offloading Training Support

Python 78 7 Updated Dec 10, 2025

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 9,769 790 Updated Dec 21, 2025

PrimeIntellect-ai / verifiers

Our library for RL environments + evals

Python 3,654 453 Updated Dec 21, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,297 116 Updated Dec 11, 2025

TIGER-AI-Lab / verl-tool

A version of verl to support diverse tool use

Python 768 63 Updated Dec 10, 2025

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,051 643 Updated Dec 19, 2025

meta-pytorch / monarch

PyTorch Single Controller

Rust 929 120 Updated Dec 20, 2025

cpldcpu / MisguidedAttention

A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information

Python 452 26 Updated Jul 31, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 2,084 119 Updated Jun 2, 2025

axolotl-ai-cloud / axolotl-cookbook

Python 36 7 Updated Aug 1, 2025

ContextualAI / HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 896 50 Updated Sep 30, 2025

mangiucugna / json_repair

A python module to repair invalid JSON from LLMs

Python 4,195 161 Updated Dec 17, 2025

AlexWan0 / rag-convincingness

Python 27 3 Updated Feb 26, 2024

Avaiga / taipy

Turns Data and AI algorithms into production-ready web applications in no time.

Python 18,967 1,961 Updated Dec 2, 2025

yuchenlin / rebiber

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,957 164 Updated Jul 9, 2025

laactechnology / foxcross

AsyncIO serving for data science models

Python 24 Updated Dec 8, 2022

PAIR-code / lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

TypeScript 3,618 370 Updated Dec 5, 2025

facebookresearch / hydra

Hydra is a framework for elegantly configuring complex applications

Python 10,051 760 Updated Dec 11, 2025

github / CodeSearchNet

Datasets, tools, and benchmarks for representation learning of code.

Jupyter Notebook 2,400 410 Updated Jan 31, 2022

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,114 31,504 Updated Dec 20, 2025

asgordon / TriangleCOPA

One hundred challenge problems for logical formalizations of commonsense psychology

27 1 Updated Oct 9, 2025

explosion / thinc

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

Python 2,884 288 Updated Dec 12, 2025

googlecreativelab / quickdraw-dataset

Documentation on how to access and use the Quick, Draw! Dataset.

6,613 1,024 Updated Mar 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

thom lake thomlake

Achievements

Achievements

Highlights

Block or report thomlake

Stars

ChenmienTan / RL2

MIT-MI / MEM1

policy-gradient / GRPO-Zero

sail-sg / oat

mingyin0312 / RLFromScratch

aypan17 / machiavelli

TsinghuaC3I / MARTI

lsdefine / lsrl

microsoft / agent-lightning

PrimeIntellect-ai / verifiers

langfengQ / verl-agent

TIGER-AI-Lab / verl-tool

OpenPipe / ART

meta-pytorch / monarch

cpldcpu / MisguidedAttention

Open-Reasoner-Zero / Open-Reasoner-Zero

axolotl-ai-cloud / axolotl-cookbook

ContextualAI / HALOs

mangiucugna / json_repair

AlexWan0 / rag-convincingness

Avaiga / taipy

yuchenlin / rebiber

laactechnology / foxcross

PAIR-code / lit

facebookresearch / hydra

github / CodeSearchNet

huggingface / transformers

asgordon / TriangleCOPA

explosion / thinc

googlecreativelab / quickdraw-dataset