Stars
The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Collection of reinforcement learning algorithms
Website for Practical Deep Learning for Coders 2022
An autoregressive character-level language model for making more things
verl: Volcano Engine Reinforcement Learning for LLMs
Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples
SGLang is a fast serving framework for large language models and vision language models.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
magnusja / ppo
Forked from pat-coady/trpo
Proximal Policy Optimization with TensorFlow and OpenAI Gym
Simple framework for image and video deblurring, implemented in PyTorch
Python Implementations of Monte Carlo Tree Search
A replica of the AlphaZero methodology for deep reinforcement learning in Python
An educational resource to help anyone learn deep reinforcement learning.
Python Implementation of Reinforcement Learning: An Introduction
Train auto_car in the CARLA simulator with RL algorithms (SAC).
A high-performance distributed training framework for Reinforcement Learning