Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View lan-lc's full-sized avatar
  • UCLA
  • Los Angeles

Block or report lan-lc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,285 8,591 Updated Nov 12, 2025

Exploring Expert Failures Improves LLM Agent Tuning

1 Updated Apr 21, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,867 3,824 Updated Dec 21, 2025

UCLA Thesis LaTeX style

TeX 141 91 Updated Jun 15, 2020

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,321 4,780 Updated Jun 2, 2025
Jupyter Notebook 641 83 Updated Nov 10, 2025

[NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents"

Python 29 Updated Mar 14, 2024

[ICML'2023 Oral] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"

Python 65 3 Updated Oct 21, 2023

[CoRL'23] Parting with Misconceptions about Learning-based Vehicle Motion Planning

Python 679 75 Updated May 16, 2025

Hydra is a framework for elegantly configuring complex applications

Python 10,052 760 Updated Dec 11, 2025
Python 16 2 Updated Aug 2, 2023

The devkit of the nuPlan dataset.

Python 931 198 Updated Aug 27, 2025

Related papers for reinforcement learning, including classic papers and latest papers in top conferences

508 36 Updated Nov 11, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,052 4,669 Updated Dec 19, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,115 31,501 Updated Dec 21, 2025

Sample efficiency and generalisation in reinforcement learning using procedural generation.

Python 4 2 Updated Dec 30, 2021

research

Python 3 Updated Mar 7, 2023

PyTorch code to train and evaluate Procgen tasks

Python 25 2 Updated Nov 1, 2020

CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning

Python 239 26 Updated Nov 1, 2022

Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework

Python 67 12 Updated Jan 26, 2021

Attack AlphaZero Go agents (NeurIPS 2022)

C++ 22 Updated Dec 3, 2022

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

Python 9,128 879 Updated Dec 21, 2025
C++ 7 3 Updated Sep 25, 2018

[NeurIPS 2020 Spotlight] State-adversarial PPO for robust deep reinforcement learning

Python 31 4 Updated Nov 18, 2021

RLMeta is a light-weight flexible framework for Distributed Reinforcement Learning Research.

Python 283 25 Updated Feb 11, 2023

PECOS - Prediction for Enormous and Correlated Spaces

Python 542 110 Updated Feb 1, 2025