This repository implements a Best-of-N (BoN) strategy for inference-aware fine-tuning of large language models. The system supports multiple leading LLM providers and includes comprehensive testing…

Python 3 Updated Dec 26, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 28,452 2,875 Updated Apr 30, 2025

huggingface / finetrainers

Scalable and memory-optimized training of diffusion models

Python 1,327 142 Updated Jun 4, 2025

Robot-VLAs / RoboVLMs

Python 425 21 Updated Nov 29, 2025

hilookas / SimplerEnv

Forked from simpler-env/SimplerEnv

Evaluating FSD on SimplerEnv

Jupyter Notebook 8 Updated Jul 10, 2025

DelinQu / SimplerEnv-OpenVLA

Forked from simpler-env/SimplerEnv

Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo, and OpenVLA) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)

Jupyter Notebook 261 43 Updated Jun 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fangqi-Zhu

Block or report fangqi-Zhu

Stars

19935541831 / qwen3vl_robot_annotator

starVLA / starVLA

WM-PO / WMPO

bytedance / IRASim

ByteDance-Seed / Bagel

PRIME-RL / SimpleVLA-RL

MaxSobolMark / PolicyAgnosticRL

EvolvingLMMs-Lab / lmms-eval

showlab / Awesome-Robotics-Diffusion

thuml / iVideoGPT

CleanDiffuserTeam / CleanDiffuser

allenzren / open-pi-zero

ai-in-pm / Inference-SFT-and-RL

hpcaitech / Open-Sora

huggingface / finetrainers

Robot-VLAs / RoboVLMs

hilookas / SimplerEnv

DelinQu / SimplerEnv-OpenVLA

Tencent-Hunyuan / HunyuanVideo

Dawn-LX / CausalCache-VDM

xuyang-liu16 / Awesome-Generation-Acceleration

prathebaselva / FORA

etched-ai / open-oasis

RoboTwin-Platform / RoboTwin

Tiiny-AI / PowerInfer

1x-technologies / 1xgpt

robocasa / robocasa

NevSNev / FloED-main

NevSNev / UniST

dexsuite / dex-urdf