Lists (27)
Sort Name ascending (A-Z)
3DGS
Agent
AIGC
Animation
Calibration
Concept
DIBR
DigitalHuman
Fusion
GPT
ImageTask2D
Library
LLM
LocoManip
MeshProcess
NERF
ObjectGeneration
Reconstruction
Render
Robot
SceneGen
Survey
Tools
VideoGen
VideoInterpolation
VLA
WorldModel
Starred repositories
Code for kai0, including training, inference and data collection.
[IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Train & Deploy of iDP3
[CoRL 2025] TWIST: Teleoperated Whole-Body Imitation System
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
Open-sourced code for "HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit".
[L4DC 2026] "FALCON: Learning Force-Adaptive Humanoid Loco-Manipulation"
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Tensor's VLA Training Infrastructure for Real-World Robotics in PyTorch
[arXiv 2025] TWIST2: Scalable, Portable, and Holistic Humanoid Data Collection System
HoloMotion: A Foundation Model for Whole-Body Humanoid Control
A Paper List for Humanoid Robot Learning.
[ICLR 2026] Towards Unified Latent VLA for Whole-body Loco-manipulation Control
Welcome to GR00T Whole-Body Control (WBC)! This is a unified platform for developing and deploying advanced humanoid controllers. This includes: Decoupled WBC models used in NVIDIA Isaac-Gr00t, Gr0…
Builder and index for PyTorch packages
[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos
Spirit-v1.5: A Robotic Foundation Model by Spirit AI
Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)
A unified inference and post-training framework for accelerated video generation.
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Official repo for vidar and vidarc: video foundation model for robotics.
Lets make video diffusion practical!
Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.
InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
DreamGen: Nvidia GEAR Lab's initiative to solve the robotics data problem using world models
📹 A more flexible framework that can generate videos at any resolution and creates videos from images.
Wan: Open and Advanced Large-Scale Video Generative Models
Official code implementation of "Mitty: Diffusion-based Human-to-Robot Video Generation"