fafancier's starred repositories

Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)

Python 132 10 Updated Jul 31, 2024

A unified inference and post-training framework for accelerated video generation.

Python 2,930 236 Updated Jan 10, 2026

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,153 66 Updated Aug 7, 2025

Official repo for vidar and vidarc: video foundation model for robotics.

Python 32 Updated Dec 22, 2025

More relighting!

Python 8,339 525 Updated Feb 20, 2025

Let's make video diffusion practical!

Python 16,499 1,615 Updated Oct 16, 2025

Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.

JavaScript 13,172 927 Updated Jan 7, 2026

InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation

Python 233 9 Updated Jan 9, 2026

Nvidia GEAR Lab's initiative to solve the robotics data problem using world models

Jupyter Notebook 433 42 Updated Oct 24, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,784 135 Updated Jan 6, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 13,495 1,601 Updated Dec 17, 2025

Official code implementation of "Mitty: Diffusion-based Human-to-Robot Video Generation"

Python 11 1 Updated Dec 21, 2025

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,314 99 Updated Dec 27, 2025

Official code of Motus: A Unified Latent Action World Model

Python 548 9 Updated Jan 5, 2026

Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control inputs.

Python 334 48 Updated Jan 6, 2026

Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Python 701 95 Updated Oct 29, 2025

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 632 57 Updated Jan 5, 2026

Sharp Monocular View Synthesis in Less Than a Second

Python 6,722 443 Updated Dec 19, 2025

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

1,676 72 Updated Jan 6, 2026

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,958 603 Updated Mar 23, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 28,262 2,840 Updated Apr 30, 2025

Light Image Video Generation Inference Framework

Python 1,748 134 Updated Jan 9, 2026

A curated list of recent diffusion models for video generation, editing, and various other applications.

5,357 331 Updated Dec 15, 2025

Enjoy the magic of Diffusion models!

Python 11,402 1,088 Updated Jan 8, 2026

[ICLR 2025] LAPA: Latent Action Pretraining from Videos

Python 435 30 Updated Jan 22, 2025

Python 5 13 Updated May 27, 2024

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Python 3,546 633 Updated May 15, 2024

NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.

Jupyter Notebook 5,873 926 Updated Dec 18, 2025

GigaBrain-0: A World Model-Powered Vision-Language-Action Model

Python 1,510 117 Updated Nov 28, 2025

LATTICE: Democratize High-Fidelity 3D Generation at Scale

206 2 Updated Dec 4, 2025