fafancier

5 followers · 8 following

Lists (25)

Starred repositories

schmidtdominik / LAPO

Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)

Python 132 10 Updated Jul 31, 2024

hao-ai-lab / FastVideo

A unified inference and post-training framework for accelerated video generation.

Python 2,930 236 Updated Jan 10, 2026

tianweiy / CausVid

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,153 66 Updated Aug 7, 2025

thu-ml / vidar

Official repo for vidar and vidarc: video foundation model for robotics.

Python 32 Updated Dec 22, 2025

lllyasviel / IC-Light

More relighting!

Python 8,339 525 Updated Feb 20, 2025

lllyasviel / FramePack

Let's make video diffusion practical!

Python 16,499 1,615 Updated Oct 16, 2025

zotero / zotero

Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.

JavaScript 13,172 927 Updated Jan 7, 2026

InternRobotics / InternVLA-A1

InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation

Python 233 9 Updated Jan 9, 2026

NVIDIA / GR00T-Dreams

Nvidia GEAR Lab's initiative to solve the robotics data problem using world models

Jupyter Notebook 433 42 Updated Oct 24, 2025

aigc-apps / VideoX-Fun

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,784 135 Updated Jan 6, 2026

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 13,495 1,601 Updated Dec 17, 2025

showlab / Mitty

Official code implementation of "Mitty: Diffusion-based Human-to-Robot Video Generation"

Python 11 1 Updated Dec 21, 2025

jonyzhang2023 / awesome-embodied-vla-va-vln

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,314 99 Updated Dec 27, 2025

thu-ml / Motus

Official code of Motus: A Unified Latent Action World Model

Python 548 9 Updated Jan 5, 2026

nvidia-cosmos / cosmos-transfer2.5

Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control inputs.

Python 334 48 Updated Jan 6, 2026

nvidia-cosmos / cosmos-predict2

Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Python 701 95 Updated Oct 29, 2025

nvidia-cosmos / cosmos-predict2.5

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 632 57 Updated Jan 5, 2026

apple / ml-sharp

Sharp Monocular View Synthesis in Less Than a Second

Python 6,722 443 Updated Dec 19, 2025

knightnemo / Awesome-World-Models

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

1,676 72 Updated Jan 6, 2026

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,958 603 Updated Mar 23, 2025

Lists (25)

3DGS

AIGC

Animation

Calibration

Concept

DIBR

DigitalHuman

Fusion

GPT

ImageTask2D

Library

LLM

MeshProcess

NERF

ObjectGeneration

Reconstruction

Render

Robot

SceneGen

Survey

Tools

VideoGen

VideoInterpolation

VLA

WorldModel

Starred repositories

3d-generation

bundle-adjustment

stereo-matching