Thanks to visit codestin.com
Credit goes to Github.com

sjtuytc

Follow

🎯

Focusing

Zelin Zhao sjtuytc

🎯

Focusing

Follow

Researcher

240 followers · 79 following

Achievements

Achievements

Highlights

Pro

Lists (1)

Sort

🔮 Future ideas

Starred repositories

bytedance / tarsier

Tarsier -- a family of large-scale video-language models, which is designed to generate high-quality video descriptions , together with good capability of general video understanding.

Python 520 28 Updated Aug 14, 2025

NVlabs / vibetensor

Our first fully AI generated deep learning system

Python 531 38 Updated Feb 2, 2026

nvidia-cosmos / cosmos-reason1

Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

Python 909 82 Updated Jan 6, 2026

ZGCTroy / CamI2V

official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"

Python 164 10 Updated Sep 29, 2025

bowang-lab / MedRAX

MedRAX: Medical Reasoning Agent for Chest X-ray - ICML 2025

Python 1,099 194 Updated Oct 31, 2025

Tencent-Hunyuan / HunyuanWorld-Voyager

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1,509 154 Updated Dec 17, 2025

CadQuery / cadquery

A python parametric CAD scripting framework based on OCCT

Python 4,507 422 Updated Feb 12, 2026

knightnemo / Awesome-World-Models

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

1,961 82 Updated Feb 12, 2026

sjtuytc / CETCam_project_page.github.io

project page of CETCam

CSS 1 Updated Nov 26, 2025

InternRobotics / InternVLA-M1

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

Python 367 19 Updated Feb 11, 2026

ShuhongLL / SGS-SLAM

[ECCV 2024] SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM

Jupyter Notebook 487 46 Updated Nov 20, 2025

wren93 / tuna

88 3 Updated Dec 12, 2025

open-gigaai / giga-world-0

GigaWorld-0: World Models as Data Engine to Empower Embodied AI

Python 1,471 121 Updated Dec 3, 2025

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 5,260 634 Updated Mar 23, 2025

baaivision / UniVLA

[ICLR 2026] Unified Vision-Language-Action Model

Python 275 20 Updated Oct 15, 2025

NVlabs / stylegan3

Official PyTorch implementation of StyleGAN3

Python 6,890 1,235 Updated Sep 12, 2023

QUVA-Lab / escnn

Equivariant Steerable CNNs Library for Pytorch https://quva-lab.github.io/escnn/

Python 501 61 Updated Oct 31, 2024

WEIRDLabUW / unified-world-model

Unfied World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets

Python 186 11 Updated Oct 8, 2025

google-deepmind / alphageometry

Python 4,772 566 Updated Jan 13, 2026

PointsCoder / OpenReal2Sim

A toolbox for real-to-sim reconstruction and robotic simulation

Python 191 15 Updated Feb 13, 2026

Tencent-Hunyuan / HunyuanWorld-Mirror

Fast and Universal 3D reconstruction model for versatile tasks

Python 999 90 Updated Feb 6, 2026

alex4727 / MotionStream

MotionStream: Real-Time Video Generation with Interactive Motion Controls

500 17 Updated Feb 6, 2026

guandeh17 / Self-Forcing

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,135 240 Updated Sep 12, 2025

Physical-Intelligence / openpi

Python 10,220 1,466 Updated Dec 27, 2025

showlab / Show-1

[IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Python 1,134 58 Updated Sep 13, 2025

snap-research / ac3d

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Python 153 11 Updated Sep 16, 2025

mayuelala / Awesome-Controllable-Video-Generation

[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"

673 39 Updated Nov 11, 2025

showlab / MotionDirector

[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Python 1,038 60 Updated Aug 21, 2024

Lifelong-Robot-Learning / LIBERO

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Jupyter Notebook 1,483 306 Updated Mar 15, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 21,606 3,767 Updated Feb 12, 2026

Starred topics

Awesome Lists