Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View yanx27's full-sized avatar
🤣
I may be slow to respond
🤣
I may be slow to respond

Organizations

@CUHKSZ-TQL

Block or report yanx27

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,157 65 Updated Oct 13, 2025

Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 1,320 90 Updated Jan 31, 2025

Drive-Pi0 and DriveMoE on End-to-end Autonomous Driving

Python 136 17 Updated Dec 14, 2025

[NeurIPS 2025] AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning

351 9 Updated Dec 2, 2025

[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling

Python 554 6 Updated Oct 26, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,081 1,281 Updated Oct 11, 2025

(ICCV2025) End-to-End Driving with Online Trajectory Evaluation via BEV World Model

Python 180 16 Updated Jun 29, 2025

[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling

Python 4,144 323 Updated Sep 26, 2025

official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*

Python 57 1 Updated Jan 10, 2025

Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving (AAAI-25)

Python 84 10 Updated Feb 12, 2025

Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"

Python 229 22 Updated Jan 15, 2025

This is a collective repository for all 3DGS related progresses in research and industry world

692 32 Updated Jan 19, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,696 85 Updated Feb 11, 2025

Large Driving Models

265 11 Updated Jan 27, 2025

[NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats

Python 512 23 Updated Oct 14, 2025

[CVPR 2025] UniScene: Unified Occupancy-centric Driving Scene Generation

Python 537 28 Updated Oct 30, 2025

Code release for https://kovenyu.com/WonderWorld/

Python 695 34 Updated Apr 14, 2025
Python 133 4 Updated Mar 25, 2025
Python 103 4 Updated Nov 21, 2024

[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Python 358 15 Updated Jan 14, 2025

OPUS: Occupancy Prediction Using a Sparse Set

Python 130 7 Updated Dec 9, 2025
73 3 Updated Aug 17, 2025

[CVPR 2024] PixelLM is an effective and efficient LMM for pixel-level reasoning and understanding.

Python 246 9 Updated Feb 11, 2025

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 12,103 1,071 Updated Oct 29, 2025

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,835 81 Updated Dec 26, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,351 2,138 Updated Dec 18, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,163 2,298 Updated Dec 25, 2024

[ECCV 2024] Embodied Understanding of Driving Scenarios

Python 208 14 Updated Jul 2, 2025

mllm-npu: training multimodal large language models on Ascend NPUs

Python 95 2 Updated Aug 29, 2024
Next