- The Chinese University of Hong Kong, Shenzhen
- China, Shenzhen
- https://yanx27.github.io/
Stars
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Re-implementation of the pi0 vision-language-action (VLA) model from Physical Intelligence
Drive-Pi0 and DriveMoE on End-to-end Autonomous Driving
[NeurIPS 2025] AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning
[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
[ICCV 2025] End-to-End Driving with Online Trajectory Evaluation via BEV World Model
[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling
Official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving (AAAI-25)
Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"
A collective repository tracking 3DGS-related progress in research and industry
A suite of image and video neural tokenizers
[NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
[CVPR 2025] UniScene: Unified Occupancy-centric Driving Scene Generation
Code release for https://kovenyu.com/WonderWorld/
[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
[CVPR 2024] PixelLM is an effective and efficient LMM for pixel-level reasoning and understanding.
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[ECCV 2024] Embodied Understanding of Driving Scenarios
mllm-npu: training multimodal large language models on Ascend NPUs