Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View huang2202's full-sized avatar

Block or report huang2202

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

BEHAVIOR-1K: a platform for accelerating Embodied AI research. Join our Discord for support: https://discord.gg/bccR5vGFEx

Python 1,073 120 Updated Oct 31, 2025

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

Python 1,988 602 Updated Oct 25, 2025
Python 2 4 Updated Apr 23, 2025
Python 551 53 Updated May 23, 2025

[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Python 287 21 Updated Jul 15, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,410 730 Updated Sep 22, 2025

MoIIE: Mixture of Intra- and Inter-Modality Experts for Large Vision Language Models

Python 13 Updated Aug 14, 2025

[NeurIPS 2019, Spotlight] Point-Voxel CNN for Efficient 3D Deep Learning

Python 672 132 Updated Apr 12, 2022

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,311 3,422 Updated Oct 28, 2025

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

Python 584 20 Updated Oct 29, 2024

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 653 63 Updated Apr 20, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 11,482 1,177 Updated Oct 11, 2025

Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"

Python 193 6 Updated May 30, 2025

[CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction" by David Charatan, Sizhe Lester Li, Andrea Tagliasacch…

Python 1,143 71 Updated Jan 13, 2025

Submanifold sparse convolutional networks

C++ 2,132 335 Updated Jan 9, 2024

[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning

Python 87 4 Updated Oct 14, 2024

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

Python 5,258 1,489 Updated Nov 30, 2023
Python 34 3 Updated Jun 17, 2023

[IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Train & Deploy of iDP3

Python 435 31 Updated Jun 16, 2025

[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

Python 1,095 111 Updated Oct 17, 2025

AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation

29 Updated Jul 19, 2025

Official implementation of the paper "EmbodiedMAE: A Unified 3D Multi-Modal Representation for Robot Manipulation".

Python 7 1 Updated Jul 2, 2025

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

1,865 78 Updated Oct 31, 2025

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,264 509 Updated Mar 23, 2025

RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots

Python 975 112 Updated Sep 13, 2025

A collection of vision-language-action model post-training methods.

108 4 Updated Oct 28, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 15,459 2,220 Updated Sep 3, 2025

Book_2_《可视之美》 | 鸢尾花书:从加减乘除到机器学习,欢迎批评指正

Jupyter Notebook 3,437 718 Updated Sep 11, 2024

LeRobot sim2real code. Train in fast simulation and deploy visual policies zero shot to the real world

Python 262 29 Updated Sep 9, 2025
Python 46 4 Updated Oct 24, 2025
Next