Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View geyan21's full-sized avatar

Highlights

  • Pro

Block or report geyan21

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR'25] DepthSplat: Connecting Gaussian Splatting and Depth

Python 1,094 57 Updated Apr 27, 2025

Galaxea's first VLA release

Python 324 18 Updated Oct 23, 2025

Official code of RDT 2

Python 605 29 Updated Dec 3, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,318 1,449 Updated Nov 28, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,129 62 Updated Oct 13, 2025

[CoRL 2025] ManiFlow: A General Robot Manipulation Policy via Consistency Flow Training

Python 142 12 Updated Nov 13, 2025

Code for RSS 2025 paper "Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning Policies"

Python 24 5 Updated Jun 18, 2025

Visual Imitation Enables Contextual Humanoid Control. CoRL 2025, Best Student Paper Award.

Python 646 45 Updated Nov 25, 2025

Cameras as Relative Positional Encoding

Python 633 11 Updated Dec 18, 2025

[CoRL 2025] RISE-2: A Generalizable Imitation Learning Policy

Python 55 Updated Nov 29, 2025

[NeurIPS 2025, Spotlight] Rectified Point Flow: Generic Point Cloud Pose Estimation

Python 160 11 Updated Dec 2, 2025

🦾 A Dual-System VLA with System2 Thinking

Python 123 1 Updated Aug 21, 2025

[CVPR 2025] UniK3D: Universal Camera Monocular 3D Estimation

Python 649 55 Updated Sep 14, 2025

Universal Monocular Metric Depth Estimation

Python 1,098 102 Updated May 18, 2025
Python 165 11 Updated Nov 27, 2025

[NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"

Python 213 6 Updated Dec 16, 2025

Nvidia GEAR Lab's initiative to solve the robotics data problem using world models

Jupyter Notebook 421 41 Updated Oct 24, 2025

NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.

Jupyter Notebook 5,661 889 Updated Dec 18, 2025

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,203 95 Updated Dec 17, 2025
Jupyter Notebook 86 4 Updated Sep 23, 2025

DexUMI: Using Human Hand as the Universal Manipulation Interface for Dexterous Manipulation

C 153 14 Updated Oct 2, 2025

Pytorch Implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.

Python 983 58 Updated Dec 17, 2025

Attention mappers and visualisation for transformer-based Physical AI policies

Python 139 17 Updated Nov 17, 2025

[ICLR 25] Code for "Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning"

C++ 110 16 Updated May 16, 2025

[ICRA 25] FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning

Python 42 2 Updated Jan 5, 2025
Python 43 7 Updated Apr 2, 2025

✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints

Python 78 1 Updated Jul 10, 2025
Next