This is the official code (based on the PyTorch framework) for the paper "Open-Det: An Efficient Learning Framework for Open-Ended Detection".

4 Updated May 29, 2025

[NeurIPS 2025] Official project page for the paper "PLANA3R: Zero-shot Metric Planar 3D Reconstruction via Feed-Forward Planar Splatting"

Python 41 2 Updated Oct 24, 2025
208 1 Updated Oct 15, 2025

starVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 337 14 Updated Oct 31, 2025

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Python 407 26 Updated Aug 10, 2025

Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model

Python 94 2 Updated Oct 30, 2025

EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video

Python 54 2 Updated Aug 20, 2025

DemoGrasp: Universal Dexterous Grasping from a Single Demonstration

25 Updated Sep 30, 2025
Python 218 3 Updated May 12, 2025

Omniverse Kit App Template

Python 728 234 Updated Oct 6, 2025

MomentSeg: Moment-Centric Sampling for Enhanced Video Pixel Understanding

18 Updated Oct 13, 2025

Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos

Python 176 7 Updated Sep 4, 2025

Official repository for VCoT-Grasp.

Python 10 Updated Oct 13, 2025

[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

Python 3,265 601 Updated Dec 24, 2024

Democratizing AI scientists with ToolUniverse

Python 614 92 Updated Nov 1, 2025

Official code of RDT 2

Python 552 22 Updated Oct 11, 2025

Official implementation of "Visual Instruction Pretraining for Domain-Specific Foundation Models"

Python 80 1 Updated Oct 21, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,636 1,265 Updated Oct 29, 2025

InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation

Python 51 Updated Sep 18, 2025

Fully Open Framework for Democratized Multimodal Training

Python 595 40 Updated Oct 21, 2025

WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild

Python 390 30 Updated Aug 1, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 937 43 Updated Oct 13, 2025

Building General-Purpose Robots Based on Embodied Foundation Model

Python 570 38 Updated Oct 31, 2025

[NeurIPS 2025] CogVLA: Cognition-Aligned Vision-Language-Action Models via Instruction-Driven Routing & Sparsification

Python 108 6 Updated Oct 12, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,994 2,402 Updated Nov 1, 2025

[TPAMI2025] Improving Generalized Visual Grounding with Instance-aware Joint Learning

Python 18 1 Updated Oct 29, 2025

The code implementation for the paper "DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation".

25 Updated Sep 1, 2025

A Comprehensive Survey on Continual Learning in Generative Models.

81 6 Updated Aug 12, 2025