Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View jiangpin-legend's full-sized avatar
🎯
Focusing
🎯
Focusing
  • zhejiang university

Block or report jiangpin-legend

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Dexbotic: Open-Source Vision-Language-Action Toolbox

Python 315 24 Updated Oct 29, 2025

Official Implementation of TIP 2025: "PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting"

C++ 5 Updated Sep 20, 2025

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 850 82 Updated Oct 30, 2025

[IROS 2024] Incrementally Building Room-Scale Language-Embedded Gaussian Splats (LEGS) with a Mobile Robot

Jupyter Notebook 39 4 Updated May 7, 2025

The code for PixelRefer & VideoRefer

Jupyter Notebook 284 13 Updated Oct 28, 2025
Python 159 8 Updated Oct 30, 2025

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

Python 219 10 Updated Oct 24, 2025

starVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 267 12 Updated Oct 27, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,638 4,647 Updated Aug 19, 2024

[NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment

Python 124 5 Updated Sep 25, 2025

🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.

Python 557 31 Updated Jun 23, 2025
Python 516 38 Updated Jun 8, 2025
Python 334 20 Updated Oct 29, 2025

Survey: https://arxiv.org/pdf/2507.20198

186 13 Updated Oct 24, 2025

Official implementation for "JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation"

Python 207 7 Updated Oct 29, 2025
Python 39 3 Updated Oct 17, 2025

大麦自动抢票,支持人员、城市、日期场次、价格选择

Python 5,421 664 Updated Oct 22, 2025

Official implementation of Continuous 3D Perception Model with Persistent State

Python 1,154 61 Updated Aug 27, 2025

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 2,111 118 Updated Oct 24, 2025

VGGT 3D Vision Agent optimized for Apple Silicon with Metal Performance Shaders

Python 74 7 Updated Sep 19, 2025

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工,非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》,并做归纳、编辑与整理。CookLikeHOC.

JavaScript 21,796 2,191 Updated Oct 17, 2025

Octree-GS

Python 178 15 Updated Mar 3, 2025

[TRO 2025] OmniMap: A General Mapping Framework Integrating Optics, Geometry, and Semantics

Python 92 9 Updated Sep 11, 2025

这个文档是使用Habitat-sim的中文教程

Python 65 4 Updated Mar 10, 2023

[SIGGRAPH ASIA 2024] Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane

Python 18 3 Updated Nov 25, 2024
Python 32 Updated Jul 8, 2025

Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"

Python 278 15 Updated Sep 28, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,577 2,091 Updated Jul 17, 2025

This repository collects research papers of large Vision Language Models in Autonomous driving and Intelligent Transportation System. The repository will be continuously updated to track the lates…

415 32 Updated Apr 1, 2025
Next