A collection of utilities for LeRobot.
A Modular Toolkit for Robot Kinematic Optimization
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Official code repository of paper "D(R, O) Grasp: A Unified Representation of Robot and Object Interaction for Cross-Embodiment Dexterous Grasping"
Universal Monocular Metric Depth Estimation
[NeurIPS 2025] CogVLA: Cognition-Aligned Vision-Language-Action Models via Instruction-Driven Routing & Sparsification
[CoRL 2025] ManiFlow: A General Robot Manipulation Policy via Consistency Flow Training
[IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Train & Deploy of iDP3
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
[arXiv 2025] GMR: General Motion Retargeting. Retarget human motions into diverse humanoid robots in real time on CPU. Retargeter for TWIST.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites
AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation
[CVPR 2024 Highlight] Diffusion-EDFs: Bi-equivariant Denoising Generative Modeling on SE(3) for Visual Robotic Manipulation
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.
RoboBrain 2.0: Advanced version of RoboBrain. See Better. Think Harder. Do Smarter.
Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots
[RSS 2025] Gripper Keypose and Object Pointflow as Interfaces for Bimanual Robotic Manipulation
[RSS 2025] Official implementation of DemoGen: Synthetic Demonstration Generation for Data-Efficient Visuomotor Policy Learning
DexWild: Dexterous Human Interactions for In-the-Wild Robot Policies
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
[CoRL 2024 Oral] D^3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement
GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Metric depth estimation from a single image
[ICCV 2025] Detect Anything 3D in the Wild