Lists (18)
Sort Name ascending (A-Z)
Stars
This MCP server provides tools for listing and retrieving content from different knowledge bases.
Build a knowledge base into a tar.gz and give it to this MCP server, and it is ready to serve.
Build your personal knowledge base with Trilium Notes
A Model Context Protocol (MCP) server enabling AI assistants to interact with Outline documentation services.
The fastest knowledge base for growing teams. Beautiful, realtime collaborative, feature packed, and markdown compatible.
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
The official codebase of paper "GMT: General Motion Tracking for Humanoid Whole-Body Control"
🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
A versatile, all-in-one toolbox for whole-body humanoid robot control.
[RSS 2025] "ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills"
Official Implementation of "KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills"
[arXiv 2025] GMR: General Motion Retargeting. Retarget human motions into diverse humanoid robots in real time on CPU. Retargeter for TWIST.
A Paper List for Humanoid Robot Learning.
[IROS 2024] Learning Human-to-Humanoid Real-Time Whole-Body Teleoperation. [CoRL 2024] OmniH2O: Universal and Dexterous Human-to-Humanoid Whole-Body Teleoperation and Learning
Dexbotic: Open-Source Vision-Language-Action Toolbox
Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."
Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
An open-source, GPU-accelerated physics simulation engine built upon NVIDIA Warp, specifically targeting roboticists and simulation researchers.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
[NeurIPS 2025] CogVLA: Cognition-Aligned Vision-Language-Action Models via Instruction-Driven Routing & Sparsification
[NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
KV cache compression for high-throughput LLM inference