Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Colmar-zlicheng's full-sized avatar
❄️
Focusing
❄️
Focusing

Highlights

  • Pro

Block or report Colmar-zlicheng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Write PyTorch controllers, test them in simulation, and seamlessly transfer to real-time hardware.

Python 77 2 Updated Jul 1, 2021

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,398 237 Updated Jul 31, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 18,732 2,872 Updated Oct 27, 2025

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Python 1,418 134 Updated Oct 13, 2025
Jupyter Notebook 429 31 Updated Sep 26, 2024

🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.

Python 550 31 Updated Jun 23, 2025

Paper list in the survey: A Survey on Vision-Language-Action Models: An Action Tokenization Perspective

297 8 Updated Jul 3, 2025

Intel® RealSense™ SDK

C++ 8,249 4,917 Updated Oct 27, 2025

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

Python 779 90 Updated Sep 9, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 907 40 Updated Oct 13, 2025

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

1,834 75 Updated Oct 27, 2025

Kronos: A Foundation Model for the Language of Financial Markets

Python 8,389 1,742 Updated Oct 26, 2025

Bitcoin Core integration/staging tree

C++ 86,457 38,115 Updated Oct 27, 2025

Let us control diffusion models!

Python 33,214 2,975 Updated Feb 25, 2024

Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.

Python 706 96 Updated Oct 27, 2025

Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)

Jupyter Notebook 804 145 Updated Mar 28, 2025

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Jupyter Notebook 1,027 210 Updated Mar 15, 2025

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,221 507 Updated Mar 23, 2025

[TPAMI 2025] Towards Visual Grounding: A Survey

Shell 246 21 Updated Aug 19, 2025

[ICCV 2025] DyWA:Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation

Python 61 2 Updated Sep 23, 2025

Unified framework for robot learning built on NVIDIA Isaac Sim

Python 5,261 2,548 Updated Oct 25, 2025

[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,550 175 Updated Oct 27, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,496 109 Updated Oct 27, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,400 6,449 Updated Oct 27, 2025

Official implementation of OpenWBT.

Python 763 83 Updated Jul 30, 2025

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,556 72 Updated Oct 23, 2025
Next