Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View choidaedae's full-sized avatar

Block or report choidaedae

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method (CVPR-25)

Jupyter Notebook 225 13 Updated Aug 20, 2025
Python 119 18 Updated Jul 9, 2024

Implementation of VLM4VLA

Python 69 2 Updated Jan 18, 2026

Official implementation of paper Neural Green’s Functions (NeurIPS 2025)

Python 4 Updated Jan 15, 2026

Implementation for VPBench proposed in paper Visually Prompted Benchmarks Are Surprisingly Fragile

Python 7 Updated Jan 14, 2026

[RA-L 2025] FrontierNet: Learning Visual Cues to Explore

Python 138 9 Updated Jan 15, 2026

The official repository of BEAR: Benchmarking and Enhancing Multimodal Language Models with Atomic Embodied Capabilities

26 1 Updated Oct 26, 2025

Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"

Python 72 8 Updated Jul 5, 2024

Vision-and-Language Navigation in Continuous Environments using Habitat

Python 691 76 Updated Jan 7, 2025

Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"

Python 376 25 Updated Nov 2, 2025

A simulation platform for versatile Embodied AI research and developments.

Python 1,172 68 Updated Sep 4, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 21,007 3,521 Updated Jan 16, 2026
Python 127 6 Updated Nov 19, 2025

Thinking in 360°: Humanoid Visual Search in the Wild

Python 108 Updated Dec 5, 2025

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

Python 976 114 Updated Sep 9, 2025

Training Visual Reasoners with Multimodal Verifiers

Jupyter Notebook 9 Updated Dec 13, 2025

The Best Agent Harness. Meet Sisyphus: The Batteries-Included Agent that codes like you.

TypeScript 19,330 1,347 Updated Jan 19, 2026

[NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocentric scenarios.

Python 22 1 Updated Jun 17, 2025
Python 3 Updated Dec 17, 2025

[ECCV 2024 Oral 🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces ------------------------ [ICCVW 2025] ID-Consistent, Precise Expression Generation with Blendshape-Guided Diffusion

Python 778 55 Updated Oct 10, 2025

Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection

Jupyter Notebook 19 Updated Jan 5, 2026

VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction

Python 326 24 Updated Sep 1, 2025

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 169,847 53,773 Updated Jan 18, 2026

Official implementation of "Repurposing Video Diffusion Transformers for Robust Point Tracking"

Python 31 4 Updated Dec 24, 2025

Dexterous World Models

68 Updated Dec 22, 2025

Code for "EgoX: Egocentric Video Generation from a Single Exocentric Video"

Python 478 25 Updated Jan 15, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 66,038 8,025 Updated Jan 17, 2026

Unofficial reimplementation of VLA-0 using TRL's SFTTrainer.

Python 57 5 Updated Jan 15, 2026

Native and Compact Structured Latents for 3D Generation

Python 2,999 267 Updated Jan 10, 2026
Next