Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View geyan21's full-sized avatar

Highlights

  • Pro

Block or report geyan21

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

MLLM

14 repositories

✨✨Latest Advances on Multimodal Large Language Models

17,036 1,095 Updated Dec 22, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,199 2,686 Updated Aug 12, 2024

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

2,079 129 Updated Oct 27, 2025

[ECCV2024] Official code implementation of Merlin: Empowering Multimodal LLMs with Foresight Minds

Python 96 2 Updated Jul 4, 2024

Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".

Python 53 9 Updated Apr 11, 2024

Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"

Python 214 13 Updated Sep 7, 2023

[ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.

Python 61 1 Updated Oct 1, 2024

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,167 74 Updated Oct 21, 2024

[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model

Python 341 18 Updated Nov 4, 2024

JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models

Java 387 20 Updated Apr 8, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,975 134 Updated Nov 7, 2025

Universal memory layer for AI Agents

Python 44,558 4,842 Updated Dec 17, 2025