Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Luckydog-lhy's full-sized avatar

Block or report Luckydog-lhy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation''

Python 2,325 197 Updated Oct 20, 2025

Pytorch0.4.1 codes for InsightFace

Jupyter Notebook 1,867 425 Updated Nov 22, 2022

Official Pytorch implementation of 6DRepNet: 6D Rotation representation for unconstrained head pose estimation.

Python 611 85 Updated Jul 2, 2024

Code for Group Critical-token Policy Optimization for Autoregressive Image Generation

Python 57 Updated Oct 4, 2025

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Jupyter Notebook 965 102 Updated Jan 3, 2024

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,156 1,660 Updated Sep 24, 2025

State-of-the-art 2D and 3D Face Analysis Project

Python 26,895 5,800 Updated Sep 27, 2025

OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation

Python 225 5 Updated Sep 22, 2025

Ultralytics YOLO 🚀

Python 48,039 9,273 Updated Oct 30, 2025

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 294 10 Updated Oct 16, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,959 379 Updated Oct 30, 2025

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 32,996 4,120 Updated Aug 6, 2024

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,777 107 Updated Sep 27, 2024

[ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation

Python 177 5 Updated May 21, 2025

[ICCV2025] PyTorch implementation of "Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models"

Python 107 4 Updated Jul 25, 2025

大模型多维度中文对齐评测基准 (ACL 2024)

Python 418 30 Updated Oct 25, 2025

3000000+语义理解与匹配数据集。可用于无监督对比学习、半监督学习等构建中文领域效果最好的预训练模型

Python 305 41 Updated Oct 11, 2022

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 61,401 10,887 Updated Oct 30, 2025

Official Implementation of GENIUS: A Generative Framework for Universal Multimodal Search, CVPR 2025

Python 35 Updated Aug 8, 2025

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 1,719 101 Updated Oct 28, 2025

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,587 524 Updated Aug 29, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,155 1,111 Updated Sep 26, 2025

Official inference repo for FLUX.1 models

Python 24,566 1,803 Updated Jul 31, 2025

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 639 42 Updated Oct 16, 2024

Mobile-Agent: The Powerful GUI Agent Family

Python 6,139 614 Updated Oct 28, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,012 529 Updated Oct 29, 2025

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]

Python 449 41 Updated Oct 24, 2025

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Python 378 27 Updated Aug 19, 2025

Build resilient language agents as graphs.

Python 20,388 3,594 Updated Oct 29, 2025

The absolute trainer to light up AI agents.

Python 4,015 272 Updated Oct 30, 2025
Next