Luckydog-lhy

lhy Luckydog-lhy

6 followers · 0 following

Achievements

Stars

dvlab-research / DreamOmni2

This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation''

Python 2,325 197 Updated Oct 20, 2025

TreB1eN / InsightFace_Pytorch

Pytorch0.4.1 codes for InsightFace

Jupyter Notebook 1,867 425 Updated Nov 22, 2022

thohemp / 6DRepNet

Official Pytorch implementation of 6DRepNet: 6D Rotation representation for unconstrained head pose estimation.

Python 611 85 Updated Jul 2, 2024

zghhui / GCPO

Code for Group Critical-token Policy Optimization for Autoregressive Image Generation

Python 57 Updated Oct 4, 2025

kakaobrain / rq-vae-transformer

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Jupyter Notebook 965 102 Updated Jan 3, 2024

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,156 1,660 Updated Sep 24, 2025

deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project

Python 26,895 5,800 Updated Sep 27, 2025

onecat-ai / OneCAT

OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation

Python 225 5 Updated Sep 22, 2025

ultralytics / ultralytics

Ultralytics YOLO 🚀

Python 48,039 9,273 Updated Oct 30, 2025

HorizonWind2004 / reconstruction-alignment

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 294 10 Updated Oct 16, 2025

InternLM / xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,959 379 Updated Oct 30, 2025

xinntao / Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 32,996 4,120 Updated Aug 6, 2024

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,777 107 Updated Sep 27, 2024

wusize / Harmon

[ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation

Python 177 5 Updated May 21, 2025

nonwhy / PURE

[ICCV2025] PyTorch implementation of "Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models"

Python 107 4 Updated Jul 25, 2025