East China Normal University
Shanghai
Starred repositories
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation (ICLR 2025)
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
Train transformer language models with reinforcement learning.
Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"
Official code for "Monet: Reasoning in Latent Visual Space Beyond Image and Language"
Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)
An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
Pixel-Level Reasoning Model trained with RL [NeurIPS 2025]
🔥 Stable, simple, state-of-the-art VQVAE toolkit & cookbook
[NeurIPS 2024 Best Paper Award] [GPT beats diffusion 🔥] [scaling laws in visual generation 📈] Official implementation of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
We present **FOCI**, a benchmark for Fine-grained Object ClassIfication for large vision language models (LVLMs).
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Official implementation of "VIRAL: Visual Representation Alignment for MLLMs".
[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
An open-source AI agent that lives in your terminal.
Up-to-date LLM adaptive-thinking papers. 🔥🔥🔥
(ICCV 2025) Official implementation of "AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models"