  • East China Normal University
  • Shanghai

Starred repositories

A crawler for WeChat Official Account articles

Python 3,338 755 Updated Apr 18, 2024

LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation (ICLR 2025)

Python 37 3 Updated Feb 5, 2025
Python 7 Updated Jan 9, 2026

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,747 617 Updated Jan 15, 2026

Train transformer language models with reinforcement learning.

Python 17,143 2,449 Updated Jan 26, 2026

Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"

Python 273 16 Updated Jan 6, 2026

Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"

Python 120 2 Updated Dec 30, 2025

Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"

Python 54 1 Updated Dec 17, 2025

Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)

Python 234 16 Updated Aug 2, 2025

A paper list of Awesome Latent Space.

305 9 Updated Jan 21, 2026

An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"

Python 178 6 Updated Dec 26, 2025

[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'

Python 333 16 Updated Apr 20, 2025

Flickr-Faces-HQ Dataset (FFHQ)

Python 4,094 605 Updated Nov 18, 2022

Pixel-Level Reasoning Model trained with RL [NeurIPS25]

Python 268 10 Updated Nov 6, 2025

🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook

Python 105 8 Updated Jun 23, 2024

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,595 556 Updated Nov 10, 2025

We present **FOCI**, a benchmark for Fine-grained Object ClassIfication for large vision language models (LVLMs).

Jupyter Notebook 18 1 Updated Jun 21, 2024

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,522 78 Updated Mar 16, 2025

Official implementation of "VIRAL: Visual Representation Alignment for MLLMs".

Python 146 8 Updated Sep 21, 2025

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Python 131 11 Updated Sep 11, 2025
Python 224 19 Updated Nov 5, 2025

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 15,266 1,706 Updated Jun 25, 2025

An open-source AI agent that lives in your terminal.

TypeScript 17,744 1,554 Updated Jan 26, 2026
Jupyter Notebook 1,283 156 Updated Jan 4, 2026

UP-TO-DATE LLM Adaptive thinking paper. 🔥🔥🔥

13 1 Updated Jul 31, 2025

(ICCV 2025) This repository is the official implementation of AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models

Python 157 5 Updated Jul 22, 2025