Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Jupyter Notebook 52 2 Updated Jan 26, 2026

This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).

1,192 71 Updated Jan 26, 2026

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,579 458 Updated Oct 27, 2025

Millions-Level Face/Human-Scene Image-Text Datasets

24 Updated Jun 9, 2025

[IEEE T-BIOM] FaceXBench: Evaluating Multimodal LLMs on Face Understanding

Python 20 1 Updated Jan 15, 2026

Famous Vision Language Models and Their Architectures

Markdown 1,163 54 Updated Jan 11, 2026

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 3,751 617 Updated Jan 15, 2026

PyTorch implementation of Real-ESRGAN model

Python 627 217 Updated Apr 15, 2024

A Multimodal Large Language Model for Face Understanding

Python 5 Updated Jul 30, 2025

Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).

Python 416 11 Updated Aug 26, 2025

A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.

Python 4,242 586 Updated Jan 26, 2026

neosr is an open-source framework for training super-resolution models.

Python 304 46 Updated Jun 2, 2025
Python 3 Updated Sep 3, 2024

🚀 An awesome list of curated Nano Banana Pro prompts and examples. Your go-to resource for mastering prompt engineering and exploring the creative potential of the Nano Banana Pro (Nano Banana 2) AI…

8,537 730 Updated Jan 14, 2026

SpotEdit: Selective Region Editing in Diffusion Transformers

Python 168 8 Updated Jan 5, 2026

This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation'

37 Updated Dec 30, 2025
Python 25 2 Updated Dec 30, 2025

[AAAI 2026] Official implementation of the paper "IdentityStory: Taming Your Identity-Preserving Generator for Human-Centric Story Generation".

8 Updated Jan 18, 2026

ThinkGen: Generalized Thinking for Visual Generation

Python 46 Updated Dec 30, 2025

A large-scale dataset for training and evaluating models' ability in dense-text image generation

Python 86 Updated Sep 27, 2025

A curated list of research papers, resources, and advancements on Diffusion Cache and related efficient diffusion model acceleration techniques.

69 3 Updated Nov 4, 2025

[CVPR 2022] We unify pixel-to-pixel transformation and color-to-color transformation in a coherent framework for high-resolution image harmonization. We also release 100 high-resolution real compos…

Python 141 13 Updated May 26, 2025

RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing

Python 58 2 Updated Dec 26, 2025

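🧑‍🚀 Summary of the world's best LLM resources (multimodal generation, agents, coding assistance, AI paper review, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).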

7,416 722 Updated Jan 23, 2026

All my self trained & released AI upscaling models. After gathering and applying over 600 different upscaling models, I learned how to train my own models, and these are the results.

Python 544 36 Updated Nov 14, 2025

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards.

Python 310 19 Updated Dec 29, 2025

Official codes for DeSRA (ICML 2023)

Python 141 Updated Feb 2, 2024

Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022

Python 280 25 Updated Apr 11, 2022

HQ-50K: A Large-scale, High-quality Dataset for Image Restoration

Python 88 6 Updated May 9, 2024