HorizonWind2004
😇
In Desperation

Highlights

  • Pro

A Curated Collection of Frontier Language Model Architectures

5 Updated Jan 17, 2026

UniVideo: Unified Understanding, Generation, and Editing for Videos

Python 335 15 Updated Jan 8, 2026

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

Python 605 32 Updated Jun 17, 2025

PyTorch implementation of NEPA

Python 291 17 Updated Dec 24, 2025

Evaluation codes and data for GenEval2

Python 51 Updated Jan 8, 2026

Code release for "SegLLM: Multi-round Reasoning Segmentation"

Python 127 10 Updated Feb 20, 2025

The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.

Python 24 Updated Dec 30, 2025

HunyuanVideo-1.5: A leading lightweight video generation model

Python 3,410 116 Updated Jan 2, 2026
Python 70 1 Updated Aug 6, 2025

[Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 418 21 Updated Dec 22, 2024

VideoCoF: Unified Video Editing with Temporal Reasoner

Python 124 7 Updated Jan 2, 2026

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 3,437 183 Updated Jan 17, 2026

⚡ Dynamically generated stats for your github readmes

JavaScript 77,984 29,587 Updated Jan 12, 2026

official code for unigame

Python 18 Updated Nov 26, 2025

[NeurIPS 2025 Spotlight] Seeing Sound, Hearing Sight: Uncovering Modality Bias and Conflict of AI models in Sound Localization

C# 4 1 Updated Jan 2, 2026

https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT

Python 113 7 Updated Nov 1, 2025

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

Python 59 Updated Nov 27, 2025

Official inference repo for FLUX.2 models

Python 1,449 80 Updated Jan 15, 2026

Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"

Python 264 16 Updated Jan 6, 2026

Code release for "UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity"

Jupyter Notebook 70 2 Updated Nov 20, 2025

The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"

Jupyter Notebook 137 3 Updated Jan 9, 2026

Echo: "Constantly Improving Image Models Need Constantly Improving Benchmarks"

Jupyter Notebook 17 Updated Oct 20, 2025

Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing"

Python 123 4 Updated Nov 21, 2025

GIR-Bench: Versatile Benchmark for Generating Images with Reasoning

Python 30 1 Updated Oct 14, 2025

Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a cost-effective, self-iterative optimization loop.

Python 91 6 Updated Nov 26, 2025
Python 22 1 Updated Jan 2, 2026

Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Python 236 8 Updated May 30, 2025

NeurIPS 2025

Python 14 Updated Dec 18, 2025

(NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps

Python 23 2 Updated Nov 12, 2025