Stars
Universal Monocular Metric Depth Estimation
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development
DeepEP: an efficient expert-parallel communication library
The official repository for building SAT-DS, a medical data collection of over 72 public segmentation datasets, containing over 22K 3D images, 302K segmentation masks, and 497 classes from 3 different modalities.
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
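As a point of reference for the interpolant idea in the SiT title: a minimal velocity-matching training step on the linear interpolant x_t = (1 − t)·x_data + t·noise. The model call signature here is a placeholder, not SiT's actual interface.

```python
import torch

def sit_style_step(model, x_data):
    """One velocity-matching step on the linear interpolant
    x_t = (1 - t) * x_data + t * noise. The call model(x_t, t)
    is a placeholder, not SiT's actual interface."""
    noise = torch.randn_like(x_data)
    t = torch.rand(x_data.size(0), *([1] * (x_data.dim() - 1)))  # per-sample t in [0, 1)
    x_t = (1 - t) * x_data + t * noise
    v_target = noise - x_data                  # d/dt x_t for this interpolant
    v_pred = model(x_t, t.flatten())
    return torch.mean((v_pred - v_target) ** 2)
```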
[NeurIPS 2024 Best Paper Award] [GPT beats diffusion 🔥] [scaling laws in visual generation 📈] Official implementation of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction".
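A heavily simplified sketch of the next-scale decoding loop the VAR paper describes: each step predicts a full token map at the next resolution, conditioned on all coarser maps. `model`, its call signature, and `target_hw` are illustrative placeholders, not the official API.

```python
import torch

@torch.no_grad()
def next_scale_decode(model, scales=(1, 2, 4, 8, 16)):
    """Coarse-to-fine decoding: each step predicts the token map at the next
    resolution conditioned on all coarser maps. `model` and its signature
    are placeholders, not the official VAR interface."""
    token_maps = []
    for s in scales:
        logits = model(token_maps, target_hw=(s, s))   # assumed shape (s*s, vocab)
        token_maps.append(logits.argmax(dim=-1).view(s, s))
    return token_maps  # a multi-scale VQ decoder then maps these to pixels
```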
The official implementation of the paper "Towards Efficient Mixture of Experts: A Holistic Study of Compression Techniques (TMLR)".
Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
💖🧸 Self-hosted and owned by you: a Grok Companion, a container for the souls of waifus and cyber beings, bringing them into our world, aspiring to reach Neuro-sama's heights. Capable of real-time voice chat and Minecraft play.
Trainable fast and memory-efficient sparse attention
FlexAttention-based, minimal vLLM-style inference engine for fast Gemma 2 inference.
A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention
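A primitive shared by the sparse-attention entries above is a per-block keep/drop mask over the score matrix. The dense emulation below (an assumed helper, not any of these repos' APIs) illustrates the semantics only; real kernels skip masked blocks instead of materializing full scores.

```python
import torch
import torch.nn.functional as F

def block_sparse_attention(q, k, v, keep, block=64):
    """Dense emulation of block-sparse attention for illustration.
    keep[i, j] == True means query block i may attend to key block j;
    every query row needs at least one kept block (e.g. the diagonal)
    or softmax produces NaNs."""
    scores = q @ k.transpose(-2, -1) / q.size(-1) ** 0.5           # (..., Tq, Tk)
    mask = keep.repeat_interleave(block, 0).repeat_interleave(block, 1)
    scores = scores.masked_fill(~mask, float("-inf"))
    return F.softmax(scores, dim=-1) @ v

# Local (block-diagonal) pattern over a 512-token sequence:
q = k = v = torch.randn(512, 64)
keep = torch.eye(512 // 64, dtype=torch.bool)
out = block_sparse_attention(q, k, v, keep)
```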
Code implementation of GPTAQ (https://arxiv.org/abs/2504.02692)
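For orientation only: GPTAQ is a GPTQ-style calibration-based quantizer, and the baseline such methods improve on is plain round-to-nearest (RTN) per-channel weight quantization, sketched below. This is generic RTN, not the GPTAQ algorithm itself.

```python
import torch

def rtn_per_channel(w, bits=4):
    """Baseline round-to-nearest symmetric per-output-channel weight
    quantization; the reference point GPTQ-style methods improve on.
    w: (out_features, in_features)."""
    qmax = 2 ** (bits - 1) - 1
    scale = (w.abs().amax(dim=1, keepdim=True) / qmax).clamp(min=1e-8)
    q = torch.clamp(torch.round(w / scale), -qmax - 1, qmax)
    return q * scale  # dequantized weights, for measuring quantization error
```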
[ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
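One simple instance of the pruning idea in this title: rank experts by top-1 routing frequency over a calibration set and drop the rarest. A generic frequency heuristic for illustration; the paper's actual criterion may differ.

```python
import torch

def least_used_experts(router_logits, keep_ratio=0.75):
    """Return indices of the least-used experts as pruning candidates.
    router_logits: (num_tokens, num_experts) collected over calibration data."""
    num_experts = router_logits.size(-1)
    counts = torch.bincount(router_logits.argmax(dim=-1), minlength=num_experts)
    num_prune = int(num_experts * (1 - keep_ratio))
    return torch.argsort(counts)[:num_prune]   # the rarest experts first
```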
[NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models
Unveiling Super Experts in Mixture-of-Experts Large Language Models
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
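The attention-sink trick in StreamingLLM keeps the first few tokens plus a recent window in the KV cache. Below is a sketch of the index bookkeeping with a hypothetical helper `evict_kv`; a real cache also gathers the corresponding K/V tensors.

```python
def evict_kv(cache_len, max_cache=1024, num_sinks=4):
    """Positions a StreamingLLM-style cache keeps: the first `num_sinks`
    'attention sink' tokens plus the most recent window."""
    if cache_len <= max_cache:
        return list(range(cache_len))
    window_start = cache_len - (max_cache - num_sinks)
    return list(range(num_sinks)) + list(range(window_start, cache_len))
```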
Helpful tools and examples for working with flex-attention
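FlexAttention itself is a public PyTorch API (`torch.nn.attention.flex_attention`, available since PyTorch 2.5). A small sliding-window-causal example; it assumes a CUDA device, and in practice you would wrap `flex_attention` in `torch.compile` for speed.

```python
import torch
from torch.nn.attention.flex_attention import flex_attention, create_block_mask

def sliding_window_causal(b, h, q_idx, kv_idx):
    # Causal attention restricted to a 256-token local window.
    return (q_idx >= kv_idx) & (q_idx - kv_idx <= 256)

# Requires PyTorch >= 2.5 and a CUDA device.
q = k = v = torch.randn(1, 8, 1024, 64, device="cuda")
block_mask = create_block_mask(sliding_window_causal, B=None, H=None,
                               Q_LEN=1024, KV_LEN=1024)
out = flex_attention(q, k, v, block_mask=block_mask)
```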
Unified KV Cache Compression Methods for Auto-Regressive Models
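One widely used family of KV-cache compression scores past positions by accumulated attention mass and evicts the lowest. The sketch below is an H2O-style heavy-hitter heuristic for illustration; the repo unifies several such methods.

```python
import torch

def heavy_hitter_keep(attn_weights, budget):
    """Keep the `budget` KV positions with the largest accumulated attention
    mass. attn_weights: (num_heads, q_len, kv_len) from recent decode steps."""
    scores = attn_weights.sum(dim=(0, 1))        # total mass per KV position
    keep = torch.topk(scores, k=budget).indices
    return torch.sort(keep).values               # preserve positional order
```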
[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation
This repository contains the complete research and analysis materials from reverse-engineering Claude Code v1.0.33, including in-depth technical analysis of the obfuscated source code, system architecture documentation, and an implementation blueprint for rebuilding the Claude Code agent system. Key findings include the real-time steering mechanism, the multi-agent architecture, intelligent context management, and the tool-execution pipeline. The project serves as a technical reference for understanding the design and implementation of modern AI agent systems.
Code for DeCo: Decoupling token compression from semantic abstraction in multimodal large language models
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
MiniCPM-V 4.5: A GPT-4o-Level MLLM for Single-Image, Multi-Image, and High-FPS Video Understanding on Your Phone
Code for CVPR'24 best paper: Rich Human Feedback for Text-to-Image Generation (https://arxiv.org/pdf/2312.10240)
Famous Vision Language Models and Their Architectures