IDKiro

IDKiro IDKiro

Stroll in the abyss

165 followers · 7 following

Zhejiang University
Hangzhou

Achievements

Stars

MiniMax-AI / MiniMax-M2.1

MiniMax M2.1, a SOTA model for real-world dev & agents.

166 10 Updated Dec 26, 2025

facebookresearch / pixio

Pixio: a capable vision encoder dedicated to dense prediction, simply by pixel reconstruction

Python 268 8 Updated Dec 26, 2025

MiniMax-AI / VTP

Towards Scalable Pre-training of Visual Tokenizers for Generation

Python 369 8 Updated Dec 16, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,279 12,221 Updated Dec 27, 2025

yunncheng / OmniAID

Official PyTorch Code for "OmniAID: Decoupling Semantic and Artifacts for Universal AI-Generated Image Detection in the Wild".

Python 12 Updated Dec 11, 2025

yuemingPAN / SFD

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Python 296 3 Updated Dec 21, 2025

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,145 135 Updated Dec 15, 2025

PaperDebugger / paperdebugger

Paper Debugger is the best overleaf companion

TypeScript 1,180 56 Updated Dec 21, 2025

Emericen / tiny-qwen

A minimal PyTorch re-implementation of Qwen3 VL with a fancy CLI

Python 294 17 Updated Dec 2, 2025

Tongyi-MAI / Z-Image

Python 8,018 470 Updated Dec 25, 2025

Tencent-Hunyuan / HunyuanVideo-1.5

HunyuanVideo-1.5: A leading lightweight video generation model

Python 2,208 104 Updated Dec 25, 2025

lillian039 / VARC

Python 166 9 Updated Nov 26, 2025

LTH14 / JiT

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,874 112 Updated Dec 8, 2025

tyfeld / MMaDA-Parallel

Official Implementation of "MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation"

Python 281 7 Updated Nov 19, 2025

rbalestr-lab / lejepa

Python 780 67 Updated Dec 9, 2025

NVIDIA / NVTX

The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.

C++ 491 66 Updated Dec 26, 2025

facebookresearch / perception_models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,003 133 Updated Dec 18, 2025

alexzhang13 / flashattention2-custom-mask

Triton implementation of FlashAttention2 that adds Custom Masks.

Python 157 15 Updated Aug 14, 2024

OpenGVLab / unmasked_teacher

[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Python 343 19 Updated May 27, 2024

KlingTeam / VMoBA

Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"

Python 58 3 Updated Jul 1, 2025

FoundationVision / InfinityStar

[NeurIPS 2025 Oral]Infinity⭐️: Uniﬁed Spacetime AutoRegressive Modeling for Visual Generation

Python 670 24 Updated Nov 27, 2025

nvidia-cosmos / cosmos-predict2.5

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 548 48 Updated Dec 20, 2025

kingToolbox / WindTerm

A professional cross-platform SSH/Sftp/Shell/Telnet/Tmux/Serial terminal.

C 29,168 2,251 Updated Mar 11, 2025

iterate-ch / cyberduck

Cyberduck is a libre FTP, SFTP, WebDAV, Amazon S3, Backblaze B2, Microsoft Azure & OneDrive and OpenStack Swift file transfer client for Mac and Windows.

Java 4,146 326 Updated Dec 26, 2025

MiniMax-AI / MiniMax-M2

MiniMax-M2, a model built for Max coding & agentic workflows.

2,150 164 Updated Nov 13, 2025

Yuliang-Liu / MultimodalOCR

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Python 776 55 Updated Jul 5, 2025

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,553 369 Updated Dec 24, 2025

MCDFsteve / NipaPlay-Reload

NipaPlay-Reload 是一个现代化的跨平台本地视频播放器，支持 Windows、macOS、Linux、Android 和 iOS。集成了弹幕显示、多格式字幕支持、多音频轨道切换，新番查看等功能，支持挂载Emby/Jellyfin媒体库。采用 Flutter 开发，提供统一的用户体验。

Dart 1,083 48 Updated Dec 20, 2025

AMAP-ML / EPG

124 3 Updated Dec 8, 2025

bytetriper / RAE

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,656 55 Updated Dec 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

IDKiro IDKiro

Achievements

Achievements

Block or report IDKiro

Stars

MiniMax-AI / MiniMax-M2.1

facebookresearch / pixio

MiniMax-AI / VTP

vllm-project / vllm

yunncheng / OmniAID

yuemingPAN / SFD

OpenGVLab / InternVideo

PaperDebugger / paperdebugger

Emericen / tiny-qwen

Tongyi-MAI / Z-Image

Tencent-Hunyuan / HunyuanVideo-1.5

lillian039 / VARC

LTH14 / JiT

tyfeld / MMaDA-Parallel

rbalestr-lab / lejepa

NVIDIA / NVTX

facebookresearch / perception_models

alexzhang13 / flashattention2-custom-mask

OpenGVLab / unmasked_teacher

KlingTeam / VMoBA

FoundationVision / InfinityStar

nvidia-cosmos / cosmos-predict2.5

kingToolbox / WindTerm

iterate-ch / cyberduck

MiniMax-AI / MiniMax-M2

Yuliang-Liu / MultimodalOCR

QwenLM / Qwen-Image

MCDFsteve / NipaPlay-Reload

AMAP-ML / EPG

bytetriper / RAE