Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View KD-TAO's full-sized avatar

Block or report KD-TAO

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[arXiv 2026] dVoting: Fast Voting for dLLMs

Python 19 3 Updated Feb 13, 2026

Elevate your AI research writing, no more tedious polishing ✨

5,702 439 Updated Feb 11, 2026

Awesome streaming video understanding

5 Updated Jan 16, 2026

[EMNLP 2025 Main] Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models

Python 62 3 Updated Feb 4, 2026
JavaScript 10 Updated Oct 26, 2025

slime is an LLM post-training framework for RL Scaling.

Python 4,035 522 Updated Feb 13, 2026

Autonomous Agents (LLMs) research papers. Updated Daily.

1,145 84 Updated Jan 29, 2026
Python 19 Updated Jan 30, 2026

OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding

7 Updated Jan 3, 2026

[NeurIPS'25] FreqExit: Enabling Early-Exit Inference for Visual Autoregressive Models via Frequency-Aware Guidance

Python 15 Updated Dec 15, 2025

Academic Personal Homepage of Keda Tao

HTML 2 Updated Feb 1, 2026

OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models

Python 53 2 Updated Feb 1, 2026

This repo integrates DyCoke's token compression method with VLMs such as Gemma3 and InternVL3

Python 5 Updated Nov 11, 2025

[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198

301 20 Updated Feb 10, 2026

All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.

Python 1,314 61 Updated Feb 13, 2026

[ICLR'25] Streaming Video Question-Answering with In-context Video KV-Cache Retrieval

Python 101 6 Updated Nov 4, 2025

VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models

Python 25 Updated Mar 26, 2025

[CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models

Python 100 4 Updated Nov 22, 2025

[ICLR 2025 Spotlight] Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model

12 Updated Apr 23, 2025

face attribute classification based on pytorch

Python 25 3 Updated Jan 27, 2021

这是一个arcface-pytorch的源码,可以用于训练自己的模型。

Python 209 31 Updated Aug 27, 2023