Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View zhijl's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Xidian University

Block or report zhijl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

GPU documentation for humans

Python 397 47 Updated Nov 10, 2025

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLM, VLM, and video generation models.

Python 616 63 Updated Nov 10, 2025

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 47,694 6,665 Updated Jun 11, 2025

The best ChatGPT that $100 can buy.

Python 36,468 4,374 Updated Nov 5, 2025

dInfer: An Efficient Inference Framework for Diffusion Language Models

Python 299 26 Updated Nov 7, 2025

An algorithm for weight-activation quantization (W4A4, W4A8) of LLMs, supporting both static and dynamic quantization

Python 164 14 Updated Oct 31, 2025

My learning notes/codes for ML SYS.

Python 4,136 250 Updated Nov 10, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 15,158 1,730 Updated Nov 7, 2025

Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU.

Python 705 59 Updated Nov 12, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,613 1,128 Updated Nov 12, 2025

A Quirky Assortment of CuTe Kernels

Python 653 60 Updated Oct 30, 2025

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 697 77 Updated Nov 12, 2025

AGENTS.md — a simple, open format for guiding coding agents

TypeScript 8,101 631 Updated Oct 22, 2025

开源白板工具(SaaS),一体化白板,包含思维导图、流程图、自由画等。All in one open-source whiteboard tool with mind, flowchart, freehand and etc.

TypeScript 12,358 980 Updated Nov 10, 2025

AI 视频笔记生成工具 让 AI 为你的视频做笔记

Python 4,099 485 Updated Oct 18, 2025

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 81,530 6,028 Updated Nov 12, 2025

青稞Talk

160 1 Updated Nov 5, 2025

【C++面试&C++学习指南】 这里整理了C++后端研发工程师面试和工作必备的知识点 。

2,918 421 Updated Apr 14, 2025

🔥中文 prompt 精选🔥,ChatGPT 使用指南,提升 ChatGPT 可玩性和可用性!🚀

5,108 450 Updated Oct 22, 2025

Interactive Pytorch forward pass visualization in notebooks

Python 607 24 Updated Nov 1, 2025

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 977 87 Updated Nov 12, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 15,582 2,236 Updated Sep 3, 2025

🧡 Folo is the AI Reader

TypeScript 35,743 1,767 Updated Nov 12, 2025

Nano vLLM

Python 8,694 1,051 Updated Nov 3, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,390 279 Updated Nov 12, 2025

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Python 757 26 Updated Oct 13, 2025

A research prototype of a human-centered web agent

Python 7,925 823 Updated Nov 3, 2025

The official implementation of the EMNLP 2023 paper LLM-FP4

Python 217 21 Updated Dec 15, 2023

SkyReels-V2: Infinite-length Film Generative model

Python 4,936 704 Updated Aug 11, 2025
Next