Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View BolinSNLHM's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report BolinSNLHM

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

Python 440 16 Updated Dec 16, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,118 1,630 Updated Jan 15, 2026

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 2,131 243 Updated Jan 13, 2026
Rust 1,476 157 Updated Aug 8, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,957 287 Updated May 15, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,098 801 Updated Jan 16, 2026

A book for Learning the Foundations of LLMs

15,514 1,459 Updated Dec 12, 2025

An extremely fast Python linter and code formatter, written in Rust.

Rust 45,186 1,698 Updated Jan 18, 2026

An ML Systems Onboarding list

970 36 Updated Jan 24, 2025

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …

Python 32,196 1,925 Updated Jan 6, 2026
Rust 289 53 Updated Nov 27, 2024

KECC: KAIST Educational C Compiler. IMPORTANT: DON'T FORK!

Rust 176 19 Updated Jun 13, 2025

A Easy-to-understand TensorOp Matmul Tutorial

C++ 404 52 Updated Jan 10, 2026

A categorized list of C++ resources.

5,197 524 Updated Jan 18, 2026

KAIST CS420: Compiler Design

547 32 Updated Apr 3, 2025

compiler learning resources collect.

Python 2,664 364 Updated Mar 19, 2025

how to optimize some algorithm in cuda.

Cuda 2,770 250 Updated Jan 16, 2026

A curated list for Efficient Large Language Models

Python 1,933 148 Updated Jun 17, 2025

Awesome-LLM: a curated list of Large Language Model

26,040 2,262 Updated Jul 31, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,744 12,652 Updated Jan 18, 2026

Inference Llama 2 in one file of pure C

C 19,117 2,437 Updated Aug 6, 2024

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,388 925 Updated Jan 18, 2026

GPU programming related news and material links

1,910 112 Updated Sep 17, 2025

High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.

Cuda 124 7 Updated Jul 13, 2024

[EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs

Cuda 79 3 Updated Jun 7, 2024

程序员延寿指南 | A programmer's guide to live longer

34,658 2,373 Updated May 19, 2025

An open-source efficient deep learning framework/compiler, written in python.

Python 739 68 Updated Sep 4, 2025

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,717 323 Updated Oct 19, 2024

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,110 2,307 Updated Sep 3, 2025
Next