Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View rumisle's full-sized avatar
๐Ÿ›๏ธ
Oh
๐Ÿ›๏ธ
Oh
  • 07:32 (UTC)

Block or report rumisle

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Kimi CLI is your next CLI agent.

Python 2,038 145 Updated Oct 30, 2025

A scrollable-tiling Wayland compositor.

Rust 14,249 500 Updated Oct 30, 2025

Fast and memory-efficient exact attention

Python 20,236 2,097 Updated Oct 29, 2025

llvm-mctoll

C++ 867 123 Updated Jun 22, 2024

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,158 82 Updated Aug 28, 2025

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 922 77 Updated Sep 4, 2024

Async RL Training at Scale

Python 730 121 Updated Oct 30, 2025

A collection of LLM memes

282 4 Updated Sep 22, 2025

Learn CUDA with PyTorch

Cuda 95 12 Updated Sep 24, 2025
Python 447 35 Updated Aug 28, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 3,952 223 Updated Aug 15, 2025

Tensor library & inference framework for machine learning

C++ 113 5 Updated Oct 3, 2025

A CLI tool for managing Claude instances with git worktree

Rust 101 11 Updated Oct 27, 2025

CUDA kernel author's tools

Cuda 113 8 Updated Apr 24, 2022
C++ 309 26 Updated Oct 1, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 545 33 Updated Oct 30, 2025

Simple & Scalable Pretraining for Neural Architecture Research

Python 298 30 Updated Oct 28, 2025

Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth

Python 200 9 Updated Oct 21, 2025

A Tree Search Library with Flexible API for LLM Inference-Time Scaling

Python 481 62 Updated Oct 18, 2025

MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user develop their prompts into full models.

Python 181 12 Updated Oct 11, 2025

PyTorch Single Controller

Rust 829 97 Updated Oct 30, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,306 232 Updated Oct 30, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,918 144 Updated Oct 30, 2025

Manage resources and move them between hardware contexts

Rust 2 1 Updated Feb 23, 2025

๐Ÿš€ Efficient implementations of state-of-the-art linear attention models

Python 3,591 279 Updated Oct 30, 2025

Tenstorrent Blackhole P100/P150 card RISC-V Linux demo ๐Ÿง

C++ 37 4 Updated Oct 14, 2025

Fast low-bit matmul kernels in Triton

Python 387 28 Updated Oct 26, 2025

A collection of formalized statements of conjectures in Lean.

Lean 653 85 Updated Oct 29, 2025
Next