Thanks to visit codestin.com
Credit goes to github.com

gigit0000

Follow

William Song gigit0000

Follow

Ready to dispatch!

10 followers · 41 following

Kim Baksa's Lab, South Korea
11:43 (UTC +09:00)

Achievements

Achievements

Stars

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,389 204 Updated Dec 19, 2025

xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,847 327 Updated Nov 28, 2025

cornserve-ai / cornserve

Easy, Fast, and Scalable Multimodal AI

Python 81 6 Updated Dec 19, 2025

kuterd / nv_isa_solver

Nvidia Instruction Set Specification Generator

Python 304 16 Updated Jul 9, 2024

gpu-mode / triton-index

Cataloging released Triton kernels.

277 14 Updated Sep 9, 2025

Ma-Lab-Berkeley / deep-representation-learning-book

Learning Deep Representations of Data Distributions

TeX 717 59 Updated Dec 18, 2025

siboehm / ShallowSpeed

Small scale distributed training of sequential deep learning models, built on Numpy and MPI.

Python 153 7 Updated Oct 19, 2023

Lightning-AI / forked-pdb

Python pdb for multiple processes

Python 72 9 Updated May 24, 2025

aphrodite-engine / aphrodite-engine

Large-scale LLM inference engine

C++ 1,610 178 Updated Nov 24, 2025

bloomberg / memray

Memray is a memory profiler for Python

Python 14,685 432 Updated Dec 15, 2025

jalalirs / arielml

Python 7 Updated Jul 26, 2025

ShawnZhong / compiler-explorer-triton

Forked from compiler-explorer/compiler-explorer

Triton Support in Compiler Explorer

TypeScript 5 Updated Aug 5, 2025

compiler-explorer / compiler-explorer

Run compilers interactively from your web browser and interact with the assembly

TypeScript 18,354 1,969 Updated Dec 19, 2025

Owen718 / FlexAttention-Examples

This repo provides several classic attention variant implementation based on FlexAttention API.

Python 2 1 Updated May 18, 2025

vllm-project / speculators

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python 169 22 Updated Dec 19, 2025

OpenMathLib / OpenBLAS

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

C 7,175 1,624 Updated Dec 19, 2025

kherrick / hacker-news

Hacker News

HTML 13 5 Updated Dec 20, 2025

nasa03 / llamafile

Forked from mozilla-ai/llamafile

Distribute and run LLMs with a single file.

C++ 1 Updated Jul 23, 2024

mozilla-ai / llamafile

Distribute and run LLMs with a single file.

C 23,534 1,250 Updated Dec 19, 2025

vosen / ZLUDA

CUDA on non-NVIDIA GPUs

Rust 13,673 879 Updated Dec 19, 2025

Syllo / nvtop

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

C 9,875 356 Updated Oct 25, 2025

TheCodeTraveler / HackerNews

A .NET MAUI app for displaying the top posts on Hacker News that demonstrates text sentiment analysis gathered using artificial intelligence

C# 280 40 Updated Nov 24, 2025

oz123 / awesome-c

A curated list of awesome C frameworks, libraries, resources and other shiny things. Inspired by all the other awesome-... projects out there.

10,903 908 Updated Nov 7, 2025

RoyalCities / RC-Home-Assistant-Low-VRAM

Local AI voice assistant stack for Home Assistant (GPU-accelerated) with persistent memory, follow-up conversation, and Ollama model recommendations - settings designed for low VRAM systems.

221 19 Updated Jul 27, 2025

ACIDBURN2501 / debug-macros

Debug Module for Embedded Systems

C 1 Updated May 3, 2025

gigit0000 / dia

Forked from nari-labs/dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 1 Updated Jul 6, 2025

thibmaek / awesome-raspberry-pi

📝 A curated list of awesome Raspberry Pi tools, projects, images and resources

Shell 15,582 1,071 Updated Nov 10, 2025

rogerallen / llama2.cu

Forked from karpathy/llama2.c

Inference Llama 2 in one file of pure C & one file with CUDA

C 31 1 Updated Oct 14, 2023

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,983 1,651 Updated Nov 19, 2025

JohnClaw / chatllm.v

V-lang api wrapper for llm-inference chatllm.cpp

C 6 Updated Nov 20, 2024