Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View webstorms's full-sized avatar

Block or report webstorms

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 15,256 1,754 Updated Nov 7, 2025
Jupyter Notebook 8 1 Updated Sep 7, 2025

wake word engine benchmark framework

Python 146 29 Updated Aug 25, 2025

An Open-source FPGA IP Generator

Verilog 1,016 180 Updated Nov 13, 2025

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Python 1,891 168 Updated Oct 27, 2025

A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.

Jupyter Notebook 953 74 Updated May 7, 2024

Verilog to Routing -- Open Source CAD Flow for FPGA Research

C++ 1,162 430 Updated Nov 13, 2025

Open-source implementation of AlphaEvolve

Python 4,532 669 Updated Nov 12, 2025

My take on Flow Matching

Jupyter Notebook 84 12 Updated Jan 11, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 7,747 793 Updated Nov 13, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,389 1,522 Updated Apr 24, 2025

FFmpeg Assembly Language Lessons

11,129 346 Updated Nov 7, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,255 3,961 Updated Nov 12, 2025

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,989 525 Updated Apr 11, 2025

Fast CUDA matrix multiplication from scratch

Cuda 940 141 Updated Sep 2, 2025

Code for studying the super weight in LLM

Jupyter Notebook 120 14 Updated Dec 3, 2024
Jupyter Notebook 38 2 Updated Jan 3, 2025

Plain pytorch implementation of LLaMA

Python 188 28 Updated May 22, 2023
Jupyter Notebook 11 Updated May 21, 2024

Flow-matching algorithms in JAX

Python 106 4 Updated Aug 12, 2024

Implementation of normalizing flows from 1d to Nd

Jupyter Notebook 36 9 Updated Feb 19, 2021

An introduction to ARM64 assembly on Apple Silicon Macs

Assembly 4,828 317 Updated Mar 25, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 8,437 832 Updated Nov 6, 2025

A minimalistic full working bitcoin miner implemented in python.

Python 51 26 Updated Mar 23, 2020

Material for gpu-mode lectures

Jupyter Notebook 5,292 533 Updated Sep 23, 2025

GPU programming related news and material links

1,787 105 Updated Sep 17, 2025

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Jupyter Notebook 18,034 2,547 Updated Oct 3, 2025

Supervised Spiking Network

Jupyter Notebook 81 18 Updated Jun 25, 2022

A Free and Open Source Python Library for Multiobjective Optimization

Python 633 159 Updated Nov 14, 2025
Next