🌴
bodhicitta
  • AMD, MooreThreads
  • Shanghai


Efficient Triton Kernels for LLM Training

Python 5,781 421 Updated Oct 28, 2025

FlagGems is an operator library for large language models implemented in the Triton Language.

Python 743 146 Updated Oct 30, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,844 726 Updated Oct 15, 2025

ChatGPT CLI is a versatile tool for interacting with LLMs through OpenAI, Azure, and other popular providers like Perplexity AI and Llama. It supports prompt files, history tracking, and live data …

Go 828 52 Updated Oct 9, 2025

A client implementation for ChatGPT and Bing AI. Available as a Node.js module, REST API server, and CLI app.

JavaScript 4,201 724 Updated Jan 27, 2024

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 37,142 3,764 Updated Oct 29, 2025

Convert ONNX models to PyTorch.

Python 705 85 Updated Oct 14, 2025

A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment…

Python 1,486 187 Updated Oct 30, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,611 5,056 Updated Oct 30, 2025
Python 2 Updated Jan 28, 2025

Development repository for the Triton language and compiler

MLIR 17,401 2,349 Updated Oct 30, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 27,627 2,733 Updated Apr 30, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,918 144 Updated Oct 30, 2025

C implementation of the L-Mul f32/f16 multiplications from the paper: https://arxiv.org/html/2410.00907

C 28 Updated Oct 12, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 18,231 3,524 Updated Oct 30, 2025

An open-source computer vision framework to build and deploy apps in minutes

Rust 767 41 Updated May 8, 2024

A GStreamer Deep Learning Inference Framework

C 131 30 Updated Nov 7, 2023

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,760 734 Updated Oct 29, 2025

Datasets, Transforms and Models specific to Computer Vision

Python 17,264 7,168 Updated Oct 29, 2025

Ultralytics YOLO 🚀

Python 48,045 9,275 Updated Oct 30, 2025

PyTorch native quantization and sparsity for training and inference

Python 2,467 355 Updated Oct 30, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65 2 Updated Oct 28, 2024

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 800 133 Updated Oct 29, 2025

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

C 8,332 2,168 Updated Sep 5, 2025

Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Cuda 3 1 Updated Sep 13, 2024

Tensor library for machine learning

C++ 13,342 1,375 Updated Oct 29, 2025

Diffusion model (SD, Flux, Wan, Qwen Image, ...) inference in pure C/C++

C++ 4,495 435 Updated Oct 28, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,578 2,091 Updated Jul 17, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,248 1,097 Updated Oct 30, 2025
Jupyter Notebook 573 25 Updated Aug 23, 2024