Thanks to visit codestin.com
Credit goes to github.com

BodhiHu

Follow

🌴

bodhicitta

中土 Bodhi BodhiHu

🌴

bodhicitta

Follow

namo amituofo❤ > bodhicitta > 🎾🧘‍♂️ > 阿彌陀佛 ❤️❤️❤️

22 followers · 10 following

AMD, MooreThreads
Shanghai

Achievements

Achievements

Stars

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 5,781 421 Updated Oct 28, 2025

FlagOpen / FlagGems

FlagGems is an operator library for large language models implemented in the Triton Language.

Python 743 146 Updated Oct 30, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,844 726 Updated Oct 15, 2025

kardolus / chatgpt-cli

ChatGPT CLI is a versatile tool for interacting with LLMs through OpenAI, Azure, and other popular providers like Perplexity AI and Llama. It supports prompt files, history tracking, and live data …

Go 828 52 Updated Oct 9, 2025

waylaidwanderer / node-chatgpt-api

A client implementation for ChatGPT and Bing AI. Available as a Node.js module, REST API server, and CLI app.

JavaScript 4,201 724 Updated Jan 27, 2024

chatboxai / chatbox

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 37,142 3,764 Updated Oct 29, 2025

ENOT-AutoDL / onnx2torch

Convert ONNX models to PyTorch.

Python 705 85 Updated Oct 14, 2025

NVIDIA / TensorRT-Model-Optimizer

A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment…

Python 1,486 187 Updated Oct 30, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,611 5,056 Updated Oct 30, 2025

jiazhihao / mirage_baselines

Python 2 Updated Jan 28, 2025

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 17,401 2,349 Updated Oct 30, 2025

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 27,627 2,733 Updated Apr 30, 2025

mirage-project / mirage

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,918 144 Updated Oct 30, 2025

BodhiHu / L-Mul

C implementation of the L-Mul f32/f16 multiplications from paper: https://arxiv.org/html/2410.00907

C 28 Updated Oct 12, 2024

microsoft / onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 18,231 3,524 Updated Oct 30, 2025

pipeless-ai / pipeless

An open-source computer vision framework to build and deploy apps in minutes

Rust 767 41 Updated May 8, 2024

RidgeRun / gst-inference

A GStreamer Deep Learning Inference Framework

C 131 30 Updated Nov 7, 2023

pytorch / audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,760 734 Updated Oct 29, 2025

pytorch / vision

Datasets, Transforms and Models specific to Computer Vision

Python 17,264 7,168 Updated Oct 29, 2025

ultralytics / ultralytics

Ultralytics YOLO 🚀

Python 48,045 9,275 Updated Oct 30, 2025

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python 2,467 355 Updated Oct 30, 2025

MooreThreads / vllm_musa

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65 2 Updated Oct 28, 2024

NVIDIA / accelerated-computing-hub

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 800 133 Updated Oct 29, 2025

NVIDIA / cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 8,332 2,168 Updated Sep 5, 2025

MooreThreads / muAlg

Forked from NVIDIA/cub

Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Cuda 3 1 Updated Sep 13, 2024

ggml-org / ggml

Tensor library for machine learning

C++ 13,342 1,375 Updated Oct 29, 2025

leejet / stable-diffusion.cpp

Diffusion model(SD,Flux,Wan,Qwen Image,...) inference in pure C/C++

C++ 4,495 435 Updated Oct 28, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,578 2,091 Updated Jul 17, 2025

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,248 1,097 Updated Oct 30, 2025

apoorvumang / prompt-lookup-decoding

Jupyter Notebook 573 25 Updated Aug 23, 2024