Lemonade helps users run local LLMs with the highest performance by configuring state-of-the-art inference engines for their NPUs and GPUs. Join our discord: https://discord.gg/5xXzkMu8Zk
-
Updated
Oct 23, 2025 - Python
Lemonade helps users run local LLMs with the highest performance by configuring state-of-the-art inference engines for their NPUs and GPUs. Join our discord: https://discord.gg/5xXzkMu8Zk
Ollama alternative for Rockchip NPU: An efficient solution for running AI and Deep learning models on Rockchip devices with optimized NPU support ( rkllm )
An interactive Ascend-NPU process viewer
EmbeddedLLM: API server for Embedded Device Deployment. Currently support CUDA/OpenVINO/IpexLLM/DirectML/CPU
PyTorchSim is a Comprehensive, Fast, and Accurate NPU Simulation Framework
🐉 Revolutionary NPU framework for Linux | 24,988 FPS face recognition | AMD XDNA support | World's first complete NPU stack
A simple Python script for running LLMs on Intel's Neural Processing Units (NPUs)
Your models on any xPU
Superresolution running on Rockchip NPU (RK3588, etc..)
⚡ A seamless integration of HuggingFace Transformers & Diffusers with RBLN SDK for efficient inference on RBLN NPUs.
Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU
CoreML conversion script for Bria-RMBG-1.4
Convert and run scikit-learn MLPs on Rockchip NPU.
Add a description, image, and links to the npu topic page so that developers can more easily learn about it.
To associate your repository with the npu topic, visit your repo's landing page and select "manage topics."