Stars
Multi-stream video inference with Ultralytics YOLO - Display multiple video streams in a grid layout with real-time object detection.
🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime
A high-performance tool for video upscaling, interpolation, depth estimation, and more. Available as a CLI and Adobe Extension.
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
FlagGems is an operator library for large language models implemented in the Triton Language.
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
llm deploy project based mnn. This project has merged into MNN.
A high-throughput and memory-efficient inference and serving engine for LLMs
PyTorch implementation of AlphaZero Chess from scratch
A Toolkit to Help Optimize Large Onnx Model
The minimal opencv for Android, iOS, ARM Linux, Windows, Linux, MacOS, HarmonyOS, WebAssembly, watchOS, tvOS, visionOS
A easy tool for generating Tensor Program from Torch(besd on Torch FX & TVM Relax)