Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Fast and accurate AI powered file content types detection
Common used path planning algorithms with animations.
π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
FlagGems is an operator library for large language models implemented in the Triton Language.
yolort is a runtime stack for yolov5 on specialized accelerators such as tensorrt, libtorch, onnxruntime, tvm and ncnn.
A high-performance tool for video upscaling, interpolation, depth estimation, and more. Available as a CLI and Adobe Extension.
PyTorch implementation of AlphaZero Chess from scratch
A Toolkit to Help Optimize Large Onnx Model
π€ Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime
Machine Learning, Facial Rigger
Multi-stream video inference with Ultralytics YOLO - Display multiple video streams in a grid layout with real-time object detection.
A easy tool for generating Tensor Program from Torch(besd on Torch FX & TVM Relax)