Lists (12)
Sort Name ascending (A-Z)
Starred repositories
An Open Source Machine Learning Framework for Everyone
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
ClickHouse® is a real-time analytics database management system
DuckDB is an analytical in-process SQL database management system
A modern replacement for Redis and Memcached
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
FlashMLA: Efficient Multi-head Latent Attention Kernels
🚀 The best real-time interactive AI avatar(digital human) with on-premise deployment and <1.5 s latency.
ESP8266 WiFi Connection manager with web captive portal
Transformer related optimization, including BERT, GPT
Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
A modern high-performance open source message queuing system
Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but also Mistral 7B on desktops and servers. ARM, x86, WASM, RI…
A lightning fast Finite State machine and REgular expression manipulation library.
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
A high-performance inference system for large language models, designed for production environments.
Common source, scripts and utilities for creating Triton backends.
The Triton backend for the PyTorch TorchScript models.
Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.
A working version of A-MNS: Fast and robust template matching with majority neighbour similarity and annulus projection transformation