Stars
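🎯 Say goodbye to information overload: AI helps you make sense of trending news, with simple public-opinion monitoring and analysis. A multi-platform hot-topic aggregator plus an MCP-based AI analysis tool. Monitors 35 platforms (Douyin, Zhihu, Bilibili, Wallstreetcn, Cailian Press, etc.) with smart filtering, automatic push, and conversational AI analysis (dig into the news in natural language with 13 tools, including trend tracking, sentiment analysis, and similarity search). Supports push via WeCom/personal WeChat/Feishu/DingTalk/Telegram/email/ntfy/bark/slack, phone notifications in 1 minute, no need…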
An early research stage expert-parallel load balancer for MoE models based on linear programming.
《Machine Learning Systems: Design and Implementation》- Chinese Version
Supercharge Your LLM with the Fastest KV Cache Layer
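AIInfra (AI infrastructure) covers the AI system stack, from low-level hardware such as chips up to the upper-layer software stack, that supports training and inference of large AI models.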
Efficient and easy multi-instance LLM serving
A Datacenter Scale Distributed Inference Serving Framework
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
Pluggable in-process caching engine to build and scale high performance services
The C++ REST SDK is a Microsoft project for cloud-based client-server communication in native code using a modern asynchronous C++ API design. This project aims to help C++ developers connect to an…
📄 🇨🇳 📃 Paper reading notes: Papers Notebook (Distributed System, Virtualization, Machine Learning)
A library of C++ coroutine abstractions for the coroutines TS
A high-throughput and memory-efficient inference and serving engine for LLMs
Simple, light-weight and easy-to-use asynchronous components
Master Modern C++(11/14/17/20) Templates: TMP, SFINAE, Concepts, CRTP, Variadic Magic, and Compile-Time Sorcery
A coroutine framework aimed at high-concurrency io with reasonable latency, based on io_uring.
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
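In-depth study of KVM, Ceph, and FUSE features, including open-source projects, code examples, articles, videos, architecture mind maps, and more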
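Latest 2021 compilation of C++ learning resources, including new features in C++ 11 / 14 / 17 / 20 / 23, beginner tutorials, recommended books, quality articles, study notes, tutorial videos, and more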
Ceph is a distributed object, block, and file storage platform
A distributed key-value storage system developed by Alibaba Group
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
`std::execution`, the proposed C++ framework for asynchronous and parallel programming.