Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
朋友圈转发截图生成工具(
计算机类常用电子书整理,并且附带下载链接,包括Java,Python,Linux,Go,C,C++,数据结构与算法,人工智能,计算机基础,面试,设计模式,数据库,前端等书籍
shUnit2 is a xUnit based unit test framework for Bourne based shell scripts.
Java client for Kubernetes & OpenShift
Stork - Storage Orchestration Runtime for Kubernetes
Kubernetes deployment strategies explained
Production-Grade Container Scheduling and Management
这里收录比较实用的计算机相关技术书籍,可以在短期之内入门的简单实用教程、一些技术网站以及一些写的比较好的博文,欢迎Fork,你也可以通过Pull Request参与编辑。
The vip.com's java coding standard, libraries and tools