DistServe Public
Forked from LLMServe/DistServe. Disaggregated serving system for Large Language Models (LLMs).
Jupyter Notebook · Apache License 2.0 · Updated Aug 19, 2024
LLaMA-Factory Public
Forked from hiyouga/LLaMA-Factory. Unify Efficient Fine-Tuning of 100+ LLMs
Python · Apache License 2.0 · Updated Apr 9, 2024
llama.cpp Public
Forked from ggml-org/llama.cpp. LLM inference in C/C++
C++ · MIT License · Updated Feb 25, 2024
PowerInfer Public
Forked from SJTU-IPADS/PowerInfer. High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
C · MIT License · Updated Jan 23, 2024
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLM. TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ · Apache License 2.0 · Updated Dec 1, 2023
vllm Public
Forked from vllm-project/vllm. A high-throughput and memory-efficient inference and serving engine for LLMs
Python · Apache License 2.0 · Updated Dec 1, 2023
TensorRT Public
Forked from NVIDIA/TensorRT. NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applicat…
C++ · Apache License 2.0 · Updated Nov 29, 2023
FlexGen Public
Forked from FMInference/FlexLLMGen. Running large language models on a single GPU for throughput-oriented scenarios.
Python · Apache License 2.0 · Updated Sep 27, 2023
xalloc Public
A Rust library for allocating both conventional DRAM-based memory and CXL-based memory.
Ditto Public
Forked from dmemsys/Ditto. This is the implementation repository of our SOSP'23 paper: Ditto: An Elastic and Adaptive Memory-Disaggregated Caching System.
C++ · Updated Sep 24, 2023
runc Public
Forked from opencontainers/runc. CLI tool for spawning and running containers according to the OCI specification
Go · Apache License 2.0 · Updated Aug 10, 2023
memkind Public
Forked from memkind/memkind. Memkind is an easy-to-use, general-purpose allocator which helps to fully utilize various kinds of memory available in the system, including DRAM, NVDIMM, and HBM
C · Other · Updated Jun 14, 2023
zenfs Public
Forked from westerndigitalcorporation/zenfs. ZenFS is a storage backend for RocksDB that enables support for ZNS SSDs and SMR HDDs.
C++ · GNU General Public License v2.0 · Updated May 22, 2023
curve Public
Forked from opencurve/curve. Curve is a high-performance, lightweight-operation, cloud-native open source distributed storage system. Curve can be applied to: 1) mainstream cloud-native infrastructure platforms OpenStack and K…
C++ · Apache License 2.0 · Updated Mar 31, 2023
opendal Public
Forked from apache/opendal. OpenDAL: Access data freely, painlessly, and efficiently
Rust · Apache License 2.0 · Updated Dec 20, 2022
XD_EE_DSA_2022 Public
My solutions for XDU EE data structures and algorithms
C++ · MIT License · Updated Dec 16, 2022
LearningOS_Record Public
A record of my daily progress while learning os-comp2022-winter
MIT License · Updated Nov 3, 2022
LevelDBRead Public
Notes taken while reading the LevelDB source code
C++ · BSD 3-Clause "New" or "Revised" License · Updated Oct 15, 2022
RocksDBRead Public
Forked from facebook/rocksdb. Notes taken while reading the RocksDB source code
C++ · GNU General Public License v2.0 · Updated Oct 14, 2022
paper_readings Public
A list of the papers I have read and plan to read
MIT License · Updated Sep 20, 2022