Tom-CaoZH

👋

Focusing

Zhang Cao Tom-CaoZH

👋

Focusing

59 followers · 123 following

China
https://tom-caozh.github.io/

Achievements

Highlights

Tom-CaoZH.github.io Public

This is my homepage.

Python Updated Aug 2, 2025
None Public

HTML Updated Jan 22, 2025
DistServe Public
Forked from LLMServe/DistServe

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook Apache License 2.0 Updated Aug 19, 2024
LLaMA-Factory Public
Forked from hiyouga/LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Python Apache License 2.0 Updated Apr 9, 2024
CXL-101 Public

Contain some materials about CXL.

cxl

21 2 MIT License Updated Feb 29, 2024
llama.cpp Public
Forked from ggml-org/llama.cpp

LLM inference in C/C++

C++ MIT License Updated Feb 25, 2024
PowerInfer Public
Forked from SJTU-IPADS/PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C MIT License Updated Jan 23, 2024
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ Apache License 2.0 Updated Dec 1, 2023
vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated Dec 1, 2023
TensorRT Public
Forked from NVIDIA/TensorRT

NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applicat…

C++ Apache License 2.0 Updated Nov 29, 2023
FlexGen Public
Forked from FMInference/FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

Python Apache License 2.0 Updated Sep 27, 2023
xalloc Public

This lib is used to allocate normal DRAM-based memory and CXL-based memory using Rust.

Rust 3 MIT License Updated Sep 27, 2023
notes-pictures Public

MIT License Updated Sep 27, 2023
Ditto Public
Forked from dmemsys/Ditto

This is the implementation repository of our SOSP'23 paper: Ditto: An Elastic and Adaptive Memory-Disaggregated Caching System.

C++ Updated Sep 24, 2023
runc Public
Forked from opencontainers/runc

CLI tool for spawning and running containers according to the OCI specification

Go Apache License 2.0 Updated Aug 10, 2023
cuckoofilter Public
Forked from efficient/cuckoofilter

C++ Other Updated Jul 11, 2023
memkind Public
Forked from memkind/memkind

Memkind is an easy-to-use, general-purpose allocator which helps to fully utilize various kinds of memory available in the system, including DRAM, NVDIMM, and HBM

C Other Updated Jun 14, 2023
zenfs Public
Forked from westerndigitalcorporation/zenfs

ZenFS is a storage backend for RocksDB that enables support for ZNS SSDs and SMR HDDs.

C++ GNU General Public License v2.0 Updated May 22, 2023
OncoMatcher Public

Python MIT License Updated May 6, 2023
curve Public
Forked from opencurve/curve

Curve is a high-performance, lightweight-operation, cloud-native open source distributed storage system. Curve can be applied to: 1) mainstream cloud-native infrastructure platforms OpenStack and K…

C++ Apache License 2.0 Updated Mar 31, 2023
Leetcode Public

my solutions to some leetcode problems

C++ MIT License Updated Jan 10, 2023
opendal Public
Forked from apache/opendal

OpenDAL: Access data freely, painlessly, and efficiently

Rust Apache License 2.0 Updated Dec 20, 2022
XD_EE_DSA_2022 Public

my solution to XDU EE data structure and algorithm

C++ MIT License Updated Dec 16, 2022
LearningOS_Record Public

Record my daily process when learning os-comp2022-winter

MIT License Updated Nov 3, 2022
LevelDBRead Public

To record some notes when I read the leveldb source code

C++ BSD 3-Clause "New" or "Revised" License Updated Oct 15, 2022
RocksDBRead Public
Forked from facebook/rocksdb

To record some notes when I read the rocksdb source code

C++ GNU General Public License v2.0 Updated Oct 14, 2022
paper_readings Public

Keep track of the papers I have read and to be read

MIT License Updated Sep 20, 2022
tests Public

C++ Updated Aug 18, 2022
TinyDB Public

Just a very simple database

C++ MIT License Updated Jul 29, 2022
mit_6.824 Public

to record my study of mit 6.824

Go MIT License Updated Jul 15, 2022

Zhang Cao Tom-CaoZH

Achievements

Achievements

Highlights

Tom-CaoZH.github.io Public

Uh oh!

None Public

Uh oh!

DistServe Public

Uh oh!

LLaMA-Factory Public

Uh oh!

CXL-101 Public

Uh oh!

llama.cpp Public

Uh oh!

PowerInfer Public

Uh oh!

TensorRT-LLM Public

Uh oh!

vllm Public

Uh oh!

TensorRT Public

Uh oh!

FlexGen Public

Uh oh!

xalloc Public

Uh oh!

notes-pictures Public

Uh oh!

Ditto Public

Uh oh!

runc Public

Uh oh!

cuckoofilter Public

Uh oh!

memkind Public

Uh oh!

zenfs Public

Uh oh!

OncoMatcher Public

Uh oh!

curve Public

Uh oh!

Leetcode Public

Uh oh!

opendal Public

Uh oh!

XD_EE_DSA_2022 Public

Uh oh!

LearningOS_Record Public

Uh oh!

LevelDBRead Public

Uh oh!

RocksDBRead Public

Uh oh!

paper_readings Public

Uh oh!

tests Public

Uh oh!

TinyDB Public

Uh oh!

mit_6.824 Public

Uh oh!