Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View stephanie-wang's full-sized avatar

Highlights

  • Pro

Block or report stephanie-wang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A throughput-oriented high-performance serving framework for LLMs

Jupyter Notebook 913 44 Updated Oct 29, 2025

Advanced Topics on Systems for X

283 63 Updated Jul 10, 2024

Serving multiple LoRA finetuned LLM as one

Python 1,115 55 Updated May 8, 2024

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Python 13,084 429 Updated Nov 5, 2025

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).

Python 8,936 839 Updated Nov 12, 2025

FSCQ is a certified file system written and proven in Coq

Coq 249 22 Updated Oct 21, 2022

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,792 6,896 Updated Nov 12, 2025

The Firmament cluster scheduling platform

C++ 412 77 Updated May 26, 2021

Coz: Causal Profiling

C 4,402 167 Updated Aug 9, 2025

Read-Log-Update: A Lightweight Synchronization Mechanism for Concurrent Programming

C 49 18 Updated Aug 31, 2015