LLM Serving
Open-source projects for LLM serving optimization
4 repositories
- TensorRT-LLM: provides users with an easy-to-use Python API to define Large Language Models (LLMs) and state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
- SGLang: a fast serving framework for large language models and multimodal models.
- vLLM: a high-throughput and memory-efficient inference and serving engine for LLMs (see the usage sketch below).
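These engines expose a similar offline-generation workflow: load a model, set sampling parameters, and batch-generate. The sketch below uses vLLM's Python API as a representative example; the model ID is only an illustrative assumption, and any Hugging Face model supported by vLLM could be substituted.

```python
# Minimal sketch of offline batched generation with vLLM.
# Assumes the `vllm` package is installed and the model ID below is one
# you have access to (it is only an example, not part of the list above).
from vllm import LLM, SamplingParams

prompts = [
    "Explain paged attention in one sentence.",
    "Why does continuous batching improve GPU utilization?",
]
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=128)

# Loading the model once builds the serving engine (weights, KV-cache blocks, etc.).
llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")

# generate() batches the prompts internally and returns one output object per prompt.
outputs = llm.generate(prompts, sampling_params)
for out in outputs:
    print(out.prompt, "->", out.outputs[0].text)
```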