-
Lvllm Public
LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU parallel and NUMA parallel architecture, su…
-
lktransformers Public
The complete NUMA-optimized branch of the ktransformers project