Nepomuceno/vllm-cpu

vLLM CPU Images

Multi-arch (amd64 + arm64) Docker images for running vLLM with the CPU backend.
Images are built automatically from upstream vLLM tags and commits.

Tags

  • nightly → current commit of main
  • <version> (e.g. 0.10.2) → official vLLM release tags
  • <commit-hash> → alternative tag for reproducibility
  • amd64-*, arm64-* → per-arch builds
  • latest → alias to latest release
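
As a rough sketch of how the per-arch tags above might be selected on a given host, the helpers below map the output of `uname -m` to the `amd64` or `arm64` prefix and compose an image reference. The function names `arch_prefix` and `image_ref` are hypothetical, and the exact suffix that follows the prefix (shown here as `nightly`) is an assumption, since the tag list only gives the `amd64-*` / `arm64-*` pattern. Check the registry for the tags that actually exist.

```shell
# Map a machine name (as printed by `uname -m`) to the image's arch prefix.
# With no argument, the current host's machine name is used.
arch_prefix() {
  case "${1:-$(uname -m)}" in
    x86_64|amd64)  echo amd64 ;;
    aarch64|arm64) echo arm64 ;;
    *) echo "unsupported architecture: ${1:-$(uname -m)}" >&2; return 1 ;;
  esac
}

# Compose a per-arch image reference such as gabrielbico/vllm-cpu:arm64-nightly.
# ASSUMPTION: the "<arch>-<suffix>" layout is inferred from the "amd64-*" pattern;
# the real tag names may differ.
image_ref() {
  prefix=$(arch_prefix "$2") || return 1
  echo "gabrielbico/vllm-cpu:${prefix}-${1}"
}
```

For example, `docker pull "$(image_ref nightly)"` would request the nightly build for the current host's architecture, if such a tag is published. In most cases the multi-arch tags (`nightly`, `latest`, a version) should resolve to the right platform without this step.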

Usage

Run an API server:

docker run --rm -p 8000:8000 \
  -e HUGGING_FACE_HUB_TOKEN="$HF_TOKEN" \
  gabrielbico/vllm-cpu:nightly \
  --model google/gemma-3-270m
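
Once the container is up, the server can be queried over HTTP. The sketch below assumes the image's entrypoint starts vLLM's OpenAI-compatible server, so that `/v1/models` and `/v1/completions` are available on the published port 8000. The names `VLLM_URL`, `completion_payload`, and `query_server` are hypothetical helpers introduced for illustration.

```shell
# ASSUMPTION: the container from the Usage section is running locally on port 8000.
VLLM_URL="${VLLM_URL:-http://localhost:8000}"
MODEL="${MODEL:-google/gemma-3-270m}"

# Build the JSON body for a completion request.
# $1 = prompt, $2 = max tokens (defaults to 32).
# Note: the prompt is inserted as-is, so it must not contain quotes or backslashes.
completion_payload() {
  printf '{"model": "%s", "prompt": "%s", "max_tokens": %d}' "$MODEL" "$1" "${2:-32}"
}

# List the served models, then request a short completion for the given prompt.
query_server() {
  curl -s "$VLLM_URL/v1/models"
  echo
  curl -s "$VLLM_URL/v1/completions" \
    -H "Content-Type: application/json" \
    -d "$(completion_payload "$1")"
  echo
}
```

Calling `query_server "Hello, my name is"` should print the model list followed by a completion response, provided the model has finished loading. On CPU, the first request can take a while.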

Platforms

  • linux/amd64 (built on AVX2 hosts)
  • linux/arm64 (Apple Silicon, ARM servers)

Notes

These images are CPU only. For GPU builds, see the official vLLM docs.
