GPUStack

GPU cluster manager for optimized AI model deployment

Pinned

  1. gpustack Public

    GPU cluster manager for optimized AI model deployment

    Python · 4.3k stars · 432 forks

  2. runner Public

    Collection of Dockerfiles to build images for various inference services across different accelerated backends.

    Dockerfile · 3 stars · 6 forks

  3. runtime Public

    Provides a unified interface to detect GPU resources and manage GPU workloads.

    Python · 4 stars · 6 forks

  4. gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    Go · 220 stars · 22 forks

  5. vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    Python · 184 stars · 26 forks

Repositories

Showing 10 of 14 repositories
  • .github Public

    Meta-GitHub repository for all GPUStack repositories.

    Dockerfile · 0 stars · Apache-2.0 · 3 forks · 0 open issues · 0 open pull requests · Updated Dec 22, 2025
  • gpustack-ui Public
    TypeScript · 64 stars · Apache-2.0 · 45 forks · 0 open issues · 0 open pull requests · Updated Dec 22, 2025
  • gpustack Public

    GPU cluster manager for optimized AI model deployment

    Python · 4,259 stars · Apache-2.0 · 432 forks · 354 open issues · 14 open pull requests · Updated Dec 22, 2025
  • runner Public

    Collection of Dockerfiles to build images for various inference services across different accelerated backends.

    Dockerfile · 3 stars · Apache-2.0 · 6 forks · 0 open issues · 0 open pull requests · Updated Dec 21, 2025
  • gpustack-higress-plugin
    Go · 0 stars · 2 forks · 0 open issues · 0 open pull requests · Updated Dec 18, 2025
  • runtime Public

    Provides a unified interface to detect GPU resources and manage GPU workloads.

    Python · 4 stars · Apache-2.0 · 6 forks · 0 open issues · 0 open pull requests · Updated Dec 18, 2025
  • community-inference-backends Public

    Community Inference Backends for GPUStack V2

    Dockerfile · 0 stars · Apache-2.0 · 1 fork · 0 open issues · 0 open pull requests · Updated Dec 17, 2025
  • gpustack.github.io
    HTML · 0 stars · 2 forks · 0 open issues · 0 open pull requests · Updated Dec 11, 2025
  • vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    Python · 184 stars · Apache-2.0 · 26 forks · 16 open issues · 0 open pull requests · Updated Dec 2, 2025
  • llama-box Public archive

    LM inference server implementation based on *.cpp.

    C++ · 294 stars · MIT · 29 forks · 2 open issues · 0 open pull requests · Updated Nov 24, 2025