Supertonic FastAPI - High Performance OpenAI-Compatible TTS API

Supertonic FastAPI is a production-ready, OpenAI-compatible text-to-speech (TTS) API powered by Supertonic. It is designed for high-concurrency environments and works as a drop-in replacement for OpenAI's speech synthesis service, with streaming output, GPU/CPU acceleration, and containerized scaling.

Key Features & SEO Benefits

OpenAI API Compatibility: Full support for OpenAI's TTS endpoints and data formats. Existing OpenAI SDKs work without code changes once they are pointed at this server's base URL.
Advanced Auto-Chunking: Automatically handles long text inputs, ensuring smooth and consistent audio generation for long-form content.
Multilingual & Multi-Voice: Supports a wide range of professional voices (F1-F5, M1-M5) mapped to standard OpenAI voice names (alloy, echo, fable, onyx, nova, shimmer).
Real-Time Streaming: Implements chunked transfer encoding, allowing users to play audio while it's still being generated (LLM-ready).
GPU & CPU Hardware Acceleration: Optimized for NVIDIA CUDA (GPU) and Apple Silicon CoreML (Mac) using ONNX Runtime for near-instant inference.
Dockerized for Scale: Ready for containerized deployment with Nginx load balancing and multi-process support.
Professional Speech Synthesis: High-quality, natural-sounding AI voices suitable for podcasts, narrations, and assistants.
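
To make the auto-chunking idea concrete, here is a minimal sketch of splitting long input at sentence boundaries before synthesis. It is an illustration only, not the project's actual chunking code: the function name `chunk_text` and the `max_chars` limit of 300 are assumptions chosen for the example.

```python
import re


def chunk_text(text: str, max_chars: int = 300) -> list[str]:
    """Split text into chunks of at most max_chars, preferring sentence ends.

    Hypothetical helper: the name and the default limit are illustrative.
    """
    # Split after '.', '!' or '?' followed by whitespace; punctuation stays
    # attached to its sentence.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # A single sentence longer than the limit is hard-wrapped.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            # The sentence still fits in the chunk being built.
            current = candidate
        else:
            # Close the current chunk and start a new one with this sentence.
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each chunk could then be synthesized in order and the resulting audio concatenated or streamed, so the model never receives more text at once than it handles well.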

Quick Start (Local)

Setup:
```
./scripts/setup.sh
```

Run:

```
./scripts/start.sh
# Server listens on http://0.0.0.0:8800
```

Production Deployment (Docker)

The easiest way to run in production is using Docker Compose, which includes an Nginx load balancer.

Using Docker Compose

Start:
```
docker compose up -d
```

Scale:

```
# Scale to 3 worker instances
docker compose up -d --scale api=3
```

Individual Image

Build:
```
docker build -t supertonic-tts .
```

Run:

```
# CPU / Mac (Recommended for Docker on Mac)
docker run -p 8800:8800 supertonic-tts

# NVIDIA GPU (Requires nvidia-container-toolkit)
docker run --gpus all -p 8800:8800 supertonic-tts
```

Configuration (.env)

| Variable | Default | Description |
| --- | --- | --- |
| `PORT` | 8800 | Port to listen on |
| `FORCE_PROVIDERS` | auto | Force a specific ORT provider: `cuda`, `coreml`, `cpu`, or `auto` |
| `MODEL_THREADS` | 0 | Intra-op threads (0 = auto) |
| `MAX_WORKERS` | 4 | Thread pool workers for concurrent requests |
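
As a rough illustration of how these variables and defaults fit together, the sketch below reads them from the environment and maps `FORCE_PROVIDERS` to ONNX Runtime execution provider names. The names `Settings`, `load_settings`, and `ORT_PROVIDERS` are hypothetical, and adding a CPU fallback after the accelerated provider is an assumption; the application's real loading code may differ.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

# ONNX Runtime execution provider identifiers for each FORCE_PROVIDERS value.
# Listing CPUExecutionProvider as a fallback is an assumption of this sketch.
ORT_PROVIDERS = {
    "cuda": ["CUDAExecutionProvider", "CPUExecutionProvider"],
    "coreml": ["CoreMLExecutionProvider", "CPUExecutionProvider"],
    "cpu": ["CPUExecutionProvider"],
}


@dataclass(frozen=True)
class Settings:
    port: int
    force_providers: str
    model_threads: int
    max_workers: int

    def ort_providers(self) -> Optional[list[str]]:
        """Return an explicit provider list, or None when set to 'auto'."""
        return ORT_PROVIDERS.get(self.force_providers)


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Build Settings from environment variables, using the documented defaults."""
    env = os.environ if env is None else env
    force = env.get("FORCE_PROVIDERS", "auto").strip().lower()
    if force != "auto" and force not in ORT_PROVIDERS:
        raise ValueError(f"Unsupported FORCE_PROVIDERS value: {force!r}")
    return Settings(
        port=int(env.get("PORT", "8800")),
        force_providers=force,
        model_threads=int(env.get("MODEL_THREADS", "0")),
        max_workers=int(env.get("MAX_WORKERS", "4")),
    )
```

Calling `load_settings()` with no arguments reads from `os.environ`; passing a plain dict makes the function easy to exercise in tests.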

API Usage

POST /v1/audio/speech

```
{
  "model": "tts-1",
  "input": "Hello, this is a test.",
  "voice": "alloy",
  "stream": true
}
```
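
The request above can be sent from Python using only the standard library. The following is a minimal sketch, assuming the server runs locally on the default port 8800; the helper names `build_speech_request`, `save_audio`, and `synthesize_to_file` are hypothetical, and the output file extension is left to the caller because this README does not state the audio format returned.

```python
import json
import urllib.request
from typing import BinaryIO


def build_speech_request(
    text: str,
    voice: str = "alloy",
    base_url: str = "http://localhost:8800",  # assumed local default
    stream: bool = True,
) -> urllib.request.Request:
    """Build a POST /v1/audio/speech request with the JSON body shown above."""
    payload = {"model": "tts-1", "input": text, "voice": voice, "stream": stream}
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def save_audio(response: BinaryIO, out: BinaryIO, chunk_size: int = 4096) -> int:
    """Copy audio bytes to `out` as they arrive; return the total byte count."""
    total = 0
    while True:
        chunk = response.read(chunk_size)
        if not chunk:  # empty bytes means the stream has ended
            break
        out.write(chunk)
        total += len(chunk)
    return total


def synthesize_to_file(text: str, path: str, voice: str = "alloy") -> int:
    """Send the request and write the streamed audio to `path`."""
    request = build_speech_request(text, voice=voice)
    with urllib.request.urlopen(request) as response, open(path, "wb") as out:
        return save_audio(response, out)
```

With a server running, `synthesize_to_file("Hello, this is a test.", "speech.out")` would write the returned audio to disk. Reading in fixed-size chunks lets a caller start processing audio before generation finishes, which is the point of the `stream` flag.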

Get Voices: /voices

Health Check: /health
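
When starting the server from a script or a container, it can help to wait for the health endpoint before sending speech requests. The sketch below polls `/health` until it answers; it assumes the endpoint accepts GET and returns HTTP 200 when the service is ready, which this README does not spell out. The function names and retry defaults are hypothetical.

```python
import time
import urllib.request
from typing import Callable


def is_healthy(base_url: str = "http://localhost:8800", timeout: float = 2.0) -> bool:
    """Return True if GET /health answers with HTTP 200 (assumed readiness signal)."""
    try:
        with urllib.request.urlopen(f"{base_url.rstrip('/')}/health", timeout=timeout) as resp:
            return resp.status == 200
    except OSError:
        # URLError, HTTPError, refused connections, and timeouts all derive from OSError.
        return False


def wait_until_healthy(
    check: Callable[[], bool] = is_healthy,
    attempts: int = 30,
    delay: float = 1.0,
) -> bool:
    """Call `check` up to `attempts` times, sleeping `delay` seconds between tries."""
    for attempt in range(attempts):
        if check():
            return True
        if attempt < attempts - 1:
            time.sleep(delay)
    return False
```

Passing the check as a callable keeps the retry loop independent of the network, so it can be tested with a stub.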
