Inworld AI provides expressive TTS designed for game characters and conversational AI. It offers realistic voices with streaming support.
Vision Agents requires a Stream account for real-time transport. Most providers offer free tiers to get started.

Installation

uv add "vision-agents[inworld]"

Quick Start

from vision_agents.core import Agent, User
from vision_agents.plugins import inworld, gemini, deepgram, getstream

agent = Agent(
    edge=getstream.Edge(),
    agent_user=User(name="Assistant", id="agent"),
    instructions="You are a helpful assistant.",
    llm=gemini.LLM("gemini-2.5-flash"),
    stt=deepgram.STT(),
    tts=inworld.TTS(),
)
Set INWORLD_API_KEY in your environment or pass api_key directly.
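
The two ways of supplying the key can be sketched as a small lookup. This is an illustration only: `resolve_api_key` is a hypothetical helper, not part of the plugin, and it assumes an explicitly passed key takes precedence over the environment variable.

```python
import os
from typing import Optional


def resolve_api_key(api_key: Optional[str] = None) -> str:
    """Return the explicit key if given, otherwise fall back to INWORLD_API_KEY.

    Hypothetical helper for illustration; the plugin performs its own lookup.
    """
    key = api_key or os.environ.get("INWORLD_API_KEY")
    if not key:
        # Fail early with a clear message instead of at the first TTS request.
        raise ValueError("Set INWORLD_API_KEY or pass api_key explicitly")
    return key
```

The result can be passed along as `inworld.TTS(api_key=resolve_api_key())`, which surfaces a missing key at startup rather than at the first synthesis call.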

Parameters

Name         Type   Default                 Description
voice_id     str    "Dennis"                Voice ID (available in Inworld Portal)
model_id     str    "inworld-tts-1.5-max"   Model ("inworld-tts-1.5-max", "inworld-tts-1.5-min")
temperature  float  1.1                     Randomness (0-2)
api_key      str    None                    API key (defaults to INWORLD_API_KEY env var)

Inworld TTS supports up to 2,000 characters per request.
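
Longer text has to be split before synthesis to stay under that limit. The following is a minimal sketch of one way to do it, assuming that breaking on spaces is acceptable for your content; `split_for_tts` and `MAX_CHARS` are hypothetical names, not part of the plugin, and the plugin may already handle chunking internally.

```python
from typing import List

MAX_CHARS = 2000  # per-request character limit stated in the docs above


def split_for_tts(text: str, limit: int = MAX_CHARS) -> List[str]:
    """Split text into chunks of at most `limit` characters.

    Prefers to break at the last space inside the limit and falls back
    to a hard cut when a single run of characters exceeds the limit.
    """
    if limit <= 0:
        raise ValueError("limit must be positive")
    chunks: List[str] = []
    remaining = text.strip()
    while len(remaining) > limit:
        # Last space at an index <= limit, so the chunk never exceeds the limit.
        cut = remaining.rfind(" ", 0, limit + 1)
        if cut <= 0:
            cut = limit  # no usable space: hard cut
        chunks.append(remaining[:cut].rstrip())
        remaining = remaining[cut:].lstrip()
    if remaining:
        chunks.append(remaining)
    return chunks
```

Each returned chunk can then be sent as its own synthesis request. Splitting on sentence boundaries instead of spaces would likely give more natural prosody, at the cost of a slightly more involved splitter.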

Next Steps