Stars
✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
A Conversational Speech Generation Model
🔊 Text-Prompted Generative Audio Model
Dual-mode Bluetooth stack, with small memory footprint.
The first base model for full-duplex conversational audio
An open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
✨📞 Asterisk PBX in 🐳 Docker — Smallest Asterisk ever! 🚀
A working STABLE and usable Asterisk PBX Server, in Docker, using Debian-lite
Easily build IVR applications for FreeSWITCH
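Uses FreeSWITCH to accept calls from users' mobile phones, integrates the iFLYTEK Open Platform (xfyun) plugin through UniMRCP Server to run speech recognition (ASR) on the caller's speech, and invokes speech synthesis (TTS) according to custom business logic, building a simple end-to-end voice call center.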