Stars
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
No fortress, purely open ground. OpenManus is Coming.
LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key 管理与二次分发。单可执行文件,提供 Docker 镜像,一键部署,开箱即用。LLM API management & k…
A fast, powerful, safe and lightweight scripting language and engine for .NET
⚡️ OpenAI PHP is a supercharged community-maintained PHP API client that allows you to interact with OpenAI API.
Keybase Go Library, Client, Service, OS X, iOS, Android, Electron
🤱🏻 Turn any webpage into a desktop app with one command. 一键打包网页生成轻量桌面应用
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Port of OpenAI's Whisper model in C/C++
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
🤖 可 DIY 的 多模态 AI 聊天机器人 | 🚀 快速接入 微信、 QQ、Telegram、等聊天平台 | 🦈支持DeepSeek、Grok、Claude、Ollama、Gemini、OpenAI | 工作流系统、网页搜索、AI画图、人设调教、虚拟女仆、语音对话 |
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
WebChatGPT: A browser extension that augments your ChatGPT prompts with web results.
A webapi to help chatting with Umamusume
🎒 飞书 ×(GPT-4 + GPT-4V + DALL·E-3 + Whisper)= 飞一般的工作体验 🚀 语音对话、角色扮演、多话题讨论、图片创作、表格分析、文档导出 🚀