Stars
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Enhanced GetX for Flutter: Stability, Performance, Beginner-Friendly.
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
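🔥 Officially recommended 🔥 The all-new Pro version of RuoYi-Vue, with all features optimized and refactored. An admin management system plus WeChat mini program built on Spring Boot + MyBatis Plus + Vue & Element, supporting RBAC dynamic permissions, data permissions, SaaS multi-tenancy, Flowable workflows, third-party login, payments, SMS, an online store, CRM, ERP, AI large models, IoT, and more. Your ⭐️ Star ⭐️ is…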
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator and answer generator using MULTIPLE LLMs, with no GPU needed. The user can ask a question and the system will make a mu…
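🤖 wukong-robot is a simple, flexible, and elegant Chinese-language voice dialogue robot / smart speaker project. It supports ChatGPT multi-turn conversation and may be the first open-source smart speaker project to support brain-computer interaction.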
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻 A list of projects by independent Chinese developers -- sharing what everyone is working on
[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"
Instant voice cloning by MIT and MyShell. Audio foundation model.
1 minute of voice data can also be used to train a good TTS model! (few-shot voice cloning)
Realtime Diffusion, using Automatic1111 Stable Diffusion API
🔊 Text-Prompted Generative Audio Model
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch
Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
KAN-TTS is a speech-synthesis training framework. Please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
OSS AI Companion Chatbot - Build your own AI companion in Python using ChatGPT.
State-of-the-art 2D and 3D Face Analysis Project
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.