Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A feature-rich command-line audio/video downloader
🦜🔗 The platform for reliable agents.
Python tool for converting files and office documents to Markdown.
real time face swap and one-click video deepfake with only a single image
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Clone a voice in 5 seconds to generate arbitrary speech in real-time
The world's simplest facial recognition api for Python and the command line
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
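🎯 Say goodbye to information overload: your AI public-opinion monitoring assistant and trending-topic filter! Aggregates trending topics from multiple platforms plus RSS feeds, with precise keyword filtering. Integrates the MCP architecture to enable AI-driven natural-language conversational analysis, sentiment insight, and trend prediction. Supports one-click Docker deployment, with data self-hosted locally or in the cloud. Integrates smart push notifications via WeChat, Feishu, DingTalk, Telegram, email, ntfy, bark, Slack, and other channels. ⭐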
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
A generative speech model for daily dialogue.
The complete stack for AI Engineers: framework, runtime and control plane.
We write your reusable computer vision tools. 💜
Official Code for DragGAN (SIGGRAPH 2023)
Instant voice cloning by MIT and MyShell. Audio foundation model.
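微舆: a multi-agent public-opinion analysis assistant that anyone can use. It breaks through information cocoons, restores the true picture of public opinion, predicts future trends, and supports decision-making. Built from scratch, with no dependency on any framework.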
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Easily train a good VC model with voice data <= 10 mins!
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
SoftVC VITS Singing Voice Conversion
Generative Models by Stability AI
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Industry leading face manipulation platform