Stars
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…
An RWKV management and startup tool: fully automated, only 8 MB, with an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for c…
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Object Detection Metrics. 14 object detection metrics: mean Average Precision (mAP), Average Recall (AR), Spatio-Temporal Tube Average Precision (STT-AP). This project supports different bounding b…
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
Wan: Open and Advanced Large-Scale Video Generative Models
A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…
A model-driven approach to building AI agents in just a few lines of code.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Amazon Bedrock AgentCore accelerates AI agents into production with the scale, reliability, and security critical to real-world deployment.
The world's first open-source multimodal creative assistant. A privacy-first substitute for Canva and Manus that can be used locally.
[CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation
HandFixer: a one-click hand repair workflow for ComfyUI
A machine learning image inpainting task that instinctively removes watermarks from images, producing results indistinguishable from the ground-truth image
Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…
Bring projects, wikis, and teams together with AI. AppFlowy is the AI collaborative workspace where you achieve more without losing control of your data. The leading open source Notion alternative.
A Docker-free offline version of HeyGem; Python and Linux are all you need!
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer" (ICCV 2025)
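"CNote": notes covering most of the core knowledge needed to learn C, aiming to be the easiest-to-understand introductory C tutorial so that no one finds C hard to learn. (Includes C tutorials and selected C articles.)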
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
The official Python SDK for Model Context Protocol servers and clients