Stars
A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…
[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer
[ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
Official comfyui repository of Hellomeme
Convert your workflows into nodes and chain them together
A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.
TheLocalLab / fluxgym-Colab
Forked from cocktailpeanut/fluxgymA Colab for the FluxGym Lora Training repository.
LLM Agent Framework in ComfyUI includes MCP sever, Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai / aisuite interfac…
Dead simple FLUX LoRA training UI with LOW VRAM support
The ultimate training toolkit for finetuning diffusion models
Video generation from text&image, 1st-gen
Rudimentary support for using multiple GPUs in a ComfyUI workflow
官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
text to speech using autoregressive transformer and VITS
vits2 backbone with multilingual-bert