gpt
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
Stable Diffusion web UI
fay是一个帮助数字人(2.5d、3d、移动、pc、网页)或大语言模型(openai兼容、deepseek)连通业务系统的agent框架。
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
make your Speaker talking as Native style with own voice!
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Generative Models by Stability AI
A latent text-to-image diffusion model
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Code for fintune ChatGLM-6b using low-rank adaptation (LoRA)
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
vits2 backbone with multilingual-bert
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
A simple VITS HTTP API, developed by extending Moegoe with additional features.
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)