Stars
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Command line utility for forced alignment using Kaldi
Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)
Unofficial implementation of YOLO-World + EfficientSAM for ComfyUI
🖌️ ComfyUI implementation of ProPainter framework for video inpainting.
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Unleash Next-Level AI! 🚀 💻 Code Generation: DeepSeek r1 + Claude 3.7 Sonnet - Unparalleled Performance! 📝 Content Creation: DeepSeek r1 + Gemini 2.5 Pro - Superior Quality! 🔌 OpenAI-Compatible. 🌊 S…
GUI for a Vocal Remover that uses Deep Neural Networks.
a Fast Fourier Transform (FFT) library that tries to Keep it Simple, Stupid
A library implementing the EBU R128 loudness standard.
Jack clients to transport multichannel audio over a local network.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Robust Speech Recognition via Large-Scale Weak Supervision
MATLAB implementation of our llumination estimation technique from a single image (ICCV'09 and IJCV'12 papers)
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Hackable and optimized Transformers building blocks, supporting a composable construction.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Stable diffusion for real-time music generation
Riffusion extension for AUTOMATIC1111's SD Web UI