Stars
1 min of voice data can also be used to train a good TTS model! (few-shot voice cloning)
Run Orpheus 3B Locally With LM Studio
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
Speech To Speech: an effort for an open-sourced and modular GPT4-o
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
Windows Precision Touchpad Driver Implementation for Apple MacBook / Magic Trackpad
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.
Stable Diffusion web UI
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
daswer123 / hallo-webui
Forked from fudan-generative-vision/hallo
Webui for Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Algorithms and Data Structures implemented in Go for beginners, following best practices.
This repository contains the solutions and explanations to the algorithm problems on LeetCode. Only medium or above are included. All are written in C++/Python and implemented by myself. The proble…
Flowistry is an IDE plugin for Rust that helps you focus on relevant code.