Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View JaeDukSeo's full-sized avatar
🙏
Praying
🙏
Praying

Block or report JaeDukSeo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Truly universal encoding detector in pure Python.

Python 713 61 Updated Oct 14, 2025

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

TypeScript 10,669 1,069 Updated Oct 7, 2025

Noise supression using deep filtering

Python 3,451 339 Updated Oct 17, 2024

Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)

Python 898 151 Updated Oct 15, 2025

A novel media player that allows you to navigate by speaker

Svelte 65 4 Updated Oct 22, 2025

Very fast, accurate speaker diarization

Python 158 14 Updated Oct 16, 2025

⚡ Accelerate speaker diarization with Senko, processing 1 hour of audio in just 5 seconds on powerful hardware—boost your audio analysis efficiency.

Python 1 Updated Oct 24, 2025

LLM story writer with a focus on high-quality long output based on a user provided prompt.

Python 184 52 Updated Aug 23, 2025

TTS + Voice Cloning

Python 167 28 Updated Aug 16, 2025

A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5-TTS, Higgs Audio …

Python 355 26 Updated Oct 24, 2025

Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale…

Python 586 163 Updated Jul 14, 2025

Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially for my kids.

Python 435 78 Updated Aug 23, 2025

VLLM Port of the Chatterbox TTS model

Python 321 38 Updated Oct 18, 2025

SoTA open-source TTS

Python 14,215 1,876 Updated Sep 25, 2025

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…

Python 4,981 447 Updated Oct 5, 2025

Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process deployment. The video translation output is optimized for platfo…

Go 8,696 706 Updated Sep 16, 2025

智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能,语言无界”

Jupyter Notebook 2,745 295 Updated Mar 5, 2025

A flask built web app that leverages the power of OpenAI's whisper model to transcribe audio and video files. Has support for various file formats. Generates timestamped .srt files.

HTML 3 1 Updated Jun 19, 2025

A small wrapper package around whisper-timestamped. Create force-aligned transcription TextGrids from raw audio!

Python 18 2 Updated Mar 31, 2025

Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.

Python 62 8 Updated Oct 2, 2025

A robust audio transcription tool using OpenAI's Whisper API. Handles files of any length by automatically splitting them into chunks, with progress tracking and timestamped output.

Python 5 1 Updated Feb 28, 2025

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer(RVC), zero-shot Voice Cloning (E2, F5-TTS), YouTub…

Python 11 3 Updated Jan 29, 2025

🎬 Clipify: Instantly transform long videos into engaging, social media-ready clips with cutting-edge AI technology.

Python 22 3 Updated Aug 25, 2024

[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation

Python 307 35 Updated Jun 6, 2025

Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021…

Python 251 21 Updated May 9, 2022

An open-source RAG-based tool for chatting with your documents.

Python 24,567 2,019 Updated Jul 4, 2025

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 19,276 1,823 Updated Oct 16, 2025

Tired of boring PDFs? Want to inject some chaotic energy into your documents? PDF2BRAINROT is here to help! This script takes your standard PDF files and transforms them into dynamic, attention-gra…

Python 1 Updated Feb 25, 2025

Generate audiobooks from e-books

Python 5,615 377 Updated Mar 2, 2025

Audio Reactivity Nodes for ComfyUI 🔊 Create AI generated audio-driven animations. Compatible with IPAdapter, ControlNets, AnimateDiff...

Python 478 17 Updated Jun 2, 2025
Next