A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 14,695 1,541 Updated Jan 27, 2026

toki-plus / video-mover

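Fully automated short-video reposting tool. Supports automatic download, deduplication, AI-generated titles and tags, and upload, and can be extended through secondary development to multiple platforms, for example: TikTok -> WeChat Channels/Douyin/Xiaohongshu, Douyin -> TikTok/WeChat Channels/Xiaohongshu... video-processing, automation, tiktok, selenium, pyqt5, ffmpeg, bot, data-scraping, video-deduplication.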

Python 319 57 Updated Jan 19, 2026

toki-plus / AB-Video-Deduplicator

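A powerful Python GUI tool for video deduplication that uses a high-frame-rate frame-extraction blending algorithm to evade duplicate detection on short-video platforms. Supports GPU acceleration. video-processing, automation, tiktok, selenium, pyqt5, ffmpeg, bot, data-scraping, video-deduplication.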

Python 104 24 Updated Jan 19, 2026

luzhisheng / js_reverse

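Mainly for collecting and learning web-scraping techniques such as JS reverse engineering, app reverse engineering, packet capture, CAPTCHAs, encryption, automation, and machine learning.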

JavaScript 1,140 265 Updated Aug 15, 2025

OpenNeuroDatasets / ds005262

OpenNeuro dataset - ArEEG: Arabic Inner Speech EEG dataset

1 2 Updated Jan 22, 2025

boson-ai / higgs-audio

Text-audio foundation model from Boson AI

Python 7,880 601 Updated Jan 18, 2026

PallasBot / Pallas-Bot

Pallas Bot for Arknights

Python 450 81 Updated Dec 23, 2025

lllyasviel / FramePack

Let's make video diffusion practical!

Python 16,567 1,632 Updated Oct 16, 2025

zy691 / Manchu-Recognition

1 Updated Jun 22, 2025

Lightricks / LTX-Video

Official repository for LTX-Video

Python 9,189 858 Updated Jan 5, 2026

canopyai / Orpheus-TTS

Towards Human-Sounding Speech

Python 5,910 508 Updated Dec 5, 2025

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,670 792 Updated May 27, 2025

lesterphillip / serenade

A Singing Style Conversion Framework Based On Audio Infilling

Python 33 5 Updated Apr 28, 2025

lb1169656535 / PolyVocalis

This project's main function is to extract each speaker's audio separately from a recording that contains multiple speakers.

Python 2 Updated Apr 23, 2025

lukeewin / AudioSeparationGUI

A GUI program for speaker separation, built on FunASR.

Python 157 26 Updated Dec 14, 2025

n8n-io / n8n

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 171,685 54,192 Updated Jan 28, 2026

SaeByeolMun / speech-classification-for-stroke-diagnosis

Jupyter Notebook 1 Updated Jul 17, 2024

SparkAudio / Spark-TTS

Spark-TTS Inference Code

Python 10,913 1,171 Updated Apr 9, 2025

alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook 14,136 1,671 Updated Dec 8, 2025

yuaotian / go-cursor-help

Resolves the following prompts that Cursor shows during the free subscription period: Your request has been blocked as our system has detected suspicious activity / You've reached your trial request limit. / Too many free trial accounts used on this machine.

Shell 25,813 3,133 Updated Jan 27, 2026