Lists (5)
Sort Name ascending (A-Z)
Stars
Official electron build of draw.io
Boost LaTeX typesetting efficiency with preview, compile, autocomplete, colorize, and more.
The fastest and highest-quality deep learning powered Sora2 watermark cleaner.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
全自动短视频搬运工具,支持自动下载、去重、AI生成标题+标签、上传,可二开扩展至多平台,例如:TikTok->视频号/抖音/小红书、抖音->TikTok/视频号/小红书......video-processing, automation, tiktok, selenium, pyqt5, ffmpeg, bot, data-scraping, video-deduplication.
一款强大的Python视频去重GUI工具,采用高帧率抽帧混合算法,以规避短视频平台查重。支持GPU加速。video-processing, automation, tiktok, selenium, pyqt5, ffmpeg, bot, data-scraping, video-deduplication.
主要用来收集/学习爬虫相关技术如:js逆向、app逆向、抓包、验证码、加密技术、自动化技术、机器学习。
OpenNeuro dataset - ArEEG: Arabic Inner Speech EEG dataset
Text-audio foundation model from Boson AI
Lets make video diffusion practical!
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
A Singing Style Conversion Framework Based On Audio Infilling
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
解决Cursor在免费订阅期间出现以下提示的问题: Your request has been blocked as our system has detected suspicious activity / You've reached your trial request limit. / Too many free trial accounts used on this machine.
Robust Speech Recognition via Large-Scale Weak Supervision
This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024
Notes about courses Dive into Deep Learning by Mu Li
feature extraction from speech signals