Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View mcw519's full-sized avatar

Block or report mcw519

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SoTA open-source TTS

Python 14,215 1,876 Updated Sep 25, 2025

Turn detection for full-duplex dialogue communication

Python 436 27 Updated Oct 15, 2025

✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux

Jupyter Notebook 65 8 Updated Sep 18, 2025
Cuda 7 Updated Mar 2, 2025

Target Speaker Extraction Toolkit

Python 207 27 Updated Oct 4, 2025

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Python 463 67 Updated Nov 17, 2022

A novel human-interaction method for real-time speech extraction on headphones.

Python 585 65 Updated Jun 5, 2024

Official data preparation scripts for the URGENT 2024 Challenge

Python 84 7 Updated May 21, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15,944 3,144 Updated Oct 24, 2025

LLM training in simple, raw C/CUDA

Cuda 27,951 3,248 Updated Jun 26, 2025

CoreNet: A library for training deep neural networks

Jupyter Notebook 7,025 548 Updated Oct 9, 2025

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 13,391 1,970 Updated Aug 8, 2024

Record live audio anywhere, anytime with Python

Python 1 Updated Sep 5, 2025

Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation.”

Python 70 7 Updated Apr 27, 2023

Make the sound you hear pure and clean by deep learning.

Python 8 Updated Aug 9, 2024

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

Cuda 560 93 Updated Jul 18, 2025

A PyTorch implementation of DNN-based source separation.

Python 305 52 Updated Mar 29, 2022

Deep model with built-in self-attention alignment for acoustic echo cancellation, Pytorch implement

Python 39 12 Updated Jul 25, 2023

Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

C++ 357 94 Updated Jan 22, 2023

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

Python 139 34 Updated Mar 28, 2022

Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021

Jupyter Notebook 40 4 Updated Oct 14, 2021

🎛 🔊 A Python library for audio.

C++ 5,817 308 Updated Oct 9, 2025

Audio processing by using pytorch 1D convolution network

Python 1,089 96 Updated May 16, 2025

A PyTorch-based Speech Toolkit

Python 10,607 1,570 Updated Oct 21, 2025

🙊 software for creating speech recognition models.

Python 159 32 Updated Jun 2, 2024

Tools for handling multimodal data in machine learning projects.

Python 1,074 255 Updated Oct 9, 2025

Roadmap to becoming an Artificial Intelligence Expert in 2022

JavaScript 30,428 2,548 Updated Sep 12, 2025

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,272 230 Updated Aug 7, 2025