Stars
Turn detection for full-duplex dialogue communication
✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
A novel human-interaction method for real-time speech extraction on headphones.
Official data preparation scripts for the URGENT 2024 Challenge
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
CoreNet: A library for training deep neural networks
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation.”
Make the sound you hear pure and clean by deep learning.
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
A PyTorch implementation of DNN-based source separation.
Deep model with built-in self-attention alignment for acoustic echo cancellation, Pytorch implement
Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021
Audio processing by using pytorch 1D convolution network
Tools for handling multimodal data in machine learning projects.
Roadmap to becoming an Artificial Intelligence Expert in 2022
FSA/FST algorithms, differentiable, with PyTorch compatibility.