Stars
This is the official implementation of the LiSenNet
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Noise supression using deep filtering
A list of publicly available room impulse response datasets and scripts to download them.
An unofficial implementation of DeepVQE proposed by Microsoft Corp.
AI powered speech denoising and enhancement
The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"
Coarse implement of the paper "A Simultaneous Denoising and Dereverberation Framework with Target Decoupling", On DNS-2020 dataset, the DNSMOS of first stage is 3.42 and second stage is 3.47.
A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch
Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
A PyTorch Library for Multi-Task Learning
Real-time GCC-NMF Blind Speech Separation and Enhancement
C++ Implementation of PyTorch Tutorials for Everyone
🎤 Microphone sound source localization by SRP-PHAT and others numerical methods.(基于SRP-PHAT的麦克风声源定位)
speech enhancement\speech seperation\sound source localization
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.