-
awsome-audio-foundation-models Public
Forked from labhamlet/awsome-audio-foundation-models
This repository contains benchmarking code for recent audio foundation models on HEAR and Nat-HEAR datasets.
Jupyter Notebook MIT License Updated Dec 25, 2025 -
unified-source-separation Public
Forked from Jonathan-LeRoux/unified-source-separation
Official repo for task-aware unified source separation (TUSS)
Python GNU Affero General Public License v3.0 Updated Dec 20, 2025 -
NeMo Public
Forked from NVIDIA-NeMo/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Python Apache License 2.0 Updated Dec 16, 2025 -
minimind Public
Forked from jingyaogong/minimind
🚀🚀 Train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏
Python Apache License 2.0 Updated Dec 14, 2025 -
DeepASA Public
Forked from donghoney0416/DeepASA
Official page of "DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis"
Python Updated Oct 18, 2025 -
Spatial-AST Public
Forked from zszheng147/Spatial-AST
🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)
Python Other Updated Feb 13, 2025 -
TIGER Public
Forked from JusperLee/TIGER
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
Python Updated Feb 1, 2025 -
NBSS Public
Forked from Audio-WestlakeU/NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
Python MIT License Updated Jan 1, 2025 -
buddy Public
Forked from sp-uhh/buddy
BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models
Python Updated Oct 18, 2024 -
FastICA Public
Fast ICA algorithm for blind source separation
-
AudioDec Public
Forked from facebookresearch/AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
-
MCSSFDAF Public
Multichannel State Space Frequency-Domain Adaptive Filtering (MCSSFDAF)
-
pytorch_lightning_template_for_beginners Public
Forked from Audio-WestlakeU/pytorch_lightning_template_for_beginners
A PyTorch template for beginners based on pytorch_lightning
Python Updated Feb 1, 2024 -
encodec Public
Forked from facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Python MIT License Updated Jan 4, 2024 -
Sixty-years-of-frequency-domain-monaural-speech-enhancement Public
Forked from cszheng-ioa/Sixty-years-of-frequency-domain-monaural-speech-enhancement -
clarity Public
Forked from claritychallenge/clarity
Clarity Challenge toolkit - software for building Clarity Challenge systems
Python MIT License Updated Dec 25, 2023 -
Interference-Rejection-using-Riemannian-Geometry-for-DoA-Estimation Public
Forked from amitaybar/Interference-Rejection-using-Riemannian-Geometry-for-DoA-Estimation
This is the code for the paper "On Interference-Rejection using Riemannian Geometry for Direction of Arrival Estimation" by A. Bar and R. Talmon
MATLAB Updated Oct 1, 2023 -
A flexible dataset for blind source separation. You can adjust the non-stationarity of the background noise and the reverberation time of the room to suit your application.
-
SpeechAlgorithms Public
Forked from Ryuk17/SpeechAlgorithms
Speech Algorithms
-
AuxIVA Public
Independent vector analysis with the auxiliary-function method
-
Beam-Guided-TasNet Public
Forked from hangtingchen/Beam-Guided-TasNet
Beam-guided TasNet
Python BSD 3-Clause "New" or "Revised" License Updated Sep 8, 2022