XianruiWang

🎧

Focusing

王贤锐（Henry） XianruiWang

🎧

Focusing

May the force be with us

115 followers · 230 following

CIAIC, NWPU & WasedaU & LMS, FAU
Japan
xianruiwang.github.io

Achievements

awsome-audio-foundation-models Public
Forked from labhamlet/awsome-audio-foundation-models

This repository contains benchmarking code for recent audio foundation models on HEAR and Nat-HEAR datasets.

Jupyter Notebook MIT License Updated Dec 25, 2025
unified-source-separation Public
Forked from Jonathan-LeRoux/unified-source-separation

Official repo for task-aware unified source separation (TUSS)

Python GNU Affero General Public License v3.0 Updated Dec 20, 2025
NeMo Public
Forked from NVIDIA-NeMo/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python Apache License 2.0 Updated Dec 16, 2025
minimind Public
Forked from jingyaogong/minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python Apache License 2.0 Updated Dec 14, 2025
DeepASA Public
Forked from donghoney0416/DeepASA

Official page of "DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis"

Python Updated Oct 18, 2025
SpatialCLAP Public
Forked from sarulab-speech/SpatialCLAP

Python Updated Sep 17, 2025
XianruiWang Public

Config files for my GitHub profile.

config github-config

Updated Sep 12, 2025
ArrayDPS Public
Forked from ArrayDPS/ArrayDPS

Python MIT License Updated May 12, 2025
audio_flow Public
Forked from qiuqiangkong/audio_flow

Python MIT License Updated Apr 28, 2025
Spatial-AST Public
Forked from zszheng147/Spatial-AST

🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)

Python Other Updated Feb 13, 2025
TIGER Public
Forked from JusperLee/TIGER

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation

Python Updated Feb 1, 2025
NBSS Public
Forked from Audio-WestlakeU/NBSS

The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

Python MIT License Updated Jan 1, 2025
buddy Public
Forked from sp-uhh/buddy

BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models

Python Updated Oct 18, 2024
XianruiWang.github.io Public

JavaScript MIT License Updated Oct 10, 2024
FastICA Public

Fast ICA algorithm for blind source separation

MATLAB 52 8 GNU General Public License v3.0 Updated Oct 6, 2024
AudioDec Public
Forked from facebookresearch/AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

Python 1 Other Updated Jun 15, 2024
neural-fcasa Public
Forked from b-sigpro/neural-fcasa

Python 1 MIT License Updated Jun 12, 2024
MCSSFDAF Public

Multichannel State Space Frequency-Domain Adaptive Filtering(MCSSFDAF)

Python 4 3 Updated May 25, 2024
pytorch_lightning_template_for_beginners Public
Forked from Audio-WestlakeU/pytorch_lightning_template_for_beginners

A pytorch template for beginners based on pytorch_lightning

Python Updated Feb 1, 2024
encodec Public
Forked from facebookresearch/encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python MIT License Updated Jan 4, 2024
Sixty-years-of-frequency-domain-monaural-speech-enhancement Public
Forked from cszheng-ioa/Sixty-years-of-frequency-domain-monaural-speech-enhancement

Python 1 1 Updated Dec 28, 2023
clarity Public
Forked from claritychallenge/clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

Python MIT License Updated Dec 25, 2023
Source-separtion-toolkit Public

Python 6 1 Updated Nov 13, 2023
CTF_MNMF Public

unofficial implementation of CTFMNMF

Python 5 2 Updated Nov 12, 2023
Interference-Rejection-using-Riemannian-Geometry-for-DoA-Estimation Public
Forked from amitaybar/Interference-Rejection-using-Riemannian-Geometry-for-DoA-Estimation

This is the code for the paper "On Interference-Rejection using Riemannian Geometry for Direction of Arrival Estimation", A. Bar and R. Talmon

MATLAB Updated Oct 1, 2023
Blind_Source_Separation_Dataset Public

A flexible dataset for blind source separation. You can change the non-stationarity of background noise, reverberation time of room according to your application flexiblely .

Python 2 1 Updated Sep 18, 2023
AV-Sepformer Public
Forked from lin9x/AV-Sepformer

Python Updated Jun 28, 2023
SpeechAlgorithms Public
Forked from Ryuk17/SpeechAlgorithms

Speech Algorithms

C 1 Apache License 2.0 Updated Feb 28, 2023
AuxIVA Public

Independent vector analysis with alixiary-function-method

MATLAB 26 7 GNU General Public License v3.0 Updated Dec 21, 2022
Beam-Guided-TasNet Public
Forked from hangtingchen/Beam-Guided-TasNet

Beam-guided TasNet

Python BSD 3-Clause "New" or "Revised" License Updated Sep 8, 2022

王贤锐（Henry） XianruiWang

Achievements

Achievements

awsome-audio-foundation-models Public

Uh oh!

unified-source-separation Public

Uh oh!

NeMo Public

Uh oh!

minimind Public

Uh oh!

DeepASA Public

Uh oh!

SpatialCLAP Public

Uh oh!

XianruiWang Public

Uh oh!

ArrayDPS Public

Uh oh!

audio_flow Public

Uh oh!

Spatial-AST Public

Uh oh!

TIGER Public

Uh oh!

NBSS Public

Uh oh!

buddy Public

Uh oh!

XianruiWang.github.io Public

Uh oh!

FastICA Public

Uh oh!

AudioDec Public

Uh oh!

neural-fcasa Public

Uh oh!

MCSSFDAF Public

Uh oh!

pytorch_lightning_template_for_beginners Public

Uh oh!

encodec Public

Uh oh!

Sixty-years-of-frequency-domain-monaural-speech-enhancement Public

Uh oh!

clarity Public

Uh oh!

Source-separtion-toolkit Public

Uh oh!

CTF_MNMF Public

Uh oh!

Interference-Rejection-using-Riemannian-Geometry-for-DoA-Estimation Public

Uh oh!

Blind_Source_Separation_Dataset Public

Uh oh!

AV-Sepformer Public

Uh oh!

SpeechAlgorithms Public

Uh oh!

AuxIVA Public

Uh oh!

Beam-Guided-TasNet Public

Uh oh!