Recipe for LibriPhrase

Python 31 4 Updated Sep 2, 2023

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,502 224 Updated Aug 12, 2025

The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python 179 13 Updated Sep 24, 2025

Official Repository For VoxBlink2

Python 84 5 Updated Aug 13, 2024

Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN

Python 96 22 Updated Sep 15, 2021

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 1,054 159 Updated Oct 13, 2025

Noise suppression using deep filtering

Python 3,458 340 Updated Oct 17, 2024

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)

Python 742 127 Updated Apr 11, 2024

Visual profiler for Python

Python 3,971 151 Updated Jul 15, 2022

Convert Netease Cloud Music ncm files to mp3/flac files.

C++ 2,745 364 Updated Oct 5, 2025

Algorithm for blind estimation of reverberation time

Jupyter Notebook 31 4 Updated Jun 6, 2024
Python 116 23 Updated Apr 24, 2023

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 667 99 Updated Aug 22, 2025

A collection of resources for speech processing algorithms || NEWS: the official VoxCeleb link has been failing recently, and an external download link has been added

54 7 Updated Jul 24, 2022

Robust Speech Recognition via Large-Scale Weak Supervision

Python 89,986 11,257 Updated Sep 8, 2025

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,833 309 Updated Mar 14, 2023

Keyword spotting and speech wake-up in PyTorch: DNN, CNN, TDNN, DFSMN, LSTM

Python 50 10 Updated Mar 15, 2022

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,161 205 Updated Sep 26, 2025

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,791 1,315 Updated Aug 14, 2024

VS Code extension that allows you to preview and play audio files.

TypeScript 169 16 Updated Jun 20, 2025

A PyTorch-based Speech Toolkit

Python 10,616 1,571 Updated Oct 21, 2025

A small package to create visualizations of PyTorch execution graphs

Jupyter Notebook 3,443 289 Updated Dec 30, 2024

Automatic headphone equalization from frequency responses

Python 14,907 2,518 Updated Jul 20, 2025

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 44 4 Updated Mar 2, 2021

Dataset and baseline code for the VocalSound dataset (ICASSP2022).

Jupyter Notebook 153 12 Updated Nov 12, 2022

PyTorch implementation of subband decomposition

HTML 92 13 Updated Jul 26, 2022

Frontend filterbank learning module with HVQT initialization capabilities.

Python 21 3 Updated Feb 27, 2024

Main codebase for TeXworks, a simple interface for working with TeX documents

C++ 746 154 Updated Oct 20, 2025

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,304 438 Updated Jul 25, 2024