Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View AIDman's full-sized avatar
  • Microsoft
  • Seattle, WA
  • 11:13 (UTC -07:00)

Block or report AIDman

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,503 224 Updated Aug 12, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 89,998 11,258 Updated Sep 8, 2025

This repo is meant to serve as a detailed guide for Machine Learning/AI interviews.

250 66 Updated Apr 8, 2025

ICSD Dataset

Python 36 2 Updated Jun 11, 2025

My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT p…

Jupyter Notebook 1,063 184 Updated Dec 27, 2020

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 1,054 159 Updated Oct 13, 2025

YourChatGPT is a versatile AI chatbot solution that harnesses the capabilities of the ChatGPT API. Crafted to simplify your journey, it enables you to create a tailored ChatGPT clone effortlessly.

JavaScript 18 5 Updated Aug 21, 2023

Implement MLP from Scratch using Python

Python 2 1 Updated Sep 27, 2022

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

Python 524 69 Updated Mar 29, 2025

Unofficial implement with paper SpeakerGAN: Speaker identification with conditional generative adversarial network

Python 8 3 Updated Dec 9, 2021

A Temporal-Spectral Generative Adversarial Network based End-to-end Packet Loss Concealment for Wideband Speech Transmission

Python 1 Updated Apr 27, 2022

Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688

Python 12 2 Updated Dec 2, 2024

Baseline for the Spoofing-aware Speaker Verification Challenge 2022

Python 65 22 Updated May 3, 2022

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Python 511 112 Updated Jan 13, 2025

深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为15个章节,近20万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系[email protected] 版权所有,违权必究 Tan 2018.06

TeX 4 1 Updated Nov 8, 2018

Lingvo

Python 2,851 452 Updated Sep 26, 2025

PyTorch implementation of Densely Connected Time Delay Neural Network

Python 89 23 Updated May 4, 2023

Deezer source separation library including pretrained models.

Python 27,482 3,035 Updated Apr 2, 2025

End to end dialect classification

Python 3 3 Updated Mar 30, 2022

Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021

Python 25 6 Updated Oct 5, 2022

Deep Speaker: an End-to-End Neural Speaker Embedding System.

Python 935 241 Updated Apr 13, 2024

A library for high performance deep learning inference on NVIDIA GPUs.

C++ 557 65 Updated Jan 29, 2022

Learn and L3 embedding from audio/video pairs

Jupyter Notebook 88 20 Updated Apr 24, 2022

End-to-End Speech Processing Toolkit

Python 9,536 2,335 Updated Oct 27, 2025

Audio fingerprinting and recognition in Python

Python 6,666 1,466 Updated Apr 22, 2024

Python functions for reading kaldi data formats. Useful for rapid prototyping with python.

Python 377 119 Updated Jun 16, 2023

Python wrappers for Kaldi data

C++ 60 49 Updated Sep 20, 2017

Experimenting Speaker Verification and Recognition with Mistral A.K.A Alize

C++ 1 Updated Feb 3, 2014