AIDman

Bruce AIDman

3 followers · 24 following

Microsoft
Seattle, WA
11:13 (UTC -07:00)

Stars

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,503 224 Updated Aug 12, 2025

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 89,998 11,258 Updated Sep 8, 2025

whaleonearth / MLE-DS-Interview-Prep-Guide

This repo is meant to serve as a detailed guide for Machine Learning/AI interviews.

250 66 Updated Apr 8, 2025

QingyuLiu0521 / ICSD

ICSD Dataset

Python 36 2 Updated Jun 11, 2025

gordicaleksa / pytorch-original-transformer

My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT p…

Jupyter Notebook 1,063 184 Updated Dec 27, 2020

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 1,054 159 Updated Oct 13, 2025

phrazhola / YourChatGptReactApp

YourChatGPT is a versatile AI chatbot solution that harnesses the capabilities of the ChatGPT API. Crafted to simplify your journey, it enables you to create a tailored ChatGPT clone effortlessly.

JavaScript 18 5 Updated Aug 21, 2023

shansiliu95 / MLP_Scratch_Python

Implement MLP from Scratch using Python

Python 2 1 Updated Sep 27, 2022

SpeechColab / Leaderboard

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

Python 524 69 Updated Mar 29, 2025

bashbaha / speakergan

Unofficial implement with paper SpeakerGAN: Speaker identification with conditional generative adversarial network

Python 8 3 Updated Dec 9, 2021

AIDman / TFGAN-PLC

Forked from Guanyuansheng/TFGAN-PLC

A Temporal-Spectral Generative Adversarial Network based End-to-end Packet Loss Concealment for Wideband Speech Transmission

Python 1 Updated Apr 27, 2022

nikvaessen / w2v2-speaker-few-samples

Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688

Python 12 2 Updated Dec 2, 2024

sasv-challenge / SASVC2022_Baseline

Baseline for the Spoofing-aware Speaker Verification Challenge 2022

Python 65 22 Updated May 3, 2022

yl4579 / StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Python 511 112 Updated Jan 13, 2025

csltstu / DeepLearning-500-questions

Forked from scutan90/DeepLearning-500-questions

TeX 4 1 Updated Nov 8, 2018

tensorflow / lingvo

Lingvo

Python 2,851 452 Updated Sep 26, 2025

yuyq96 / D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

Python 89 23 Updated May 4, 2023

deezer / spleeter

Deezer source separation library including pretrained models.

Python 27,482 3,035 Updated Apr 2, 2025

r39ashmi / e2e_dialect

End to end dialect classification

Python 3 3 Updated Mar 30, 2022

cvqluu / MTL-Speaker-Embeddings

Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021

Python 25 6 Updated Oct 5, 2022