A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15,944 3,144 Updated Oct 24, 2025

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 27,951 3,248 Updated Jun 26, 2025

apple / corenet

CoreNet: A library for training deep neural networks

Jupyter Notebook 7,025 548 Updated Oct 9, 2025

karpathy / micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 13,391 1,970 Updated Aug 8, 2024

mcw519 / LiveSound

Record live audio anywhere, anytime with Python

Python 1 Updated Sep 5, 2025

merlresearch / hyper-unmix

Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation.”

Python 70 7 Updated Apr 27, 2023

mcw519 / PureSound

Make the sound you hear pure and clean by deep learning.

Python 8 Updated Aug 9, 2024

DavidDiazGuerra / gpuRIR

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

Cuda 560 93 Updated Jul 18, 2025

tky823 / DNN-based_source_separation

A PyTorch implementation of DNN-based source separation.

Python 305 52 Updated Mar 29, 2022

lhwcv / self_attention_alignment

Deep model with built-in self-attention alignment for acoustic echo cancellation, Pytorch implement

Python 39 12 Updated Jul 25, 2023

jzi040941 / PercepNet

Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

C++ 357 94 Updated Jan 22, 2023

JusperLee / AFRCNN-For-Speech-Separation

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

Python 139 34 Updated Mar 28, 2022

tvuong123 / ModulationDomainLoss

Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021

Jupyter Notebook 40 4 Updated Oct 14, 2021

spotify / pedalboard

🎛 🔊 A Python library for audio.

C++ 5,817 308 Updated Oct 9, 2025

KinWaiCheuk / nnAudio

Audio processing by using pytorch 1D convolution network

Python 1,089 96 Updated May 16, 2025

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 10,607 1,570 Updated Oct 21, 2025

CoEDL / elpis

🙊 software for creating speech recognition models.

Python 159 32 Updated Jun 2, 2024

lhotse-speech / lhotse

Tools for handling multimodal data in machine learning projects.

Python 1,074 255 Updated Oct 9, 2025

AMAI-GmbH / AI-Expert-Roadmap

Roadmap to becoming an Artificial Intelligence Expert in 2022

JavaScript 30,428 2,548 Updated Sep 12, 2025

k2-fsa / k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,272 230 Updated Aug 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Meng Wu mcw519

Achievements

Achievements

Block or report mcw519

Stars

resemble-ai / chatterbox

TEN-framework / ten-turn-detection

DoodleBears / split-lang

leo811121 / quant_utils

wenet-e2e / wesep

MasayaKawamura / MB-iSTFT-VITS

vb000 / LookOnceToHear

urgent-challenge / urgent2024_challenge

NVIDIA-NeMo / NeMo