Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,307 1,940 Updated Oct 20, 2025

SeanNaren / warp-ctc

Pytorch Bindings for warp-ctc

Cuda 761 266 Updated Jul 2, 2023

syhw / wer_are_we

Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.

1,869 226 Updated Jun 27, 2022

awni / speech

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Python 763 177 Updated Jul 6, 2023

rwth-i6 / returnn

The RWTH extensible training framework for universal recurrent neural networks

Python 370 133 Updated Oct 27, 2025

Alexander-H-Liu / End-to-end-ASR-Pytorch

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.

Python 1,212 316 Updated Dec 19, 2020

buriburisuri / speech-to-text-wavenet

Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow

Python 3,996 792 Updated Oct 8, 2021

SeanNaren / deepspeech.pytorch

Speech Recognition using DeepSpeech2.

Python 2,135 628 Updated Dec 13, 2022

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 15,189 5,367 Updated Sep 22, 2025

mlcommons / training

Reference implementations of MLPerf® training benchmarks

Python 1,720 584 Updated Oct 22, 2025

tensorflow / models

Models and examples built with TensorFlow

Python 77,663 45,437 Updated Oct 27, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,537 2,337 Updated Oct 28, 2025

Thanh KM thanhkm

Stars

svc-develop-team / so-vits-svc

jishengpeng / Languagecodec

lucidrains / voicebox-pytorch

lucidrains / vector-quantize-pytorch

lucidrains / audiolm-pytorch

auspicious3000 / contentvec

lucidrains / naturalspeech2-pytorch

xiph / rnnoise

guillaumegenthial / tf_ner

flashlight / wav2letter

hirofumi0810 / tensorflow_end2end_speech_recognition

hirofumi0810 / neural_sp

PaddlePaddle / PaddleSpeech