Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View thanhkm's full-sized avatar

Block or report thanhkm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SoftVC VITS Singing Voice Conversion

Python 27,710 5,066 Updated Nov 11, 2023

[ACL 2025 Oral] Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models

Python 206 14 Updated Jun 25, 2025

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Python 666 53 Updated Oct 1, 2024

Vector (and Scalar) Quantization, in Pytorch

Python 3,644 296 Updated Oct 20, 2025

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,598 279 Updated Jan 12, 2025

speech self-supervised representations

Python 511 39 Updated Apr 27, 2023

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,332 105 Updated Sep 24, 2023

Recurrent neural network for audio noise reduction

C 5,096 1,007 Updated Feb 22, 2025

Simple and Efficient Tensorflow implementations of NER models with tf.estimator and tf.data

Python 925 273 Updated Dec 18, 2018

Facebook AI Research's Automatic Speech Recognition Toolkit

C++ 6,442 1,003 Updated Oct 27, 2025

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Python 315 119 Updated Jan 23, 2018

End-to-end ASR/LM implementation with PyTorch

Python 594 138 Updated Aug 30, 2021

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,307 1,940 Updated Oct 20, 2025

Pytorch Bindings for warp-ctc

Cuda 761 266 Updated Jul 2, 2023

Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.

1,869 226 Updated Jun 27, 2022

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Python 763 177 Updated Jul 6, 2023

The RWTH extensible training framework for universal recurrent neural networks

Python 370 133 Updated Oct 27, 2025

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.

Python 1,212 316 Updated Dec 19, 2020

Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow

Python 3,996 792 Updated Oct 8, 2021

Speech Recognition using DeepSpeech2.

Python 2,135 628 Updated Dec 13, 2022

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 15,189 5,367 Updated Sep 22, 2025

Reference implementations of MLPerf® training benchmarks

Python 1,720 584 Updated Oct 22, 2025

Models and examples built with TensorFlow

Python 77,663 45,437 Updated Oct 27, 2025

End-to-End Speech Processing Toolkit

Python 9,537 2,337 Updated Oct 28, 2025