Stars
Official repo of INTERSPEECH 2024 paper Genhancer: High-Fidelity Speech Enhancement via Generative Modeling on Discrete Codec Tokens. This repo provides additional audio samples.
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Vector (and Scalar) Quantization, in Pytorch
A PyTorch implementation of "Continuous Relaxation Training of Discrete Latent Variable Image Models"
Official repo of ICASSP 2023 paper Neural Feature Predictor and Discriminative Residual Coding for Low-bitrate speech coding
ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.
Implementation of vocoders empowered with pytorch lightning
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Symbolic Music Generation with Diffusion Models
Pytorch Implementation of OpenAI's "Improved Variational Inference with Inverse Autoregressive Flow"
Wavenet Autoencoder for Unsupervised speech representation learning (after Chorowski, Jan 2019)
A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
Code for reproducing key results in the paper "Improving Variational Inference with Inverse Autoregressive Flow"
Official repo of ICASSP 2021 paper Source-Aware Neural Speech Coding for Noisy Speech Compression (SANAC)
Datasets, Transforms and Models specific to Computer Vision
Simple max match segmentation for Chinese