Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Mattias421's full-sized avatar
🐢
🐢

Highlights

  • Pro

Block or report Mattias421

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The package of IBM’s typeface, IBM Plex.

CSS 11,005 600 Updated Sep 29, 2025

DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Factorized Discrete Flow Matching

Python 57 8 Updated Sep 25, 2025

Awesome speech/audio LLMs, representation learning, and codec models

1,164 71 Updated Aug 13, 2025

DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast voice synthesis.🐙

Python 44 4 Updated Nov 2, 2025

Evaluation software used in the Text Retrieval Conference

C 273 55 Updated Nov 1, 2024

Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.

Jupyter Notebook 39 8 Updated Mar 4, 2024

Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training

Python 41 4 Updated Dec 18, 2020

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,274 230 Updated Oct 29, 2025

Implementation for the manuscript submission "Towards Unsupervised Speech Recognition Without Pronunciation Models""

Python 1 Updated Jan 2, 2025

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,889 1,208 Updated Jul 25, 2024

This is the official code release for Bayesian Flow Networks.

Python 303 35 Updated Jul 18, 2024

asr2k

Python 52 3 Updated Jun 2, 2024

Reliability diagrams visualize whether a classifier model needs calibration

Jupyter Notebook 160 19 Updated Feb 11, 2022
Jupyter Notebook 185 22 Updated Jan 16, 2024

A playbook for systematically maximizing the performance of deep learning models.

29,332 2,400 Updated Jun 18, 2024

BERT score for text generation

Jupyter Notebook 1,831 234 Updated Jul 30, 2024

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python 829 91 Updated Oct 10, 2025
Python 20 4 Updated Jul 15, 2024

AI powered speech denoising and enhancement

Python 2,028 242 Updated Dec 3, 2024

(ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement

Python 70 3 Updated Jul 23, 2025

This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)

Python 224 26 Updated Jun 5, 2025

Efficient 3D molecular generation with flow-matching and Semla

Python 47 6 Updated Jul 22, 2025

Official implementation of All Atom Diffusion Transformers (ICML 2025)

Python 269 30 Updated Sep 4, 2025
Jupyter Notebook 36 3 Updated Feb 1, 2024
Jupyter Notebook 19 4 Updated Mar 14, 2023

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 3,633 246 Updated Sep 25, 2025

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 674 99 Updated Aug 22, 2025
Next