Stars
Open-source scientific and technical publishing system built on Pandoc.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Use android as mic/speaker for ubuntu
Reference-aware automatic speech evaluation toolkit
[ICASSP 2025] Official PyTorch code for training and inference pipeline for DepMamba: Progressive Fusion Mamba for Multimodal Depression Detection
50k English-Japanese Parallel Corpus for Machine Translation Benchmark.
Manipulate audio with a simple and easy high level interface
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
Collection of audio-focused loss functions in PyTorch
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
NeuroKit2: The Python Toolbox for Neurophysiological Signal Processing
Everything you need to know to build your own RAG application
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
A python-port of julius-speech/segmentation-kit
Variational Recurrent Autoencoder for timeseries clustering in pytorch
PyTorch Implementation of Variational Recurrent Autoencoder
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
.ipynb rendering plugin for GitBucket
JupyterLab extension for live editing of LaTeX documents
JupyterHub service to cull idle servers and users
Unofficial implementation of "Speaker recognition from raw waveform with sincnet" paper.
JupyterLab desktop application, based on Electron.