Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
Extract phoneme-level timestamps from speeh audio.
PyTorch CUDA based implementation of KMeans with dimensionality reduction
Custom firmware for the Anycubic Kobra series of 3D printers (Kobra 2 Pro, Kobra 3, Kobra 3 V2, Kobra S1 and Kobra 3 Max)
ESP32/arduino library for SIM800, SIM900 GSM module.
An extension of PHOIBLE that includes features for allophones.
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Simple text to phones converter for multiple languages
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
A PyTorch-based Speech Toolkit
An open-source RAG-based tool for chatting with your documents.
Audio generation using diffusion models, in PyTorch.
AI model running on RPi for failure detection
Sleep library for Arduino (compatible with arduino-tiny core)
Captcha solver extension for humans, available for Chrome, Edge and Firefox
ESP32 oscilloscope - see the signals through Web browser the way ESP32 sees them
ATTiny usb bootloader with a strong emphasis on bootloader compactness.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Voice activity detection (VAD) paper and code(From 198*~ )and its classification.
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
A Web UI for easy subtitle using whisper model.
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
State-of-the-Art Text Embeddings
Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)