Stars
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
kaldi-asr/kaldi is the official location of the Kaldi project.
Multi-talker ASR based on DiCoW with Serialized Output Training
Create VST plugins with JUCE that run machine learning models.
Based on the JUCE tutorial: https://docs.juce.com/master/tutorial_playing_sound_files.html
Audio File Player Plugin Tool for JUCE AudioPlugin Host (Audio Software Development)
openFrameworks addon for audio synthesis and generative music
Basic oscillator using the JUCE Oscillator class
Simple chat program that communicates using inaudible sounds
Functional programming language for signal processing and sound synthesis
Browser-based visual programming language and platform for sound synthesis.