Top 23 Python Audio Projects

ultimatevocalremovergui

1 83 22,421 3.7 Python

GUI for a Vocal Remover that uses Deep Neural Networks.

Project mention: Ultimate Vocal Remover GUI, a FOSS audio stem splitter | news.ycombinator.com | 2025-05-09
InfluxDB

www.influxdata.com featured

InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
beets

2 198 14,260 9.9 Python

music library manager and MusicBrainz tagger

Project mention: Beets: The music geek's media organizer | news.ycombinator.com | 2025-11-09
speechbrain

3 28 10,758 9.2 Python

A PyTorch-based Speech Toolkit

Project mention: 5 must know open-source repositories to build cool AI apps | dev.to | 2025-10-29

Star the Speech Brain repository ⭐
AudioGPT

4 4 10,200 0.0 Python

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
pydub

5 26 9,637 0.0 Python

Manipulate audio with a simple and easy high level interface

Project mention: Reading Neuromancer for the first time in 2025 | news.ycombinator.com | 2025-07-13

https://github.com/TaylorSMarks/playsound/issues/101
(A workaround exists: downgrading to version 1.2.2, but that comes with its own issues.)
The last time I experimented with audio in Python, I was surprised by how lacking its multimedia libraries are.
For example, when I needed to read audio files as data, I tried `SoundFile`, `librosa` (a wrapper around `SoundFile` or `audioread`), and `pydub`, and none of them was particularly satisfying or has seen much active development lately.
If you need to read various formats, pydub is probably your best bet (it does this by invoking ffmpeg under the hood). I was hoping for a more "native" solution, but oh well. Unfortunately, `pydub` is also unmaintained and has some serious performance issues (for example: https://github.com/jiaaro/pydub/issues/518 )
SpeechRecognition

6 16 8,898 8.6 Python

Speech recognition module for Python, supporting several engines and APIs, online and offline.
jukebox

7 129 8,014 0.0 Python

Code for the paper "Jukebox: A Generative Model for Music"
Stream

getstream.io featured

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
librosa

8 17 7,998 5.0 Python

Python library for audio and music analysis
ffsubsync

9 33 7,413 5.4 Python

Automagically synchronize subtitles with video.

Project mention: Can't pay, won't pay: streaming services are driving viewers back to piracy | news.ycombinator.com | 2025-08-14

I developed a tool (https://github.com/smacke/ffsubsync) which can sync subtitles against each other, and this can be used in conjunction with other tools such as https://pypi.org/project/srt/ to combine multiple subtitle streams into a single stream. I've used this strategy to good effect to get both English and Chinese subtitles up at once.
dejavu

10 15 6,661 0.0 Python

Audio fingerprinting and recognition in Python (by worldveil)
pyAudioAnalysis

11 11 6,166 1.7 Python

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
Porcupine

12 32 4,487 8.6 Python

On-device wake word detection powered by deep learning

Project mention: Show HN: Shoggoth Mini – A weird tentacle robot powered by GPT-4o and RL | news.ycombinator.com | 2025-07-15

> also, "GPT-4o continuously listens to speech through the audio stream," is going to be problematic
Seems like openWakeWord or porcupine could be able to solve by adding a layer for wake word detection before sending the prompt off.
I wonder if latency would be any better with a local model cached in a 16GB or 24GB graphics card. It would have to be a quantized/distilled model, but maybe performance would still be acceptable.
https://github.com/dscripka/openWakeWord
https://github.com/Picovoice/porcupine
basic-pitch

13 8 4,419 6.8 Python

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
picard

14 245 4,356 9.9 Python

A cross-platform music tagger powered by the MusicBrainz database. Picard organizes your music collection by updating your tags, renaming your files, and sorting them into a folder structure, exactly the way you want it.

Project mention: Beets: The music geek's media organizer | news.ycombinator.com | 2025-11-09

If you have the files downloaded, picard is also useful - https://picard.musicbrainz.org/
distil-whisper

15 10 3,979 7.6 Python

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
riffusion-hobby

16 14 3,807 2.9 Python

Stable diffusion for real-time music generation

Project mention: Benn Jordan's AI poison pill and the weird world of adversarial noise | news.ycombinator.com | 2025-04-15

https://github.com/riffusion/riffusion-hobby
The more advanced music generators out now I believe have more of a 'stems' approach and a larger processing pipeline to increase fidelity and add tracking vocal capability but the underlying idea is the same.
Any adversarial attack to hide information in the spectrograph to fool the model into categorizing the track as something it is not isn't different than the image adversarial attacks which have been found to have ways to be mitigated.
Various forms of filtering for inaudible spectral information coupled with methods that destroy and re-synthesize/randomize phase information would likely break this poisoning attack.
DeepFilterNet

17 13 3,407 7.3 Python

Noise supression using deep filtering

Project mention: Show HN: Background noise removal in multimedia with a single command | news.ycombinator.com | 2025-10-06
aeneas

18 4 2,742 0.0 Python

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
mkchromecast

19 18 2,303 8.7 Python

Cast macOS and Linux Audio/Video to your Google Cast and Sonos Devices
matchering

20 4 2,297 6.1 Python

🎚️ Open Source Audio Matching and Mastering
Tauon

21 9 2,287 9.8 Python

The music player of today! :city_sunrise:
m3u8

22 2 2,216 7.7 Python

Python m3u8 Parser for HTTP Live Streaming (HLS) Transmissions
vocal-remover

23 6 1,717 6.1 Python

Vocal Remover using Deep Neural Networks
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Audio discussion

Python Audio related posts

Ask HN: What Are You Working On? (Nov 2025

177 projects | news.ycombinator.com | 9 Nov 2025
Beets: The music geek's media organizer

8 projects | news.ycombinator.com | 9 Nov 2025
5 must know open-source repositories to build cool AI apps

6 projects | dev.to | 29 Oct 2025
Creating a realtime voice agent using OpenAI's new gpt-realtime speech-to-speech model

3 projects | dev.to | 6 Oct 2025
Show HN: Background noise removal in multimedia with a single command

2 projects | news.ycombinator.com | 6 Oct 2025
Why I Ditched Spotify, and How I Set Up My Own Music Stack

12 projects | news.ycombinator.com | 4 Sep 2025
Can't pay, won't pay: streaming services are driving viewers back to piracy

1 project | news.ycombinator.com | 14 Aug 2025
A note from our sponsor - InfluxDB
www.influxdata.com | 15 Nov 2025

InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source Audio projects in Python? This list will help you:

#	Project	Stars
1	ultimatevocalremovergui	22,421
2	beets	14,260
3	speechbrain	10,758
4	AudioGPT	10,200
5	pydub	9,637
6	SpeechRecognition	8,898
7	jukebox	8,014
8	librosa	7,998
9	ffsubsync	7,413
10	dejavu	6,661
11	pyAudioAnalysis	6,166
12	Porcupine	4,487
13	basic-pitch	4,419
14	picard	4,356
15	distil-whisper	3,979
16	riffusion-hobby	3,807
17	DeepFilterNet	3,407
18	aeneas	2,742
19	mkchromecast	2,303
20	matchering	2,297
21	Tauon	2,287
22	m3u8	2,216
23	vocal-remover	1,717

Python Audio

Top 23 Python Audio Projects

Python Audio discussion

Python Audio related posts

Ask HN: What Are You Working On? (Nov 2025

Beets: The music geek's media organizer

5 must know open-source repositories to build cool AI apps

Creating a realtime voice agent using OpenAI's new gpt-realtime speech-to-speech model

Show HN: Background noise removal in multimedia with a single command

Why I Ditched Spotify, and How I Set Up My Own Music Stack

Can't pay, won't pay: streaming services are driving viewers back to piracy

Index

Did you know that Python is the 2nd most popular programming language based on number of references?

Did you know that Python is
the 2nd most popular programming language
based on number of references?