Carl Thomé carlthome

🎼

Music ML, audio data, self-supervised learning, differentiable programming

413 followers · 517 following

Starred repositories

v-iashin / Synchformer

Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024)

Python · 92 stars · 9 forks · Updated Sep 15, 2025

v-iashin / SparseSync

Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors" (Spotlight at BMVC 2022)

Python · 53 stars · 10 forks · Updated Jan 29, 2024

SonyCSLParis / codicodec

Encode and decode audio samples to/from continuous and discrete compressed representations!

Python · 71 stars · 3 forks · Updated Oct 24, 2025

SonyCSLParis / music2latent

Encode and decode audio samples to/from compressed latent representations!

Python · 237 stars · 22 forks · Updated Sep 19, 2025

csteinmetz1 / pyloudnorm

Flexible audio loudness meter in Python with an implementation of the ITU-R BS.1770-4 loudness algorithm

Python · 725 stars · 57 forks · Updated Jul 2, 2024

mikechambers / adb-mcp

JavaScript · 394 stars · 53 forks · Updated Oct 4, 2025

patchbanks / Lo-Fi-Drums-Dataset

Lo-Fi Drums Dataset is an open audio dataset containing 10,000 drum loops.

9 stars · 1 fork · Updated May 23, 2025

SonyCSLParis / pesto

Self-supervised learning for real-time pitch estimation

Python · 259 stars · 22 forks · Updated Oct 15, 2025

MattiasMTS / dotfiles

Lua · 4 stars · Updated Oct 24, 2025

ace-step / ACE-Step

ACE-Step: A Step Towards Music Generation Foundation Model

Python · 3,164 stars · 363 forks · Updated Jun 27, 2025

open-audio-stack / open-audio-stack-registry

Audio registry with searchable list of packages containing Plugins, Presets and Projects.

TypeScript · 35 stars · 4 forks · Updated Sep 9, 2025

multimodal-art-projection / YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python · 5,619 stars · 649 forks · Updated Jun 4, 2025

google / dataflow-ml-starter

Python · 24 stars · 8 forks · Updated Sep 15, 2025

chaosprint / glicol

Graph-oriented live coding language and music/audio DSP library written in Rust

Rust · 2,830 stars · 91 forks · Updated Apr 6, 2025

nix-community / nix-vscode-extensions

Nix expressions for VS Code Marketplace and Open VSX extensions

Haskell · 335 stars · 27 forks · Updated Oct 24, 2025

facebookresearch / audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python · 613 stars · 42 forks · Updated Jun 5, 2025

NilsDem / control-transfer-diffusion

Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024

Jupyter Notebook · 55 stars · 7 forks · Updated Feb 19, 2025

coreutils / coreutils

upstream mirror

C · 4,814 stars · 994 forks · Updated Oct 23, 2025

mlcommons / croissant

Croissant is a high-level format for machine learning datasets that brings together four rich layers.

Jupyter Notebook · 744 stars · 90 forks · Updated Oct 7, 2025

sony / hFT-Transformer

PyTorch implementation of an automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture (hFT-Transformer).

Python · 107 stars · 10 forks · Updated Jul 11, 2023

rhysd / actionlint

Static checker for GitHub Actions workflow files

Go · 3,386 stars · 195 forks · Updated Oct 23, 2025

unisonweb / unison

A friendly programming language from the future

Haskell · 6,226 stars · 284 forks · Updated Oct 24, 2025

meta-pytorch / torchcodec

PyTorch media decoding and encoding

Python · 768 stars · 66 forks · Updated Oct 24, 2025

curtified / FluxMusicGUI

Forked from camenduru/FluxMusic

Text-to-Music Generation with Rectified Flow Transformer

Python · 64 stars · 3 forks · Updated May 26, 2025
