lucadellalib

Luca Della Libera lucadellalib

PhD student at Concordia University and Mila, currently working on speech processing.

40 followers · 7 following

Concordia University
Montréal, Québec, Canada
09:23 (UTC -05:00)
https://www.linkedin.com/in/luca-della-libera

Achievements

x2 x2

Achievements

x2 x2

Stars

herimor / voxtream

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency

Python 181 22 Updated Oct 26, 2025

IDRnD / redimnet

The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python 185 16 Updated Sep 24, 2025

mtkresearch / TASTE-SpokenLM

A method that directly addresses the modality gap by aligning speech token with the corresponding text transcription during the tokenization stage.

Python 107 11 Updated Sep 3, 2025

llm-jp / llama-mimi

Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequences of interleaved semantic and acoustic tokens.

Python 28 2 Updated Sep 20, 2025

boris-kuz / snax

jax port of snac

Python 11 1 Updated May 12, 2024

dianwen-ng / MUFFIN

Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding

Python 19 3 Updated May 5, 2025

slp-rl / PAST

Python 45 6 Updated Jul 7, 2025

zhenye234 / X-Codec-2.0

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 346 49 Updated Jul 21, 2025

D-Keqi / mtla

MTLA: Multi-head Temporal Latent Attention

Python 760 35 Updated Oct 6, 2025

luotianze666 / WaveFM

[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching

Python 120 11 Updated Mar 27, 2025

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

1,202 74 Updated Aug 13, 2025

YangAi520 / APCodec

Python 35 1 Updated Sep 24, 2024

Aria-K-Alethia / BigCodec

Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"

Python 212 17 Updated Sep 19, 2024

facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,289 760 Updated Jan 10, 2026

jishengpeng / WavTokenizer

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,256 108 Updated Mar 2, 2025

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 20,536 1,702 Updated Nov 19, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,987 2,062 Updated Jan 22, 2026

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,364 847 Updated Jan 19, 2026

lucidrains / vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Python 3,833 313 Updated Jan 13, 2026

haiciyang / LaDiffCodec

ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.

Python 55 3 Updated Nov 16, 2025

bshall / knn-vc

Voice Conversion With Just Nearest Neighbors

Python 509 74 Updated Jan 16, 2026

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,773 271 Updated Feb 13, 2025

state-spaces / mamba

Mamba SSM architecture

Python 17,023 1,568 Updated Jan 12, 2026

ContinualAI / continual-learning-papers

Continual Learning papers list, curated by ContinualAI

HTML 687 58 Updated Apr 22, 2024

i404788 / s5-pytorch

Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)

Python 82 3 Updated Apr 26, 2024

ivy-llc / ivy

Convert Machine Learning Code Between Frameworks

Python 14,222 5,561 Updated Oct 17, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,699 2,374 Updated Jan 21, 2026

proroklab / popgym

Partially Observable Process Gym

Python 211 17 Updated Jun 12, 2025

thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

Python 9,754 1,239 Updated Dec 1, 2025

Farama-Foundation / Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 11,165 1,252 Updated Jan 20, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Luca Della Libera lucadellalib

Achievements

Achievements

Block or report lucadellalib

Stars

herimor / voxtream

IDRnD / redimnet

mtkresearch / TASTE-SpokenLM

llm-jp / llama-mimi

boris-kuz / snax

dianwen-ng / MUFFIN

slp-rl / PAST

zhenye234 / X-Codec-2.0

D-Keqi / mtla

luotianze666 / WaveFM

ga642381 / speech-trident

YangAi520 / APCodec

Aria-K-Alethia / BigCodec

facebookresearch / xformers

jishengpeng / WavTokenizer

SYSTRAN / faster-whisper

SWivid / F5-TTS

kyutai-labs / moshi

lucidrains / vector-quantize-pytorch

haiciyang / LaDiffCodec

bshall / knn-vc

hustvl / Vim

state-spaces / mamba

ContinualAI / continual-learning-papers

i404788 / s5-pytorch

ivy-llc / ivy

espnet / espnet

proroklab / popgym

thu-ml / tianshou

Farama-Foundation / Gymnasium