Concordia University - Montréal, Québec, Canada
https://www.linkedin.com/in/luca-della-libera
Stars
A method that directly addresses the modality gap by aligning speech tokens with the corresponding text transcription during the tokenization stage.
Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequences of interleaved semantic and acoustic tokens.
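The interleaving idea is easy to illustrate: each audio frame contributes one semantic token followed by the acoustic tokens from the remaining codebooks, and the flattened stream is modeled autoregressively by a single decoder. A minimal sketch in plain Python (the frame layout and token values are illustrative assumptions, not Llama-Mimi's exact format):

```python
# Hypothetical per-frame tokens: codebook 0 is "semantic", the rest are acoustic.
# A Mimi-style codec emits one token per codebook per frame; values here are made up.
frames = [
    [101, 7, 42, 13],  # frame 0: [semantic, acoustic_1, acoustic_2, acoustic_3]
    [102, 9, 55, 21],  # frame 1
    [103, 3, 17, 38],  # frame 2
]

# Flatten into a single interleaved sequence for a decoder-only LM:
# s_0, a_0^1, a_0^2, a_0^3, s_1, a_1^1, ...
interleaved = [tok for frame in frames for tok in frame]
print(interleaved)  # [101, 7, 42, 13, 102, 9, 55, 21, 103, 3, 17, 38]
```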
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding
Codec for the paper "LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis"
[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching
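Flow-matching vocoders of this kind train a network to predict the velocity that transports noise to the target waveform along a straight path. A generic conditional flow-matching loss in PyTorch (this is the standard objective, not WaveFM's actual training code; `model` and its conditioning are placeholders):

```python
import torch

def flow_matching_loss(model, x1, cond):
    """Generic conditional flow-matching objective (not WaveFM's exact code).

    x1:   target waveform batch, shape (B, T)
    cond: conditioning features (e.g. a mel spectrogram), passed through as-is
    """
    x0 = torch.randn_like(x1)          # noise sample
    t = torch.rand(x1.shape[0], 1)     # per-example time in [0, 1]
    xt = (1 - t) * x0 + t * x1         # point on the straight noise-to-data path
    target_velocity = x1 - x0          # d(xt)/dt along that path
    pred_velocity = model(xt, t, cond) # network predicts the velocity
    return torch.mean((pred_velocity - target_velocity) ** 2)
```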
Awesome speech/audio LLMs, representation learning, and codec models
Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"
Hackable and optimized Transformer building blocks, supporting composable construction.
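Its best-known primitive is memory-efficient attention; a minimal call looks like this (assuming xFormers is installed with a working CUDA backend; shapes follow the batch, sequence, heads, head-dim convention):

```python
import torch
import xformers.ops as xops

# Query/key/value in (batch, seq_len, num_heads, head_dim) layout.
q = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)
k = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)
v = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)

# Computes softmax(q @ k^T / sqrt(d)) @ v without materializing the full
# attention matrix, saving memory at long sequence lengths.
out = xops.memory_efficient_attention(q, k, v)
```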
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
Faster Whisper transcription with CTranslate2
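A typical transcription call, following the project's README (the model size, device settings, and audio path are placeholder choices):

```python
from faster_whisper import WhisperModel

# "large-v3" and the CUDA/float16 settings are illustrative choices.
model = WhisperModel("large-v3", device="cuda", compute_type="float16")

segments, info = model.transcribe("audio.wav", beam_size=5)
print(f"Detected language: {info.language} (p={info.language_probability:.2f})")
for segment in segments:
    print(f"[{segment.start:.2f}s -> {segment.end:.2f}s] {segment.text}")
```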
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Vector (and Scalar) Quantization, in Pytorch
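Basic usage mirrors the library's README: quantize a batch of vectors against a learned codebook and get back the quantized output, code indices, and commitment loss (the hyperparameters below follow the README example and are assumptions, not recommendations):

```python
import torch
from vector_quantize_pytorch import VectorQuantize

vq = VectorQuantize(
    dim=256,                # feature dimension of the inputs
    codebook_size=512,      # number of codebook entries
    decay=0.8,              # EMA decay for codebook updates
    commitment_weight=1.0,  # weight of the commitment loss term
)

x = torch.randn(1, 1024, 256)            # (batch, sequence, dim)
quantized, indices, commit_loss = vq(x)  # (1, 1024, 256), (1, 1024), scalar
```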
[ICASSP 2024] Generative De-Quantization for Neural Speech Codec via Latent Diffusion.
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Continual Learning papers list, curated by ContinualAI
Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)
Convert Machine Learning Code Between Frameworks
An elegant PyTorch deep reinforcement learning library.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
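The standard interaction loop under this API takes a few lines (CartPole-v1 is just one of the reference environments):

```python
import gymnasium as gym

env = gym.make("CartPole-v1")
obs, info = env.reset(seed=42)

for _ in range(200):
    action = env.action_space.sample()  # random policy for illustration
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:         # episode ended or time limit hit
        obs, info = env.reset()

env.close()
```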
A toolkit for tinyML research and deployment
Structured state space sequence models
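At their core, these models apply a discretized linear state-space recurrence per channel. A naive sequential reference in PyTorch (real S4/S5 implementations use specific parameterizations plus parallel scans or convolutions; this sketch only shows the recurrence itself, with made-up parameters):

```python
import torch

def ssm_scan(A, B, C, u):
    """Naive diagonal SSM recurrence: x_k = A * x_{k-1} + B * u_k, y_k = C * x_k.

    A, B, C: (state_dim,) diagonal parameters (assumed already discretized)
    u:       (seq_len,) scalar input sequence
    """
    x = torch.zeros_like(A)
    ys = []
    for u_k in u:
        x = A * x + B * u_k       # state update
        ys.append((C * x).sum())  # scalar readout
    return torch.stack(ys)

# Toy example with a stable diagonal system.
state_dim = 16
A = torch.rand(state_dim) * 0.9  # eigenvalues in (0, 0.9) for stability
B = torch.randn(state_dim)
C = torch.randn(state_dim)
y = ssm_scan(A, B, C, torch.randn(128))
```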