Focusing on multimodal synthesis (speech/audio/music), speech translation, and audio editing.
-
Zhejiang University
- https://liuhuadai.github.io
-
liuhuadai.github.io Public
Forked from RayeRen/acad-homepage.github.ioPersonal Homepage
SCSS MIT License UpdatedOct 20, 2025 -
FlashAudio Public
PyTorch Implementation of FlashAudio with Rectified Flow Models in Text-to-Audio Generation
-
OmniAudio Public
[ICML 2025] PyTorch Implementation of "OmniAudio: Generating Spatial Audio from 360-Degree Video"
-
Sphere360 Public
A 360-degree video dataset designed for 360-degree video-to-spatial audio generation.
4 UpdatedFeb 17, 2025 -
MEDIC Public
PyTorch Implementation of MEDIC: Zero-shot Music Editing with Disentangled Inversion Control
4 UpdatedOct 14, 2024 -
AudioLCM Public
Forked from Text-to-Audio/AudioLCMPyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.
-
-
Persona-Dialogue-Generation Public
a repository for persona multi-turn dialogue