Lists (3)
Sort Name ascending (A-Z)
Stars
Official implementation of the paper PitchFlower: A flow-based neural audio codec with pitch controllability
Meet CatBot - the c(h)atbot that showcased RAG evaluation using Ragas during PyData Amsterdam 2025.
Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch
Noise removal/ reducer from the audio file in python. De-noising is done using Wavelets and thresholding is done by VISU Shrink thresholding technique
[CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Official implementation of "Separate Anything You Describe"
NetLogo Syntax Highlighting for Visual Studio Code
An implementation of local windowed attention for language modeling
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
An open-source RAG-based tool for chatting with your documents.
Free Tailwind CSS v4 components for your next project, designed to enhance your web development with the latest features and styles 🚀
Inference and training library for high-quality TTS models.
Free and Open Source Alternative to Splitwise. Share expenses with your friends and family.
a MUSHRA compliant web audio API based experiment software
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
Convolutional layer for Kolmogorov-Arnold Network (KAN)