- University of Rochester
- Rochester, NY
- https://scholar.google.com/citations?user=ng447e0AAAAJ&hl=en
Stars
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Code for the ICASSP 2024 paper "BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer", an online beat tracking system based on a streaming Transformer
Python audio and music signal processing library
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Core Engine of Singing Voice Conversion & Singing Voice Clone
Code for reproducing the experiments and results of "Multi-Source Contrastive Learning from Musical Audio", accepted for publication in SMC2023
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Code for the Million Song Dataset. The dataset contains metadata and audio analysis for a million tracks and is a collaboration between The Echo Nest and LabROSA. See website for details.
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
Real-time face swap for PC streaming or video calls
Generate Amazing Anime Pictures With BigGAN. Just Have Fun !!!
Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"
pepy is a site for getting statistics about any Python package.
PyTorch library for fast transformer implementations
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
56 languages, 1 model: multilingual ASR
🤗 ParsBERT: Transformer-based Model for Persian Language Understanding
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
Scooch Configures Object Oriented Class Hierarchies for Python
Tutorial on Tempo, Beat and Downbeat estimation
Machine learning tools and framework for automatic music transcription.
Python library for audio and music analysis
Code for the paper "Learning Sparse Analytic Filters for Piano Transcription".
Frontend filterbank learning module with HVQT initialization capabilities.