moltbot Public
Forked from openclaw/openclaw
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
TypeScript MIT License Updated Jan 28, 2026
MSST-WebUI Public
Forked from SUC-DriverOld/MSST-WebUI
A WebUI app for Music-Source-Separation-Training and we packed UVR together!
Python GNU Affero General Public License v3.0 Updated Apr 2, 2025
Applio Public
Forked from IAHispano/Applio
A simple, high-quality voice conversion tool focused on ease of use and performance.
Python MIT License Updated Mar 30, 2025
DeepSeek-VL2 Public
Forked from deepseek-ai/DeepSeek-VL2
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Python MIT License Updated Jan 29, 2025
TalkingGaussian Public
Forked from Fictionarry/TalkingGaussian
[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
Python Updated Nov 26, 2024
F5-TTS Public
Forked from SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Python MIT License Updated Oct 16, 2024
SLAM-LLM Public
Forked from X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
Python MIT License Updated Oct 5, 2024
fish-speech Public
Forked from fishaudio/fish-speech
Brand new TTS solution
Python Other Updated Sep 16, 2024
3dgs-avatar-release Public
Forked from mikeqzy/3dgs-avatar-release
3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting
Python MIT License Updated Sep 10, 2024
webMUSHRA Public
Forked from audiolabs/webMUSHRA
a MUSHRA compliant web audio API based experiment software
JavaScript Other Updated Aug 9, 2024
VoiceCraft Public
Forked from jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Jupyter Notebook Other Updated Apr 3, 2024
HierSpeechpp Public
Forked from sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
Python MIT License Updated Feb 20, 2024
vall-e Public
Forked from lifeiteng/vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Python Apache License 2.0 Updated Feb 1, 2024
MiniGPT-4 Public
Forked from Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2
Python BSD 3-Clause "New" or "Revised" License Updated Oct 13, 2023
sherpa Public
Forked from k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi
C++ Apache License 2.0 Updated Aug 16, 2023
bark Public
Forked from suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Jupyter Notebook MIT License Updated May 4, 2023
audiolm-pytorch Public
Forked from lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Python MIT License Updated Mar 20, 2023
tuning_playbook Public
Forked from google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
Other Updated Feb 2, 2023
CLAP Public
Forked from LAION-AI/CLAP
Contrastive Language-Audio Pretraining
Python Creative Commons Zero v1.0 Universal Updated Jan 24, 2023
korean-romanizer Public
Forked from osori/korean-romanizer
A Python library for Korean romanization
Python Other Updated Jan 17, 2023
FACEGOOD-Audio2Face Public
Forked from FACEGOOD/FACEGOOD-Audio2Face
http://www.facegood.cc
Python MIT License Updated Jun 25, 2022
gpu-burn Public
Forked from wilicc/gpu-burn
Multi-GPU CUDA stress test
C++ BSD 2-Clause "Simplified" License Updated Jun 2, 2022
photometric_optimization Public
Forked from HavenFeng/photometric_optimization
Photometric optimization code for creating the FLAME texture space and other applications
Python MIT License Updated Mar 31, 2022
Speech-Backbones Public
Forked from huawei-noah/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Python Updated Mar 6, 2022
wenet Public
Forked from wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
C++ Apache License 2.0 Updated Nov 24, 2021
YOLOX_AUDIO Public
Forked from intflow/YOLOX_AUDIO
Audio event detection model based on YOLOX
Python Apache License 2.0 Updated Nov 16, 2021