Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View hcy71o's full-sized avatar

Block or report hcy71o

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"

Python 83 4 Updated Oct 26, 2025

Long-form streaming TTS system for multi-speaker dialogue generation

Python 924 103 Updated Oct 26, 2025

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,219 104 Updated Mar 2, 2025

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech…

Python 991 87 Updated Sep 28, 2025

[ICML 2025 Tokenization Workshop] HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling

Python 56 4 Updated Sep 28, 2025

Expressive Speech Retrieval using Natural Language Descriptions of Speaking Style

Python 6 3 Updated Aug 18, 2025

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Python 677 89 Updated Oct 27, 2025

FHEVM, a full-stack framework for integrating Fully Homomorphic Encryption (FHE) with blockchain applications

Rust 24,905 1,282 Updated Oct 27, 2025

Zama Bounty Program: Contribute to the FHE space and Zama's open source libraries and get rewarded 💰

15,568 428 Updated Jul 25, 2025

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)

Jupyter Notebook 527 30 Updated Sep 8, 2025
Python 6,002 462 Updated Aug 29, 2025

Morpho Blue Protocol

Solidity 245 114 Updated Oct 15, 2025

collection of diffusion model papers categorized by their subareas

1,993 90 Updated Oct 27, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,692 1,616 Updated Jul 6, 2025

Easily configurable liquidation bot for Morpho Blue

TypeScript 77 39 Updated Oct 27, 2025

EraX Text to Speech base on F5-TTS Base V1

Python 79 24 Updated May 8, 2025
Python 14 5 Updated Aug 1, 2025

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions

Python 86 6 Updated Oct 11, 2024

GLM-4-Voice | 端到端中英语音对话模型

Python 3,067 263 Updated Dec 5, 2024

Concrete: TFHE Compiler that converts python programs into FHE equivalent

C++ 1,489 195 Updated Oct 9, 2025

Distributed Training Over-The-Internet

963 46 Updated Oct 14, 2025

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 539 32 Updated Oct 27, 2025

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion

186 13 Updated Sep 27, 2024

The official implementation of GTCRN, an ultra-lightweight SE model.

Python 465 78 Updated May 28, 2025

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Python 432 68 Updated May 19, 2025

ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis

Python 150 11 Updated Sep 20, 2024

Evaluation Protocol for Large-Scale Zero-Shot TTS Literature

Python 87 11 Updated Mar 12, 2025

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 3,083 215 Updated May 19, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,487 1,979 Updated Oct 27, 2025

Official repository of Wavehax vocoder

Python 55 3 Updated Jul 28, 2025
Next