Voice Quality issue using Librispeech


Hello, I'm found difference between the generated audios from the provided demo notebook using Librispeech and the audios available on the web page. The generated audios lack naturalness compared to the ones on the demo; they seem to lack pauses and exhibit an unnatural tone. 

I've tried adjusting various parameters, including diffusion steps ([15, 20, 25]), embedding scale ([1, 2]), and altering the audio speed (110%) using `duration = model.predictor.duration_proj(x) / speed`. Despite these adjustments, the issue persists. Any insights or suggestions on how to improve the generated audios to match the quality of those on the demo page would be greatly appreciated.   

Generated Audio: [https://drive.google.com/file/d/1xhYaB_1jzabYbkPwt6aP3KSNDkjdwT7X/view?usp=drive_link](url)

Thank you for your time and impressive work!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Voice Quality issue using Librispeech #73

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

Voice Quality issue using Librispeech #73

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions