Fix word skipping in inference #1197

Alykasym · 2025-10-15T10:35:24Z

A GPU inference bug caused words to be skipped at the start or end of sentences. The root cause was an ASR model that was always loaded to GPU (and left resident) even when a user supplied reference text, creating GPU memory pressure and interfering with subsequent model loads.

Fix: load the ASR model only when no reference text is provided, and explicitly unload it (followed by gc.collect() and torch.cuda.empty_cache()) immediately after transcription so the main model can load to GPU cleanly. With this change, the skipping no longer occurs in tests.
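
For illustration, here is a minimal sketch of the load-on-demand and unload pattern described above. It is not the actual diff in this PR: `resolve_ref_text`, `release_gpu_memory`, and the `load_asr` factory are hypothetical names, and the only real library calls are `gc.collect()` and `torch.cuda.empty_cache()`. The `torch` import is guarded so the sketch also runs on a machine without PyTorch.

```python
import gc

try:
    import torch
except ImportError:  # lets the sketch run without PyTorch installed
    torch = None


def release_gpu_memory():
    """Collect garbage first, then return cached CUDA blocks to the driver."""
    gc.collect()
    if torch is not None and torch.cuda.is_available():
        torch.cuda.empty_cache()


def resolve_ref_text(ref_audio, ref_text, load_asr):
    """Return ref_text, transcribing ref_audio only when ref_text is empty.

    load_asr is a hypothetical zero-argument factory that returns a callable
    mapping an audio path to a transcript string.
    """
    if ref_text and ref_text.strip():
        return ref_text  # the ASR model is never loaded on this path
    asr = load_asr()
    try:
        return asr(ref_audio).strip()
    finally:
        del asr  # drop the last local reference before freeing memory
        release_gpu_memory()
```

Calling `gc.collect()` before `torch.cuda.empty_cache()` matters because the cache can only release blocks that no live tensor still references.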

SWivid · 2025-10-15T10:41:54Z

Hi @Alykasym

> The root cause was an ASR model that was always loaded to GPU (and left resident) even when a user supplied reference text

The ASR model is not loaded if ref_text is provided.

Consider using a GPU with >= 4 GB of memory if you need ASR and TTS loaded together. Otherwise, the current PR's hardcoded modification will force the ASR model to be reloaded every time, adding significant overhead at inference.

Alykasym · 2025-10-15T10:56:52Z

Hi @SWivid, thank you for the quick response. I've tested on GPUs with 8 GB, 24 GB, and 48 GB of VRAM. For some reason I was constantly getting skipped words, especially in short sentences. But after explicitly unloading the ASR model, it never skipped a word.

> the current PR's hardcoded modification will force the ASR model to be reloaded every time, adding significant overhead at inference.

I understand that, but keeping the ASR model in GPU memory leads to skipped words, though I have no idea why exactly it happens.

Alykasym · 2025-10-15T11:01:48Z

It also doesn't skip words when I move the ASR model to the CPU. As long as the ASR model is not in the same memory as the F5 model, no words are skipped.

SWivid · 2025-10-15T11:08:10Z

@Alykasym it would be better to provide us with some reproducible buggy examples, i.e. the ref_text, ref_audio, gen_text, and the random_seed used, along with the output gen_audio that does or does not show word skipping.
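
One way to package such an example is sketched below, under assumptions: `seed_everything` and `make_repro_case` are hypothetical helpers, and the `skipped_words` field is an illustrative addition. The other field names follow the ones listed above. The `torch` import is guarded so the snippet also runs without PyTorch.

```python
import json
import random

try:
    import torch
except ImportError:  # lets the sketch run without PyTorch installed
    torch = None


def seed_everything(seed):
    """Seed Python's RNG and, when available, PyTorch's CPU and CUDA RNGs."""
    random.seed(seed)
    if torch is not None:
        torch.manual_seed(seed)
        if torch.cuda.is_available():
            torch.cuda.manual_seed_all(seed)


def make_repro_case(ref_audio, ref_text, gen_text, random_seed, skipped_words):
    """Bundle the inputs needed to rerun one generation and compare outputs."""
    return {
        "ref_audio": ref_audio,          # path to the reference audio file
        "ref_text": ref_text,            # empty string means ASR was used
        "gen_text": gen_text,            # text that was synthesized
        "random_seed": random_seed,      # seed passed to seed_everything
        "skipped_words": skipped_words,  # True if the output dropped words
    }


if __name__ == "__main__":
    case = make_repro_case("ref.wav", "", "A short sentence.", 42, True)
    seed_everything(case["random_seed"])
    print(json.dumps(case, indent=2))
```

Attaching one case with skipping and one without, generated from the same seed, would make the two code paths directly comparable.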

SWivid · 2025-10-22T01:08:41Z

👀

Fix word skipping in inference

8850e6d
