Codestin Search App

pyf98 · 2025-01-06T20:45:37Z

What?

Implemented a unified batch decode interface for OWSM-CTC greedy search. The decode_batch method can decode a batch of audios which can be either short-form or long-form. Each audio can be provided as a path, a numpy 1-D array or a torch 1-D tensor. This makes the usage more flexible.
Enabled flash attention for inference. With mixed precision and flash attention, we can decode ~200 samples at the same time on a GPU with 96GB memory.
Added a stand-alone utility script to average model checkpoints.

for more information, see https://pre-commit.ci

pyf98 · 2025-01-11T20:33:04Z

This PR is ready.

sw005320 · 2025-01-12T14:15:33Z

espnet/nets/pytorch_backend/transformer/attention.py

why do you need this change?

The previous code uses flash attention only during training. But we can also use it for inference.

sw005320 · 2025-01-12T14:19:36Z

espnet2/bin/s2t_inference_ctc.py

Can you add a test for this batch decoding?

codecov · 2025-01-12T23:28:09Z

Codecov Report

Attention: Patch coverage is 0.91743% with 108 lines in your changes missing coverage. Please review.

Project coverage is 14.52%. Comparing base (522891b) to head (4474435).
Report is 95 commits behind head on master.

Files with missing lines	Patch %	Lines
espnet2/bin/s2t_inference_ctc.py	0.00%	106 Missing ⚠️
...pnet/nets/pytorch_backend/transformer/attention.py	33.33%	2 Missing ⚠️

❗ There is a different number of reports uploaded between BASE (522891b) and HEAD (4474435). Click for more details.

HEAD has 8 uploads less than BASE

Flag BASE (522891b) HEAD (4474435)

test_integration_espnet2 8 0

Additional details and impacted files

@@             Coverage Diff             @@
##           master    #6007       +/-   ##
===========================================
- Coverage   47.49%   14.52%   -32.97%     
===========================================
  Files         529      854      +325     
  Lines       47850    80268    +32418     
===========================================
- Hits        22727    11660    -11067     
- Misses      25123    68608    +43485

Flag	Coverage Δ
test_integration_espnet2	`?`
test_python_espnetez	`12.72% <0.00%> (?)`
test_utils	`20.64% <33.33%> (?)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

sw005320 · 2025-01-13T09:23:19Z

Thanks!

Implement unified batch decode interface for OWSM-CTC

add file

6043163

mergify bot added the ESPnet2 label Jan 6, 2025

pre-commit-ci bot and others added 2 commits January 6, 2025 20:46

[pre-commit.ci] auto fixes from pre-commit.com hooks

813c6a0

for more information, see https://pre-commit.ci

implemente batch decode for owsm-ctc

95dcc72

mergify bot added ESPnet1 README labels Jan 8, 2025

pyf98 changed the title ~~Add a utility method to average checkpoints~~ Implement unified batch decode interface for OWSM-CTC Jan 8, 2025

[pre-commit.ci] auto fixes from pre-commit.com hooks

97a1178

for more information, see https://pre-commit.ci

sw005320 added this to the v.202503 milestone Jan 12, 2025

sw005320 reviewed Jan 12, 2025

View reviewed changes

add test for batch decode

4474435

sw005320 merged commit b927b00 into espnet:master Jan 13, 2025
38 of 39 checks passed

pyf98 deleted the owsmctc-test branch January 13, 2025 22:53

Shikhar-S pushed a commit to Shikhar-S/espnet that referenced this pull request Mar 13, 2025

Merge pull request espnet#6007 from pyf98/owsmctc-test

11b4f9d

Implement unified batch decode interface for OWSM-CTC

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement unified batch decode interface for OWSM-CTC#6007

Implement unified batch decode interface for OWSM-CTC#6007
sw005320 merged 5 commits intoespnet:masterfrom
pyf98:owsmctc-test

pyf98 commented Jan 6, 2025 •

edited

Loading

Uh oh!

pyf98 commented Jan 11, 2025

Uh oh!

sw005320 Jan 12, 2025

Uh oh!

pyf98 Jan 12, 2025

Uh oh!

sw005320 Jan 12, 2025

Uh oh!

pyf98 Jan 12, 2025

Uh oh!

codecov bot commented Jan 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

sw005320 commented Jan 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

pyf98 commented Jan 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What?

Uh oh!

pyf98 commented Jan 11, 2025

Uh oh!

sw005320 Jan 12, 2025

Choose a reason for hiding this comment

Uh oh!

pyf98 Jan 12, 2025

Choose a reason for hiding this comment

Uh oh!

sw005320 Jan 12, 2025

Choose a reason for hiding this comment

Uh oh!

pyf98 Jan 12, 2025

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Jan 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

sw005320 commented Jan 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pyf98 commented Jan 6, 2025 •

edited

Loading

codecov bot commented Jan 12, 2025 •

edited

Loading