Update egs2/TEMPLATE/asr2/asr2.sh #5682

Merged
mergify[bot] merged 3 commits into espnet:master from simpleoier:is2024_dsu_asr2
Feb 27, 2024
Conversation

@simpleoier (Collaborator) commented Feb 27, 2024

What?

This PR updates asr2.sh: utterance filtering now runs at stage 4, and the kmeans step moves after it, to stage 5.
In the previous script, filtering ran after kmeans, in stage 6, and only removed utterances whose target text had zero length. Filtering is now stage 4 and takes both the wav length and the target text length into account. The previous stage 6 is no longer needed.
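
The filtering rule described above can be illustrated with a minimal Python sketch. This is not the code in asr2.sh; the function name `filter_utterances`, the in-memory dictionaries, and the `min_sec` / `max_sec` thresholds are all hypothetical and chosen only to show the idea of filtering on wav length and target text length together.

```python
def filter_utterances(durations, texts, min_sec=0.1, max_sec=20.0):
    """Return the IDs of utterances that pass both length checks.

    durations: dict mapping utterance ID -> audio duration in seconds
    texts:     dict mapping utterance ID -> target text
    min_sec, max_sec: hypothetical duration bounds (assumed values)
    """
    kept = []
    for utt_id, dur in durations.items():
        # Drop utterances whose audio is too short or too long.
        if not (min_sec <= dur <= max_sec):
            continue
        # Drop utterances whose target text is missing or empty.
        if len(texts.get(utt_id, "").split()) == 0:
            continue
        kept.append(utt_id)
    return kept


# Hypothetical toy data: only "b" passes both checks.
durations = {"a": 0.05, "b": 3.0, "c": 25.0, "d": 2.0}
texts = {"a": "hi", "b": "hello world", "c": "long", "d": ""}
kept_ids = filter_utterances(durations, texts)
```

Running the filter before pseudo-labeling means a later step such as kmeans only sees utterances that survived both checks.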

Why?

asr2.sh suffered from utterances that are too short or too long in the kmeans pseudo-labeling step, because no filtering was done before kmeans.

@sw005320 sw005320 added this to the v.202405 milestone Feb 27, 2024
@sw005320 sw005320 added the auto-merge Enable auto-merge label Feb 27, 2024
@sw005320 (Contributor) commented

Thanks!
After the CI check, I'll merge this PR.

codecov bot commented Feb 27, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 76.12%. Comparing base (631ffa7) to head (5a0350b).

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #5682      +/-   ##
==========================================
+ Coverage   71.07%   76.12%   +5.05%     
==========================================
  Files         746      747       +1     
  Lines       69369    69409      +40     
==========================================
+ Hits        49301    52836    +3535     
+ Misses      20068    16573    -3495     
Flag                          Coverage Δ
test_configuration_espnet2    ∅ <ø> (∅)
test_integration_espnet1      62.92% <ø> (ø)
test_integration_espnet2      48.92% <ø> (?)
test_python_espnet1           18.32% <ø> (ø)
test_python_espnet2           52.70% <ø> (ø)
test_utils                    22.15% <ø> (ø)

@mergify mergify bot merged commit 5eaf59d into espnet:master Feb 27, 2024

2 participants