Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Fix existing bugs in the TSE task#4915

Merged
sw005320 merged 3 commits intoespnet:masterfrom
Emrys365:tse
Feb 8, 2023
Merged

Fix existing bugs in the TSE task#4915
sw005320 merged 3 commits intoespnet:masterfrom
Emrys365:tse

Conversation

@Emrys365
Copy link
Collaborator

@Emrys365 Emrys365 commented Feb 5, 2023

This PR fixes existing bugs in the newly added TSE task. Some of them are critical that severely impede model training. Thanks to @YoshikiMas for the discussion!

Bug fixes:

  • egs2/TEMPLATE/enh1/enh.sh: enroll_spk?.scp files can be now correctly processed in Stage 3.
  • egs2/librimix/tse1/run.sh: fixed a typo
  • espnet2/bin/enh_tse_inference.py: shared_encoder can now take effect correctly in the inference stage.
  • espnet2/train/preprocessor.py: Fixed a critical bug that prepares wrong TSE training data when provided speech_ref data does not sum up to speech_mix

Typo fixing:

  • espnet2/enh/espnet_model_tse.py

To be discussed:

  • espnet2/iterators/chunk_iter_factory.py: To support chunk-based iterator for the TSE task, I have to add some hacks to this file. If you think the hacks make the script too messy, I can revert the change and think about other workarounds.

@Emrys365 Emrys365 added Bugfix ESPnet2 SE Speech enhancement labels Feb 5, 2023
@Emrys365 Emrys365 requested a review from simpleoier February 5, 2023 17:16
@codecov
Copy link

codecov bot commented Feb 6, 2023

Codecov Report

Merging #4915 (fc0d308) into master (61e0f55) will decrease coverage by 4.12%.
The diff coverage is 31.25%.

@@            Coverage Diff             @@
##           master    #4915      +/-   ##
==========================================
- Coverage   80.68%   76.56%   -4.12%     
==========================================
  Files         543      603      +60     
  Lines       48233    53744    +5511     
==========================================
+ Hits        38917    41151    +2234     
- Misses       9316    12593    +3277     
Flag Coverage Δ
test_integration_espnet1 66.33% <ø> (-0.06%) ⬇️
test_integration_espnet2 47.57% <12.50%> (-1.77%) ⬇️
test_python 66.44% <31.25%> (-2.82%) ⬇️
test_utils 23.35% <ø> (+<0.01%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
espnet2/enh/espnet_model_tse.py 80.64% <ø> (ø)
espnet2/train/preprocessor.py 27.17% <0.00%> (-0.02%) ⬇️
espnet2/iterators/chunk_iter_factory.py 70.00% <30.00%> (-5.00%) ⬇️
espnet2/bin/enh_tse_inference.py 91.02% <50.00%> (-0.72%) ⬇️
espnet2/hubert/espnet_model.py 52.57% <0.00%> (-40.57%) ⬇️
espnet2/fileio/read_text.py 65.59% <0.00%> (-28.86%) ⬇️
espnet2/text/build_tokenizer.py 77.14% <0.00%> (-4.11%) ⬇️
espnet2/asr/espnet_model.py 77.34% <0.00%> (-3.73%) ⬇️
espnet2/train/collate_fn.py 97.59% <0.00%> (-2.41%) ⬇️
...et/nets/pytorch_backend/transformer/subsampling.py 86.17% <0.00%> (-1.99%) ⬇️
... and 110 more

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@sw005320 sw005320 added this to the v.202303 milestone Feb 7, 2023
@sw005320 sw005320 merged commit 3b5513a into espnet:master Feb 8, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Bugfix ESPnet2 SE Speech enhancement

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants