Codestin Search App

Emrys365 · 2023-07-21T14:29:43Z

This PR updates some SE recipes to add more subsets or data files:

egs2/chime4/enh1: added the dev and test subsets for the 2ch track
egs2/librimix/enh1: added data preparation of the transcriptions; updated README.md
fixed a permutation-related bug in the SE scoring stage

simpleoier

Thanks!

simpleoier · 2023-07-21T15:51:53Z

egs2/chime4/enh1/local/data.sh

    local/simu_ext_chime4_data_prep.sh --track 6 isolated_6ch_track ${odir}/audio/16kHz
    #  (2) {tr05,dt05,et05}_real_isolated_6ch_track
    local/real_ext_chime4_data_prep.sh --track 6 isolated_6ch_track ${CHIME4}/data/audio/16kHz/isolated_6ch_track
+


Will you also update the corresponding results in the README?

Yes, I can reuse previous models to do the evaluation.

simpleoier · 2023-07-21T15:59:11Z

egs2/chime4/enh1/local/real_ext_chime4_data_prep.sh

    sed -E "s#isolated_1ch_track/(.*)\.wav#isolated_6ch_track/\1.CH0.wav#g" ${x}_wav.scp > ${x}_spk1_wav.scp
  done

+elif [[ "$track" == "2" ]]; then


Can we get the same results if we follow the 6-ch track? If so, we can probably combine 2-ch and 6-ch track. E.g.

# 2-ch track for ch in $(seq 1 2); do find ${audio_dir}/ -name "*.CH${ch}.wav" | grep 'tr05_bus_real\|tr05_caf_real\|tr05_ped_real\|tr05_str_real' | sort -u > tr05_real_$enhan.CH${ch}.flist find ${audio_dir}/ -name "*.CH${ch}.wav" | grep 'dt05_bus_real\|dt05_caf_real\|dt05_ped_real\|dt05_str_real' | sort -u > dt05_real_$enhan.CH${ch}.flist if $eval_flag; then find ${audio_dir}/ -name "*.CH${ch}.wav" | grep 'et05_bus_real\|et05_caf_real\|et05_ped_real\|et05_str_real' | sort -u > et05_real_$enhan.CH${ch}.flist fi # make a scp file from file list for x in $list_set; do cat $x.CH${ch}.flist | awk -F'[/]' '{print $NF}'| sed -e "s/\.CH${ch}\.wav/_REAL/" > ${x}_wav.CH${ch}.ids paste -d" " ${x}_wav.CH${ch}.ids $x.CH${ch}.flist | sort -k 1 > ${x}_wav.CH${ch}.scp done done for x in $list_set; do sed -E "s#${audio_dir}/(.*)\.CH1.wav#${audio_dir}/\1.CH0.wav#g" ${x}_wav.CH1.scp > ${x}_spk1_wav.scp mix-mono-wav-scp.py ${x}_wav.CH{1,2}.scp > ${x}_wav.scp done

No, actually only the 1ch and 2ch tracks provide the audio list, while the 6ch track does not. So we have to use different logic to prepare the data.

simpleoier · 2023-07-21T15:59:59Z

egs2/chime4/enh1/local/real_ext_chime4_data_prep.sh

+    paste -d" " ${x}_wav.ids $x.flist | sort -k 1 > ${x}_wav.scp
+    paste -d" " ${x}_wav.ids ${x}_spk1.flist | sort -k 1 > ${x}_spk1_wav.scp
+  done
+


A stupid question at line 111, why was CH2 not included?

It is a convention for the CHiME-4 data because of microphone failures.

I see. Do you use CH2 in 2-ch track?

No, the channels are specified by the official audio list.

CH2 is on the back side of the tablet, and the recording condition is the worst and also different from the others. Thus, we usually do not include them in enhancement (but we include them in ASR training)

simpleoier · 2023-07-21T16:01:18Z

egs2/chime4/enh1/local/simu_ext_chime4_data_prep.sh

    sed -E "s#\.Clean\.wav#\.Noise\.wav#g" ${x}_spk1_wav.scp > ${x}_noise_wav.scp
  done

+elif [[ "$track" == "2" ]]; then


Similar to the above. Can we merge 2-ch and 6-ch track?

No, actually only the 1ch and 2ch tracks provide the audio list, while the 6ch track does not. So we have to use different logic to prepare the data.

simpleoier · 2023-07-21T16:03:53Z

egs2/librimix/enh1/local/data.sh

+
+    mkdir -p data/local
+    for dset in "train-clean-100" "train-clean-360" "dev-clean" "test-clean"; do
+        for reader_dir in $(find -L "${LIBRISPEECH}/${dset}" -mindepth 1 -maxdepth 1 -type d | sort); do


can we simply reuse librispeech/asr1/local/data_prep.sh here?

I try to minimize the data needed for the data preparation here. For this data, we only need the clean data to prepare the transcript here.

I think it is called on every single set.

if [ ${stage} -le 2 ] && [ ${stop_stage} -ge 2 ]; then log "stage 2: Data Preparation" for part in dev-clean test-clean dev-other test-other train-clean-100 train-clean-360 train-other-500; do # use underscore-separated names in data directories. local/data_prep.sh ${LIBRISPEECH}/LibriSpeech/${part} data/${part//-/_} done fi

OK. Now I reused it.

codecov · 2023-07-21T17:59:42Z

Codecov Report

Merging #5327 (a4c84c5) into master (ff427c3) will increase coverage by 3.68%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #5327      +/-   ##
==========================================
+ Coverage   72.52%   76.21%   +3.68%     
==========================================
  Files         658      669      +11     
  Lines       59156    59565     +409     
==========================================
+ Hits        42902    45395    +2493     
+ Misses      16254    14170    -2084

Flag	Coverage Δ
test_integration_espnet1	`65.96% <ø> (-0.01%)`	⬇️
test_integration_espnet2	`48.02% <100.00%> (?)`
test_python	`66.56% <100.00%> (+0.07%)`	⬆️
test_utils	`23.17% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
espnet2/bin/enh_scoring.py	`69.04% <100.00%> (+3.17%)`	⬆️

... and 73 files with indirect coverage changes

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

simpleoier

LGTM!

Emrys365 added 3 commits July 21, 2023 10:06

Update data preparation in egs2/chime4/enh1

e09a9b9

Add data preparation for transcriptions in egs2/librimix/enh1

7111cf9

Fix a bug in SE scoring

59f8b7a

Emrys365 added Recipe ESPnet2 SE Speech enhancement labels Jul 21, 2023

mergify bot added the README label Jul 21, 2023

sw005320 requested a review from simpleoier July 21, 2023 14:33

sw005320 added this to the v.202307 milestone Jul 21, 2023

simpleoier reviewed Jul 21, 2023

View reviewed changes

Fix a unit test error

e2c1d69

Emrys365 added 2 commits July 21, 2023 15:27

Reflect comments

1752e4f

Reflect comments

a4c84c5

simpleoier approved these changes Jul 21, 2023

View reviewed changes

sw005320 approved these changes Jul 21, 2023

View reviewed changes

sw005320 added the auto-merge Enable auto-merge label Jul 21, 2023

Emrys365 mentioned this pull request Jul 22, 2023

Transcription of speech signal is not available for Librimix dataset and speech enhancement task #4745

Closed

mergify bot merged commit 353c01f into espnet:master Jul 22, 2023

Conversation

Emrys365 commented Jul 21, 2023

Uh oh!

simpleoier left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Jul 21, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

simpleoier left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov bot commented Jul 21, 2023 •

edited

Loading