Codestin Search App

pyf98 · 2023-07-03T06:06:34Z

This PR adds decode_options and hyp_cleaner in evaluate_whisper_inference. The decode_options can be used to control the decoding hyperparameters in Whisper model's transcribe method.

Here is an example script to evaluate a test set using Whisper:

#!/usr/bin/env bash
# Set bash to 'debug' mode, it will exit on :
# -e 'error', -u 'undefined variable', -o ... 'error in pipeline', -x 'print commands',
set -e
set -u
set -o pipefail

whisper_tag=medium
cleaner=whisper_en
hyp_cleaner=whisper_en
nj=1
test_sets="test/WSJ/test_eval92"
decode_options="{language: en, task: transcribe, temperature: 0, beam_size: 10, fp16: False}"

for x in ${test_sets}; do
    wavscp=dump/raw/${x}/wav.scp
    outdir=whisper-${whisper_tag}_outputs/${x}
    gt_text=dump/raw/${x}/text

    scripts/utils/evaluate_asr.sh \
        --whisper_tag ${whisper_tag} \
        --nj ${nj} \
        --gpu_inference true \
        --stage 2 \
        --stop_stage 3 \
        --cleaner ${cleaner} \
        --hyp_cleaner ${hyp_cleaner} \
        --decode_options "${decode_options}" \
        --gt_text ${gt_text} \
        ${wavscp} \
        ${outdir}
done

for more information, see https://pre-commit.ci

pyf98 · 2023-07-03T06:10:01Z

egs2/TEMPLATE/asr1/pyscripts/utils/evaluate_whisper_inference.py

    # 3. Build data-iterator
    info_list = []
-    wavscp = open(data_path_and_name_and_type, "r", encoding="utf-8")
+    wavscp = open(key_file, "r", encoding="utf-8")


key_file can contain only a subset of utterances due to the use of multiple jobs. data_path_and_name_and_type can contain all the data.

pyf98 · 2023-07-03T06:10:53Z

egs2/TEMPLATE/asr1/scripts/utils/evaluate_asr.sh

-        for i in $(seq "${_nj}"); do
-            cat "${logdir}/output.${i}/1best_recog/${f}"
-        done | LC_ALL=C sort -k1 >"${outdir}/${f}"
+        if [ -f "${logdir}/output.1/1best_recog/${f}" ]; then


Whisper outputs do not contain all the files.

ftshijt · 2023-07-03T06:13:27Z

Many thanks for the update!

The examples are very good to show somewhere, do you consider some places for this? (A candidate might be tts/svs templates' readme as that is having a section for ASR evaluation specifically, but then it might not be general?)

doc/espnet2_tutorial.md

codecov · 2023-07-21T02:03:21Z

Codecov Report

Merging #5272 (2b42646) into master (f122c22) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master    #5272   +/-   ##
=======================================
  Coverage   76.10%   76.10%           
=======================================
  Files         658      658           
  Lines       59156    59156           
=======================================
  Hits        45022    45022           
  Misses      14134    14134

Flag	Coverage Δ
test_integration_espnet1	`65.96% <ø> (ø)`
test_integration_espnet2	`47.51% <ø> (-0.01%)`	⬇️
test_python	`66.49% <ø> (ø)`
test_utils	`23.17% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

sw005320 · 2023-07-21T10:14:01Z

Thanks, @pyf98!

add decode_options and hyp_cleaner

89a6895

mergify bot added the ESPnet2 label Jul 3, 2023

[pre-commit.ci] auto fixes from pre-commit.com hooks

ed8e66e

for more information, see https://pre-commit.ci

pyf98 commented Jul 3, 2023

View reviewed changes

add hyp_cleaner in help msg

02cdd83

sw005320 added Enhancement Enhancement ASR Automatic speech recogntion labels Jul 3, 2023

sw005320 added this to the v.202307 milestone Jul 3, 2023

add whisper example in tutorial

b61540c

mergify bot added the Documentation label Jul 3, 2023

pyf98 commented Jul 3, 2023

View reviewed changes

doc/espnet2_tutorial.md Show resolved Hide resolved

pyf98 and others added 3 commits July 3, 2023 01:46

fix path

40584e1

Merge branch 'master' into eval-whisper

81e927a

Merge branch 'master' into eval-whisper

2b42646

sw005320 merged commit a5ad6ff into espnet:master Jul 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add decode_options and hyp_cleaner in evaluate_whisper_inference#5272

Add decode_options and hyp_cleaner in evaluate_whisper_inference#5272
sw005320 merged 7 commits intoespnet:masterfrom
pyf98:eval-whisper

pyf98 commented Jul 3, 2023

Uh oh!

pyf98 Jul 3, 2023

Uh oh!

pyf98 Jul 3, 2023

Uh oh!

ftshijt commented Jul 3, 2023

Uh oh!

Uh oh!

codecov bot commented Jul 21, 2023 •

edited

Loading

Uh oh!

sw005320 commented Jul 21, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

pyf98 commented Jul 3, 2023

Uh oh!

pyf98 Jul 3, 2023

Choose a reason for hiding this comment

Uh oh!

pyf98 Jul 3, 2023

Choose a reason for hiding this comment

Uh oh!

ftshijt commented Jul 3, 2023

Uh oh!

Uh oh!

codecov bot commented Jul 21, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

sw005320 commented Jul 21, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov bot commented Jul 21, 2023 •

edited

Loading