Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Add evaluation scripts with DiscreteSpeechMetrics.#5661

Merged
mergify[bot] merged 3 commits intoespnet:masterfrom
Takaaki-Saeki:spberteval
Feb 22, 2024
Merged

Add evaluation scripts with DiscreteSpeechMetrics.#5661
mergify[bot] merged 3 commits intoespnet:masterfrom
Takaaki-Saeki:spberteval

Conversation

@Takaaki-Saeki
Copy link
Contributor

@Takaaki-Saeki Takaaki-Saeki commented Feb 15, 2024

What?

This PR adds evaluation scripts using discrete-speech-metrics, which can be used to evaluate text-to-speech, noisy speech, and speech enhancement.
The evaluation metrics are computed between the candidate speech and the corresponding reference speech.

Why?

The metrics in discrete-speech-metrics are effective for speech synthesis evaluation because they better correlate with human subjective evaluation scores than MCD.
They also show higher correlations than UTMOS if reference speech is available.

They can also be effective for noisy speech evaluation.

See also

Toolkit: https://github.com/Takaaki-Saeki/DiscreteSpeechMetrics

@codecov
Copy link

codecov bot commented Feb 15, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (4a4c8d5) 75.07% compared to head (1645f15) 76.12%.
Report is 1148 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #5661      +/-   ##
==========================================
+ Coverage   75.07%   76.12%   +1.05%     
==========================================
  Files         709      744      +35     
  Lines       65290    69246    +3956     
==========================================
+ Hits        49017    52715    +3698     
- Misses      16273    16531     +258     
Flag Coverage Δ
test_configuration_espnet2 ∅ <ø> (∅)
test_integration_espnet1 62.92% <ø> (-2.62%) ⬇️
test_integration_espnet2 49.00% <ø> (+0.29%) ⬆️
test_python_espnet1 18.36% <ø> (-0.81%) ⬇️
test_python_espnet2 52.65% <ø> (+1.36%) ⬆️
test_utils 22.15% <ø> (?)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@Takaaki-Saeki Takaaki-Saeki changed the title Add evaluation scripts with SpeechBERTScore and SpeechBLEU. Add evaluation scripts with DiscreteSpeechMetrics. Feb 15, 2024
@sw005320 sw005320 requested a review from ftshijt February 15, 2024 23:00
@sw005320 sw005320 added New Features TTS Text-to-speech labels Feb 15, 2024
Copy link
Collaborator

@ftshijt ftshijt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The scripts looks great to me!

It would be great to

  • add corresponding examples in egs2/TEMPLATE/tts1/README.md
  • corresponding entries in the tools/Makefile
  • try-exception to check if discrete_speech_metrics are installed or not

@mergify mergify bot added the README label Feb 19, 2024
@sw005320 sw005320 added this to the v.202405 milestone Feb 19, 2024
@sw005320 sw005320 added the auto-merge Enable auto-merge label Feb 22, 2024
@sw005320
Copy link
Contributor

LGTM!
After the CI test, I'll merge this PR.

@mergify mergify bot merged commit 2aa734d into espnet:master Feb 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants