Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Bugfix: JETS CTCLoss#5288

Merged
kan-bayashi merged 3 commits intoespnet:masterfrom
imdanboy:feature-jets_bugfix
Jul 24, 2023
Merged

Bugfix: JETS CTCLoss#5288
kan-bayashi merged 3 commits intoespnet:masterfrom
imdanboy:feature-jets_bugfix

Conversation

@imdanboy
Copy link
Contributor

@imdanboy imdanboy commented Jul 8, 2023

Hi, I tried to fix bug issued at #5179

  • fix 'log (unnormalized) probability' to 'log (normalized) probability' for ctc_loss function
  • move beta binomial prior into alignment learning module

In experiments of LJSPEECH, KSS and an my korean multi-spk dataset, I got similar results with the previous.
This fix seems to accelerate the diagonal alignment learning which might be helpful on multi speaker low quality dataset as mentioned in the issue.

Before

CFSD log F0 MCD SECS
LJSPEECH 6.6239 ± 2.4985 0.2905 ± 0.0682 6.6817 ± 0.5373 0.8465 ± 0.0534
KSS 8.7675 ± 7.3949 0.2092 ± 0.0693 7.2096 ± 0.8703 0.7969 ± 0.0695

After

CFSD log F0 MCD SECS
LJSPEECH 6.7197 ± 2.0207 0.2851 ± 0.0682 6.6667 ± 0.5716 0.8418 ± 0.0512
KSS 8.5586 ± 7.0130 0.2294 ± 0.0836 7.1963 ± 0.9438 0.7939 ± 0.0685

@mergify mergify bot added the ESPnet2 label Jul 8, 2023
@codecov
Copy link

codecov bot commented Jul 8, 2023

Codecov Report

Merging #5288 (8755db2) into master (f122c22) will not change coverage.
The diff coverage is 96.87%.

@@           Coverage Diff           @@
##           master    #5288   +/-   ##
=======================================
  Coverage   76.10%   76.10%           
=======================================
  Files         658      658           
  Lines       59156    59156           
=======================================
  Hits        45022    45022           
  Misses      14134    14134           
Flag Coverage Δ
test_integration_espnet1 65.96% <ø> (ø)
test_integration_espnet2 47.52% <15.62%> (ø)
test_python 66.49% <96.87%> (ø)
test_utils 23.17% <ø> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
espnet2/gan_tts/jets/alignments.py 69.49% <96.42%> (+7.53%) ⬆️
espnet2/gan_tts/jets/generator.py 84.44% <100.00%> (ø)
espnet2/gan_tts/jets/loss.py 85.45% <100.00%> (-3.44%) ⬇️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@kan-bayashi kan-bayashi merged commit 97080b4 into espnet:master Jul 24, 2023
@kan-bayashi kan-bayashi added Bugfix TTS Text-to-speech labels Jul 24, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants