Codestin Search App

akreal · 2023-10-08T15:25:04Z

What?

ASR recipe for LibriTTS with phonemized transcriptions.

Why?

As per discussion in #5393

Codecov Report

Merging #5466 (f76251b) into master (71dc9a3) will decrease coverage by 1.84%.
Report is 240 commits behind head on master.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #5466      +/-   ##
==========================================
- Coverage   77.14%   75.31%   -1.84%     
==========================================
  Files         684      707      +23     
  Lines       62713    64942    +2229     
==========================================
+ Hits        48383    48913     +530     
- Misses      14330    16029    +1699

Flag	Coverage Δ
test_configuration_espnet2	`∅ <ø> (∅)`
test_integration_espnet1	`65.67% <ø> (+0.13%)`	⬆️
test_integration_espnet2	`48.76% <ø> (-0.31%)`	⬇️
test_python_espnet1	`19.27% <ø> (-0.68%)`	⬇️
test_python_espnet2	`51.31% <ø> (-1.00%)`	⬇️
test_utils	`23.10% <ø> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

see 49 files with indirect coverage changes

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

sw005320 · 2023-10-08T17:58:21Z

Many thanks!
This is really useful.

ftshijt

Thanks for the contribution. Some minor comments/questions:

ftshijt · 2023-10-09T15:43:53Z

egs2/libritts/asr1/run.sh

+./asr.sh \
+    --lang en \
+    --ngpu 2 \
+    --nbpe 100 \


Is 100 bpe size an empirical good number?

Yes, it provided the best average phone error rate in the preliminary experiments:

LS VCTK Avg.

dev test dev test

Char 7.5 7.4 7.9 11.7 8.63

BPE 100 7.4 7.2 6.6 10.7 7.98

BPE 200 7.0 6.9 7.2 11.1 8.05

I guess larger BPE size makes model too biased towards the words appearing in LibriTTS.

ftshijt · 2023-10-09T15:44:53Z

egs2/libritts/asr1/local/phonemize_dir.py

+        text_phn = "".join(tokens).replace("<space>", " ")
+        otext.write(f"{utt} {text_phn}\n")
+
+os.replace(f"{idir}/text.phn", f"{idir}/text")


maybe consider keep the original text

for more information, see https://pre-commit.ci

akreal · 2023-10-14T11:49:21Z

Thanks for the review, @ftshijt !
I've addressed the comments and added README.

ftshijt · 2023-10-15T08:48:38Z

Looks very cool! Many thanks for your contribution.

mergify bot added the ESPnet2 label Oct 8, 2023

sw005320 added Recipe ASR Automatic speech recogntion labels Oct 8, 2023

sw005320 requested a review from ftshijt October 8, 2023 17:58

ftshijt reviewed Oct 9, 2023

View reviewed changes

Add phonemized LibriTTS ASR recipe

be22248

akreal force-pushed the phonemized-libritts branch from 3800a13 to be22248 Compare October 14, 2023 11:26

mergify bot added the README label Oct 14, 2023

[pre-commit.ci] auto fixes from pre-commit.com hooks

f76251b

for more information, see https://pre-commit.ci

akreal changed the title ~~[WIP] Add phonemized LibriTTS ASR recipe~~ Add phonemized LibriTTS ASR recipe Oct 14, 2023

ftshijt merged commit 72fd7bf into espnet:master Oct 15, 2023

akreal deleted the phonemized-libritts branch October 29, 2023 20:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add phonemized LibriTTS ASR recipe#5466

Add phonemized LibriTTS ASR recipe#5466
ftshijt merged 2 commits intoespnet:masterfrom
akreal:phonemized-libritts

akreal commented Oct 8, 2023

Uh oh!

codecov bot commented Oct 8, 2023 •

edited

Loading

Uh oh!

sw005320 commented Oct 8, 2023

Uh oh!

ftshijt left a comment

Uh oh!

ftshijt Oct 9, 2023

Uh oh!

akreal Oct 14, 2023 •

edited

Loading

Uh oh!

ftshijt Oct 9, 2023

Uh oh!

akreal Oct 14, 2023

Uh oh!

akreal commented Oct 14, 2023

Uh oh!

ftshijt commented Oct 15, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	LS		VCTK		Avg.
	dev	test	dev	test	Avg.
Char	7.5	7.4	7.9	11.7	8.63
BPE 100	7.4	7.2	6.6	10.7	7.98
BPE 200	7.0	6.9	7.2	11.1	8.05

Conversation

akreal commented Oct 8, 2023

What?

Why?

See also

Uh oh!

codecov bot commented Oct 8, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

sw005320 commented Oct 8, 2023

Uh oh!

ftshijt left a comment

Choose a reason for hiding this comment

Uh oh!

ftshijt Oct 9, 2023

Choose a reason for hiding this comment

Uh oh!

akreal Oct 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ftshijt Oct 9, 2023

Choose a reason for hiding this comment

Uh oh!

akreal Oct 14, 2023

Choose a reason for hiding this comment

Uh oh!

akreal commented Oct 14, 2023

Uh oh!

ftshijt commented Oct 15, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov bot commented Oct 8, 2023 •

edited

Loading

akreal Oct 14, 2023 •

edited

Loading