Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Add discrete-token ASR for LibriSpeech 100h#5350

Merged
sw005320 merged 3 commits intoespnet:masterfrom
akreal:asr2-librispeech-100
Jul 23, 2023
Merged

Add discrete-token ASR for LibriSpeech 100h#5350
sw005320 merged 3 commits intoespnet:masterfrom
akreal:asr2-librispeech-100

Conversation

@akreal
Copy link
Contributor

@akreal akreal commented Jul 22, 2023

What?

Discrete-token ASR for LibriSpeech 100h.

Why?

There is no discrete-token ASR recipe for this small scale dataset yet.

@mergify mergify bot added the ESPnet2 label Jul 22, 2023
@sw005320 sw005320 requested a review from simpleoier July 22, 2023 22:00
@sw005320 sw005320 added Recipe ASR Automatic speech recogntion labels Jul 22, 2023
@sw005320 sw005320 added this to the v.202307 milestone Jul 22, 2023
@akreal akreal force-pushed the asr2-librispeech-100 branch from 64a1cab to dd8ea67 Compare July 23, 2023 11:42
@mergify mergify bot added the README label Jul 23, 2023
@akreal akreal marked this pull request as ready for review July 23, 2023 11:43
@akreal akreal changed the title [WIP] Add discrete-token ASR for LibriSpeech 100h Add discrete-token ASR for LibriSpeech 100h Jul 23, 2023
@sw005320
Copy link
Contributor

Very cool!
This result is very good compared with our initial trial in https://arxiv.org/pdf/2305.18108.pdf.

Copy link
Collaborator

@simpleoier simpleoier left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you! The results are very cool.
I left two minor comments.

@@ -0,0 +1,93 @@
# Trained with A100 (40 GB) x 1 GPUs for Kmeans1K+nbpe5K. It takes 32 minutes per epoch.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you update this information?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done!

tgt_case="ts"

./asr2.sh \
--kmeans_opts "--batch_bins 4800000" \
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you reduce the number of gpu jobs in kmeans_opts: adding an argument --nj 4 for example?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done!

Copy link
Collaborator

@simpleoier simpleoier left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

@codecov
Copy link

codecov bot commented Jul 23, 2023

Codecov Report

Merging #5350 (643fc5b) into master (890ba71) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master    #5350   +/-   ##
=======================================
  Coverage   76.11%   76.11%           
=======================================
  Files         672      672           
  Lines       59859    59859           
=======================================
  Hits        45563    45563           
  Misses      14296    14296           
Flag Coverage Δ
test_configuration_espnet2 ∅ <ø> (∅)
test_integration_espnet1 65.93% <ø> (ø)
test_integration_espnet2 47.92% <ø> (ø)
test_python_espnet1 20.51% <ø> (ø)
test_python_espnet2 51.35% <ø> (ø)
test_utils 23.17% <ø> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@sw005320 sw005320 merged commit 4b5cd10 into espnet:master Jul 23, 2023
@akreal akreal deleted the asr2-librispeech-100 branch October 12, 2023 09:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants