Thanks to visit codestin.com
Credit goes to github.com

Skip to content

ESPnet Recipe for ASR on the Makerere Radio Speech Corpus#5730

Merged
sw005320 merged 16 commits intoespnet:masterfrom
satvik-dixit:tmp
May 1, 2024
Merged

ESPnet Recipe for ASR on the Makerere Radio Speech Corpus#5730
sw005320 merged 16 commits intoespnet:masterfrom
satvik-dixit:tmp

Conversation

@satvik-dixit
Copy link
Contributor

What?

This is a new recipe for preparing the Makerere Radio Speech Corpus dataset and training automatic speech recognition models on it.

Details about the dataset:

The Makerere Radio Speech Corpus is a dataset in the Luganda language. It includes 20 hours of human-transcribed radio speech.

Related Links

Details about the dataset: https://zenodo.org/records/5855017
Paper on the dataset: https://arxiv.org/abs/2206.09790

@codecov
Copy link

codecov bot commented Apr 3, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 16.28%. Comparing base (844292e) to head (56a498c).
Report is 119 commits behind head on master.

❗ Current head 56a498c differs from pull request most recent head ac71eb4. Consider uploading reports for the commit ac71eb4 to get more accurate results

Additional details and impacted files
@@             Coverage Diff             @@
##           master    #5730       +/-   ##
===========================================
+ Coverage        0   16.28%   +16.28%     
===========================================
  Files           0      767      +767     
  Lines           0    70337    +70337     
===========================================
+ Hits            0    11453    +11453     
- Misses          0    58884    +58884     
Flag Coverage Δ
test_python_espnetez 14.22% <ø> (?)
test_utils 20.92% <ø> (?)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@satvik-dixit
Copy link
Contributor Author

The error ImportError: cannot import name 'kaiser' from 'scipy.signal' (/home/runner/work/espnet/espnet/tools/venv/lib/python3.10/site-packages/scipy/signal/__init__.py) seems to be happening because kaiser should be imported from scipy.signal.windows according to the latest scipy version

@sw005320 sw005320 added Recipe ASR Automatic speech recogntion labels Apr 5, 2024
@sw005320 sw005320 added this to the v.202405 milestone Apr 5, 2024
@sw005320
Copy link
Contributor

sw005320 commented Apr 5, 2024

@jctian98, can you review this PR?

@jctian98
Copy link
Collaborator

jctian98 commented Apr 6, 2024

@satvik-dixit Thanks for making this PR. I've left some comments above.
Additionally,
(1) You split the train/valid/test sets by yourself. Could you confirm that the official release of this dataset doesn't have that split? If they already have that split, we'd better follow it.
(2) You split the train_dev/test in the Python script; and then train/dev in the shell script. Could you put this logic together? either in Python or Shell should be fine.
(3) Seems the current recipe cannot pass the CI test due to the scipy version issue. After that issue is solved, please make sure to pass other CI tests.

@sw005320
Copy link
Contributor

sw005320 commented Apr 8, 2024

@satvik-dixit Thanks for making this PR. I've left some comments above. Additionally,
(1) You split the train/valid/test sets by yourself. Could you confirm that the official release of this dataset doesn't have that split? If they already have that split, we'd better follow it.
(2) You split the train_dev/test in the Python script; and then train/dev in the shell script. Could you put this logic together? either in Python or Shell should be fine.

@satvik-dixit, please respond to them.

(3) Seems the current recipe cannot pass the CI test due to the scipy version issue. After that issue is solved, please make sure to pass other CI tests.

This is fixed.

@satvik-dixit
Copy link
Contributor Author

  1. That's right, the official release of this dataset doesn't have that split. Here are more about the dataset for reference:
    Details about the dataset: https://zenodo.org/records/5855017
    Paper on the dataset: https://arxiv.org/abs/2206.09790
  2. Yes, I can include all of it in the Python script (data_prep.py)
  3. Thanks Shinji



# Select the backend used by run.sh from "local", "stdout", "sge", "slurm", or "ssh"
cmd_backend='slurm'
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please do not change some environment related configs.
Please submit them as they are (e.g., cmd.sh and slurm.conf)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have made the suggested changes in cmd.sh and slurm.conf

- espnet version: `espnet 202402`
- pytorch version: `pytorch 2.0.1`
- Git hash: `eed7751c910977290ef9a177ea0942a0e3c2fd35`
- Commit date: `Mon Mar 25 18:26:50 2024 +0000`
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please upload the model to the HF hub and add a link here

Copy link
Contributor Author

@satvik-dixit satvik-dixit Apr 30, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have uploaded the model to the HF hub and added a link to this file. Here's the link: https://huggingface.co/satvik-dixit/asr_makerere.

@jctian98
Copy link
Collaborator

Hi @satvik-dixit , is there anything I can help?

@jctian98
Copy link
Collaborator

@satvik-dixit Thanks for the current progress. Shinji and I left some comments before. Please also kindly address those comments :)

@jctian98
Copy link
Collaborator

@sw005320 It seems some CI tests fail at the Espnet/Python Installation stage, which should not be @satvik-dixit 's fault. Is there a good way to deal with it?

@sw005320
Copy link
Contributor

LGTM.
I'll merge it with some checks.

@sw005320
Copy link
Contributor

Please add a corpus description to https://github.com/espnet/espnet/blob/master/egs2/README.md

@satvik-dixit
Copy link
Contributor Author

Added the corpus description to readme (https://github.com/espnet/espnet/blob/master/egs2/README.md)

@sw005320 sw005320 merged commit 543f488 into espnet:master May 1, 2024
@sw005320
Copy link
Contributor

sw005320 commented May 1, 2024

Thanks a lot!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ASR Automatic speech recogntion ESPnet2 README Recipe

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants