Conversation
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## master #5933 +/- ##
==========================================
- Coverage 55.60% 48.03% -7.58%
==========================================
Files 824 528 -296
Lines 76042 47144 -28898
==========================================
- Hits 42286 22647 -19639
+ Misses 33756 24497 -9259
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. |
for more information, see https://pre-commit.ci
|
@jctian98, can you review this PR? |
egs2/owsm_ctc_v3.1/s2t1/conf/train_s2t_multitask-ctc_ebf27_conv2d8_size1024.yaml
Show resolved
Hide resolved
|
|
||
| The training data follows the same format as the encoder-decoder OWSM v3.1, except that timestamps are removed from the `text` file. Please first follow the `egs2/owsm_v3.1/s2t1` recipe to prepare OWSM data, and then convert `text` into the new format by running `python local/convert_owsm_data.py` (the path to the BPE tokenizer needs to be modified to your path). | ||
|
|
||
| ## Pre-trained Model |
There was a problem hiding this comment.
I think it's OK to use your own style here, but if we have our classical information about the configurations written in the other README.md, that would be more informative and more reproducible.
|
@jctian98, this is a reminder. |
|
Sorry for my delay, will review it by the end of tomorrow. |
|
The code quality is very good, and nice job! @pyf98 . Additional comments: The details are good, I just raised these comments at the philosophy level. Any solution is perfect for me, and thanks for the contribution! |
|
@pyf98, any update? |
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
|
Thanks for all the comments. I have fixed them. The LM integration is not used now, but I'm keeping it because it is theoretically possible to integrate an LM. |
|
I think it's ok! Thanks for the response! @pyf98 |
|
Thanks, @pyf98! |
Add OWSM-CTC
What?
This PR adds OWSM-CTC: https://aclanthology.org/2024.acl-long.549/
TODO: