Update collect stats stage so that less memory cost in Utt_mvn#4888
Update collect stats stage so that less memory cost in Utt_mvn#4888sw005320 merged 1 commit intoespnet:masterfrom
Conversation
…ed in utterance mvn
98e4ce1 to
fb7f49e
Compare
Codecov Report
@@ Coverage Diff @@
## master #4888 +/- ##
==========================================
- Coverage 76.58% 76.58% -0.01%
==========================================
Files 603 603
Lines 53707 53700 -7
==========================================
- Hits 41131 41124 -7
Misses 12576 12576
Flags with carried forward coverage won't be shown. Click here to find out more.
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
pengchengguo
left a comment
There was a problem hiding this comment.
Sorry didn't finish it in time. It looks good to me.
|
I just want to make sure. |
|
@sw005320 I don't think we need to change it on the config side in the recipe. I tried to make it compatible with previous configs / pretrained models in the following two ways:
|
|
OK, thanks for the confirmation! |
|
@simpleoier I noticed this change (extract_feats_in_collect_stats), now. Thank you, I agree this is useful option, but I think Why not adding just |
|
@kamo-naoyuki I use |
I can't understand why you had this attribute in the pre-trained model before this PR, but it's okay. Could you implement Currently, user can't know the option of |
It was used to avoid doing model forward in collect_stats when using self-supervised learning frontends.
Sounds good. I'll do it soon. |
Thank you! |
If utt_mvn is used and
extract_feats_in_collect_statsis False in config, the scripts would skip building model and no dummy features are generated.This can reduce the memory usage in Stage 10 when large SSL models are used, e.g. HuBERT.