Speaker embedding extractor (with ESPnet pre-trained speaker model)#5579
sw005320 merged 47 commits into espnet:master
for more information, see https://pre-commit.ci
@Jungjee Please feel free to check the implementation.
Codecov Report
Additional details and impacted files

@@            Coverage Diff             @@
##           master    #5579      +/-   ##
==========================================
- Coverage   76.53%   76.49%   -0.04%
==========================================
  Files         720      720
  Lines       66639    66607      -32
==========================================
- Hits        51001    50951      -50
- Misses      15638    15656      +18
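As a quick sanity check on the coverage figures above, the percentages can be recomputed from the hit and line counts. This is a minimal sketch: the helper name `coverage_pct` is hypothetical, and it assumes coverage is simply hits divided by lines, which may not match Codecov's exact rounding rules.

```python
def coverage_pct(hits: int, lines: int) -> float:
    """Return line coverage as a percentage (assumed formula: hits / lines)."""
    return 100.0 * hits / lines


# Counts copied from the report above: (hits, misses, lines).
base_hits, base_misses, base_lines = 51001, 15638, 66639  # master
head_hits, head_misses, head_lines = 50951, 15656, 66607  # this PR

# Hits and misses should add up to the total number of lines.
assert base_hits + base_misses == base_lines
assert head_hits + head_misses == head_lines

base = coverage_pct(base_hits, base_lines)
head = coverage_pct(head_hits, head_lines)
print(f"master: {base:.2f}%  PR: {head:.2f}%  diff: {head - base:+.2f}%")
```

The two assertions also confirm that the rows of the table are internally consistent.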
…o spk_inference
One naming-level comment.
I see, I didn't think about that.
…o spk_inference
LGTM.
Sorry, it is not done yet. Recently I have mostly focused on checking the TTS performance (which is good). I will get back to that later this week.
Sounds good.
FYI, I think replacing the existing stage 6 with this extraction can be done in another PR, since it can affect the speed of current model inference and also needs several tests. Also (maybe not a good reason), several users are already trying to use the models we uploaded, e.g., @Emrys365 for the SE challenge and @underdogliu for ASVspoof5, which is another reason for me to split the PR.
After an extra stage was added to tts.sh in espnet#5579, the following stage numbers were updated. A few were missed in that update, and this PR covers those that remained.
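To illustrate the kind of renumbering described above, here is a minimal sketch of shifting textual "stage N" references after a new stage is inserted. It is not taken from tts.sh or from either PR: the function `shift_stage_refs`, its `inserted_at` parameter, and the sample string are all hypothetical, and the stage number used in the example is an assumption chosen only for demonstration.

```python
import re


def shift_stage_refs(text: str, inserted_at: int) -> str:
    """Bump every 'stage N' reference with N >= inserted_at by one.

    Hypothetical helper: references to earlier stages are left unchanged.
    """

    def bump(match: re.Match) -> str:
        number = int(match.group(2))
        if number >= inserted_at:
            number += 1
        # Keep the original 'stage ' prefix (including its capitalization).
        return f"{match.group(1)}{number}"

    return re.sub(r"(\bstage\s+)(\d+)", bump, text, flags=re.IGNORECASE)


# Assumed example: a new stage is inserted at position 6.
doc = "Run stage 5 first, then stage 6 and Stage 7."
print(shift_stage_refs(doc, inserted_at=6))
```

A single pass like this avoids the double-bumping that can happen when references are updated by hand one number at a time, which is one way a few references end up missed.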
What?
TODO