Commit f7c6d2d

update asr eval results
1 parent c9c778c commit f7c6d2d

5 files changed

Lines changed: 7 additions & 3 deletions

Figures/DER.jpg

61.2 KB

Figures/cpWER.jpg

66.9 KB

Figures/tcpWER.jpg

62.7 KB

README.md

Lines changed: 0 additions & 3 deletions
@@ -20,9 +20,6 @@
 
 <h3>📰 News</h3>
 
-<img src="https://img.shields.io/badge/Status-New-brightgreen?style=flat" alt="New" />
-<img src="https://img.shields.io/badge/Feature-Realtime_TTS-blue?style=flat&logo=soundcharts" alt="Realtime TTS" />
-
 <strong>2026-01-21: 📣 We open-sourced <a href="docs/vibevoice-asr.md"><strong>VibeVoice-ASR</strong></a>, a unified speech-to-text model designed to handle 60-minute long-form audio in a single pass, generating structured transcriptions containing Who (Speaker), When (Timestamps), and What (Content), with support for User-Customized Context.</strong>
 
 2025-12-16: 📣 We added more experimental speakers for exploration, including multilingual voices and 11 distinct English style voices. [Try it](docs/vibevoice-realtime-0.5b.md#optional-more-experimental-voices). More speaker types will be added over time.

docs/vibevoice-asr.md

Lines changed: 7 additions & 0 deletions
@@ -21,6 +21,13 @@ It is a unified speech-to-text model designed to handle **1-hour long-form audio
 <img src="../Figures/VibeVoice_ASR_archi.png" alt="VibeVoice ASR Architecture" width="80%">
 </p>
 
+## Evaluation
+<p align="center">
+<img src="../Figures/DER.jpg" alt="DER" width="80%">
+<img src="../Figures/cpWER.jpg" alt="cpWER" width="80%">
+<img src="../Figures/tcpWER.jpg" alt="tcpWER" width="80%">
+</p>
+
 ## Installation
 We recommend to use NVIDIA Deep Learning Container to manage the CUDA environment.
 