Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Commit 939f1cb

Browse files
Matt Van HornYaoyaoChang
authored andcommitted
fix: use COMMON_AUDIO_EXTS for --audio_dir file discovery
The --audio_dir flag hardcodes 6 file extensions (wav, mp3, flac, mp4, m4a, webm) while audio_utils supports 25+ formats via FFmpeg. Files like .ogg, .opus, .aac, .wma are silently ignored. Use COMMON_AUDIO_EXTS from audio_utils with case-insensitive matching, consistent with the gradio demo (vibevoice_asr_gradio_demo.py:50).
1 parent 337e420 commit 939f1cb

1 file changed

Lines changed: 5 additions & 3 deletions

File tree

demo/vibevoice_asr_inference_from_file.py

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,7 @@
2020

2121
from vibevoice.modular.modeling_vibevoice_asr import VibeVoiceASRForConditionalGeneration
2222
from vibevoice.processor.vibevoice_asr_processor import VibeVoiceASRProcessor
23+
from vibevoice.processor.audio_utils import COMMON_AUDIO_EXTS
2324

2425

2526
class VibeVoiceASRBatchInference:
@@ -502,9 +503,10 @@ def main():
502503
audio_files.extend(args.audio_files)
503504

504505
if args.audio_dir:
505-
import glob
506-
for ext in ["*.wav", "*.mp3", "*.flac", "*.mp4", "*.m4a", "*.webm"]:
507-
audio_files.extend(glob.glob(os.path.join(args.audio_dir, ext)))
506+
supported = set(e.lower() for e in COMMON_AUDIO_EXTS)
507+
for f in os.listdir(args.audio_dir):
508+
if os.path.splitext(f)[1].lower() in supported:
509+
audio_files.append(os.path.join(args.audio_dir, f))
508510

509511
if args.dataset:
510512
concatenated_audio = load_dataset_and_concatenate(

0 commit comments

Comments
 (0)