pengx17/audio-ai-test

Live Caption with Whisper + AI Summary

This project provides real-time audio captioning using OpenAI's Whisper model and AI-powered summarization.

Features

  • Real-time audio recording
  • Whisper transcription
  • Gemini transcription
  • AI summary

Prerequisites

  • Node.js (v18 or higher)
  • Yarn (v4.5.3 or higher)
  • Python (for Whisper)
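
The version minimums above can be checked programmatically before installing. The following is a minimal sketch, not part of the repository: the helper names `parseVersion` and `meetsMinimum` are hypothetical, and the comparison only handles plain dotted numeric versions (pre-release tags such as `-rc.1` are not treated specially).

```typescript
// Hypothetical helpers (assumption: not present in the repository).
// Turns a dotted version string such as "v18.17.0" or "4.5.3" into numbers.
function parseVersion(version: string): number[] {
  return version
    .trim()
    .replace(/^v/, "") // `node --version` prints a leading "v"
    .split(".")
    .map((part) => {
      const n = parseInt(part, 10);
      return Number.isNaN(n) ? 0 : n; // non-numeric parts count as 0
    });
}

// Returns true when `installed` is greater than or equal to `minimum`.
// Missing components are treated as 0, so "18" is compared as "18.0.0".
function meetsMinimum(installed: string, minimum: string): boolean {
  const a = parseVersion(installed);
  const b = parseVersion(minimum);
  const length = Math.max(a.length, b.length);
  for (let i = 0; i < length; i++) {
    const x = a[i] ?? 0;
    const y = b[i] ?? 0;
    if (x !== y) return x > y;
  }
  return true; // identical versions satisfy the minimum
}
```

For example, passing the output of `node --version` with the minimum `"18"`, or the output of `yarn --version` with `"4.5.3"`, indicates whether the installed tools satisfy the prerequisites.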

Setup

  1. Install dependencies:

    yarn install
  2. Set up Whisper:

    # Install the Whisper backend (mlx-whisper runs only on Apple silicon Macs)
    pip install -U mlx-whisper

Development

  1. From the project root, start the frontend development server:

    yarn dev
  2. In a separate terminal, start the backend server:

    cd server
    yarn dev
  3. Open your browser and navigate to http://localhost:5173

Project Structure

  • /src - Frontend React application
  • /server - Backend server with Whisper integration
