Live Caption with Whisper + AI Summary

This project provides real-time audio captioning using OpenAI's Whisper model and AI-powered summarization.

Features

Real-time audio recording
Whisper transcription
Gemini transcription
AI summary

Prerequisites

Node.js (v18 or higher)
Yarn (v4.5.3 or higher)
Python (for Whisper)

Setup

Install dependencies:
```
yarn install
```

Set up Whisper:

# Install Whisper dependencies (Mac only)
pip install -U mlx-whisper

Development

Start the development server:
```
yarn dev
```
In a separate terminal, start the backend server:
```
cd server
yarn dev
```
Open your browser and navigate to http://localhost:5173

Project Structure

/src - Frontend React application
/server - Backend server with Whisper integration

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.yarn/releases		.yarn/releases
public		public
server		server
src		src
.gitignore		.gitignore
.yarnrc.yml		.yarnrc.yml
README.md		README.md
eslint.config.js		eslint.config.js
index.html		index.html
package.json		package.json
tsconfig.app.json		tsconfig.app.json
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Live Caption with Whisper + AI Summary

Features

Prerequisites

Setup

Development

Project Structure

About

Uh oh!

Releases

Packages

Uh oh!

Languages

pengx17/audio-ai-test

Folders and files

Latest commit

History

Repository files navigation

Live Caption with Whisper + AI Summary

Features

Prerequisites

Setup

Development

Project Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages