Command line interface for the built-in speech recognition and transcription capabilities in macOS.
Updated May 29, 2025 - Objective-C
💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!
OCTRA is a web-application for the orthographic transcription of audio files.
French audio transcription using Gradio
🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Markdown), and support for 20+ audio formats. No external APIs required - works entirely offline.
WhisperVoice is a browser extension that converts speech to text in real-time using speech recognition APIs. It’s perfect for quick transcriptions, note-taking, and accessibility, supporting multiple languages and customizable settings for a tailored experience.
AI-Video-Transcriber is an intelligent, open-source tool that automatically transcribes video and audio files using advanced artificial intelligence. It supports multiple languages, accurate speech recognition, and provides easy-to-read text transcripts for content creators, educators, and businesses.
Dictator – Supercharge Cursor Chat with voice-to-text, custom AI prompts, and workflow automation. Speak your ideas, inject templates instantly, and code faster with AI-powered assistance.
Self-hosted AI tool to transcribe and summarize meetings. Upload audio files, transcribe with Whisper, and generate structured summaries using OpenAI GPT or Google Gemini.
The Whisper Subtitle Generator leverages OpenAI's Whisper model to generate subtitles from audio and video files. This Python-based tool supports multiple languages and employs advanced audio processing techniques to ensure high accuracy in transcription.
A simple webpage for image-to-text transcription, built on the Tesseract OCR engine.
MinuteMaster is a Python tool that transcribes audio, corrects the text using AI, and generates summaries for quick analysis.
Transcript Subtitle Extractor is a lightweight web application built with Flask that retrieves and displays YouTube video transcripts using the YouTube Transcript API. It provides a simple interface for extracting subtitles by entering a YouTube video ID.