Self-hosted AI transcription and intelligent note-taking platform
Speakr transforms your audio recordings into organized, searchable, and intelligent notes. Built for privacy-conscious teams and individuals, it runs entirely on your own infrastructure, ensuring your sensitive conversations remain completely private.
- Smart Recording & Upload - Record directly in browser or upload existing audio files
- AI Transcription - High-accuracy transcription with speaker identification
- Interactive Chat - Ask questions about your recordings and get AI-powered answers
- Inquire Mode - Semantic search across all recordings using natural language
- Internationalization - Full support for English, Spanish, French, German, and Chinese
- Smart Tagging - Organize with tags that include custom AI prompts
- Secure Sharing - Generate secure links to share recordings
- Beautiful Themes - Light and dark modes with customizable color schemes
# Create project directory
mkdir speakr && cd speakr
# Download the Compose file, then choose one of the two .env templates below:
wget https://raw.githubusercontent.com/murtaza-nasir/speakr/master/config/docker-compose.example.yml -O docker-compose.yml
# For OpenAI or similar Whisper API:
wget https://raw.githubusercontent.com/murtaza-nasir/speakr/master/config/env.whisper.example -O .env
# OR for ASR with speaker diarization (requires additional container):
wget https://raw.githubusercontent.com/murtaza-nasir/speakr/master/config/env.asr.example -O .env
# Configure your service endpoints and API keys
nano .env # Set API endpoints (Local/OpenAI/OpenRouter/etc) and add your API keys
# Launch Speakr
docker compose up -d
# Access at http://localhost:8899
Note: The ASR option requires running the onerahmet/openai-whisper-asr-webservice container alongside Speakr. See the installation guide for complete setup.
View Full Installation Guide →
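Because the quick start depends on a correctly filled-in .env, it can help to check for keys that were left blank before running docker compose up. The following is a minimal sketch, not part of Speakr itself: the function name empty_env_keys is hypothetical, and it assumes the file uses plain KEY=value lines.

```shell
#!/usr/bin/env bash
# Sketch: print the names of keys in an env file whose value is empty.
# Assumes simple KEY=value lines; comment lines (starting with #) never match.
empty_env_keys() {
  local file="${1:-.env}"
  # Match KEY=, KEY="" or KEY='' (optionally followed by whitespace),
  # then keep only the key name to the left of the first "=".
  grep -E "^[A-Za-z_][A-Za-z0-9_]*=(\"\"|'')?[[:space:]]*\$" "$file" | cut -d= -f1
}

# Example use before launching (hypothetical workflow):
#   missing="$(empty_env_keys .env)"
#   [ -z "$missing" ] || echo "Still empty in .env: $missing"
```

An empty value is not always an error, since some settings may be optional, so treat the output as a list of entries to review rather than a hard failure.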
Complete documentation is available at murtaza-nasir.github.io/speakr
- Getting Started - Quick setup guide
- User Guide - Learn all features
- Admin Guide - Administration and configuration
- Troubleshooting - Common issues and solutions
- FAQ - Frequently asked questions
- Inline Transcript Editing - Edit speaker assignments and text directly within the speaker identification modal
- Add Speaker Functionality - Dynamically add new speakers during transcript review
- Enhanced Speaker Modal - Improved UX with hover-based edit controls and real-time updates
- GPT-5 Model Support with automatic detection
- Custom Prompt Selection for summary reprocessing
- PWA Enhancements and mobile recording improvements
Screenshots: Advanced Filtering Dashboard, AI-Powered Semantic Search, Interactive Transcription & Chat, Full Internationalization.
View Full Screenshot Gallery →
- Backend: Python/Flask with SQLAlchemy
- Frontend: Vue.js 3 with Tailwind CSS
- AI/ML: OpenAI Whisper, OpenRouter, Ollama support
- Database: SQLite (default) or PostgreSQL
- Deployment: Docker, Docker Compose
- Bulk operations for recordings (mass delete, export, tagging)
- Enhanced speaker profile management with voice signatures
- Improved mobile experience with PWA enhancements
- Plugin system for custom integrations
- Team workspaces with shared recordings
- Automated workflow triggers
- End-to-end encryption option
- Enterprise SSO integration
This project is dual-licensed:
- GNU Affero General Public License v3.0 (AGPLv3)
Speakr is offered under the AGPLv3 as its open-source license. You are free to use, modify, and distribute this software under the terms of the AGPLv3. A key condition of the AGPLv3 is that if you run a modified version on a network server and provide access to it for others, you must also make the source code of your modified version available to those users under the AGPLv3.
  - You must create a file named LICENSE (or COPYING) in the root of your repository and paste the full text of the GNU AGPLv3 license into it.
  - Read the full license text carefully to understand your rights and obligations.
- Commercial License
For users or organizations who cannot or do not wish to comply with the terms of the AGPLv3 (for example, if you want to integrate Speakr into a proprietary commercial product or service without being obligated to share your modifications under AGPLv3), a separate commercial license is available.
Please contact the Speakr maintainers for details on obtaining a commercial license.
You must choose one of these licenses under which to use, modify, or distribute this software. If you are using or distributing the software without a commercial license agreement, you must adhere to the terms of the AGPLv3.
Speakr is in active development. Planned features include a faster way to switch transcription languages on the fly.
We welcome contributions to Speakr! There are many ways to help:
- Bug Reports & Feature Requests: Open an issue
- Discussions: Share ideas and ask questions
- Documentation: Help improve our docs
- Translations: Contribute translations for internationalization
All code contributions require signing a Contributor License Agreement (CLA). This one-time process ensures we can maintain our dual-license model (AGPLv3 and Commercial).
See our Contributing Guide for complete details on:
- How the CLA works and why we need it
- Step-by-step contribution process
- Development setup instructions
- Coding standards and best practices
The CLA is automatically enforced via GitHub Actions. When you submit your first PR, our bot will guide you through signing.