Break language barriers instantly with AI-powered speech recognition and translation.
Install · Features · Tech Stack · Roadmap · Contributing
AsizweAI (meaning "Let Us Hear" in Zulu) is a Chrome extension that provides real-time translation of spoken vernacular, slang, and regional dialects into your preferred language. Whether you're watching a video, attending a virtual meeting, or consuming content in an unfamiliar dialect, AsizweAI captures the audio and delivers instant, context-aware translations.
Built with a focus on African languages (Zulu, Xhosa, Afrikaans) while supporting 15+ global languages, AsizweAI bridges communication gaps that traditional translators miss preserving cultural nuance, tone, and idiomatic expressions.
- Vernacular First: Unlike generic translators, we're optimized for colloquial speech, slang, and regional expressions
- Real-Time Processing: Hear translations as conversations happen, not after
- Privacy Focused: Audio is processed through secure APIs nothing is stored on our servers
- Cultural Preservation: Translations maintain the emotional intent and cultural context of the original speech
| Feature | Description |
|---|---|
| Real-Time Translation | Instant speech-to-text-to-translation pipeline with sub-second latency |
| Multi-Provider Support | Choose between OpenAI Whisper, Deepgram Nova-2, or ElevenLabs for STT |
| Voice Synthesis | Hear translations spoken aloud with natural TTS voices |
| Tab Audio Capture | Translate any audio playing in your browser tab |
- Glassmorphism UI — Premium, modern interface inspired by Notion and Linear
- One-Click Activation — Start translating with a single button press
- Live Transcript — See original and translated text in real-time
- Copy to Clipboard — Export your transcript with one click
- Swap Languages — Quickly reverse translation direction
- Volume Controls — Independent control for original audio and translation voice
- WCAG 2.1 Compliant — Full keyboard navigation and screen reader support
- Reduced Motion Support — Respects user preferences for animations
- High Contrast Mode — Enhanced visibility for users who need it
| Platform | Chrome Extension (Manifest V3) |
| Frontend | Vanilla JavaScript (ES Modules), CSS Custom Properties, HTML5 |
| STT | OpenAI Whisper · Deepgram Nova-2 · ElevenLabs Scribe · Web Speech API |
| Translation | OpenAI GPT-4o-mini · LibreTranslate (fallback) |
| TTS | ElevenLabs Turbo v2.5 · Native Web Speech Synthesis |
| Audio | Web Audio API · Tab Capture API · Offscreen Documents |
┌─────────────────────────────────────────────────────────────────┐
│ Chrome Extension │
├─────────────────────────────────────────────────────────────────┤
│ Popup UI │ Service Worker │ Offscreen Document │
│ ─────────── │ ─────────────── │ ───────────────── │
│ • Controls │ • Message Router │ • Audio Capture │
│ • Transcript │ • Tab Capture │ • STT Processing │
│ • Settings │ • State Management │ • Translation │
│ │ │ • TTS Playback │
└─────────────────────────────────────────────────────────────────┘
│
▼
┌───────────────────────────────┐
│ External APIs │
├───────────────────────────────┤
│ OpenAI · Deepgram · ElevenLabs │
└───────────────────────────────┘
- Google Chrome (v88 or later)
- An API key from one of the following:
- OpenAI (recommended)
- Deepgram (free $200 credits)
- ElevenLabs (best for African languages)
-
Clone the repository
git clone https://github.com/yourusername/asizwe-ai.git cd asizwe-ai -
Open Chrome Extensions
Navigate to
chrome://extensionsin your browser -
Enable Developer Mode
Toggle the "Developer mode" switch in the top-right corner
-
Load the extension
Click "Load unpacked" and select the
asizwe-aifolder -
Configure API Keys
- Click the AsizweAI icon in your toolbar
- Click the settings (gear) icon
- Enter your API key(s)
- Select your preferred providers
- Click "Save Settings"
-
Start translating!
Navigate to any page with audio, click the AsizweAI icon, and press the microphone button
Coming soon! The extension is currently in review.
- Navigate to a page with audio content (YouTube, a meeting, etc.)
- Click the AsizweAI extension icon
- Select your source and target languages
- Press the microphone button to start
- View real-time transcripts and hear translations
Supported Languages (16)
| Language | Code | STT | Translation | TTS |
|---|---|---|---|---|
| Auto-detect | auto |
- | - | - |
| Zulu | zu |
ElevenLabs | GPT-4o | Native |
| Xhosa | xh |
ElevenLabs | GPT-4o | Native |
| Afrikaans | af |
All | All | All |
| English | en |
All | All | All |
| Spanish | es |
All | All | All |
| French | fr |
All | All | All |
| German | de |
All | All | All |
| Portuguese | pt |
All | All | All |
| Italian | it |
All | All | All |
| Japanese | ja |
All | All | All |
| Korean | ko |
All | All | All |
| Chinese | zh |
All | All | All |
| Hindi | hi |
All | All | All |
| Arabic | ar |
All | All | All |
| Russian | ru |
All | All | All |
- Real-time tab audio capture
- Multi-provider STT support
- GPT-powered translation
- Text-to-speech output
- Glassmorphism UI redesign
- Copy transcript to clipboard
- Swap languages functionality
- WCAG accessibility compliance
- Microphone Input — Translate your own speech in conversations
- Custom Vocabulary — Add slang/jargon specific to your context
- Conversation Mode — Bi-directional translation for dialogue
- Offline Mode — Local models for privacy-sensitive use cases
- Firefox Support — Cross-browser extension
- Mobile Companion — React Native app for on-the-go translation
- API Access — Developer API for integration into other apps
- Enterprise Features — Team management, usage analytics
- Sotho (Northern & Southern)
- Tswana
- Swahili
- Yoruba
- Hausa
- Amharic
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
- Fork the Project
- Create your Feature Branch (
git checkout -b feature/amazing-feature) - Commit your Changes (
git commit -m 'Add some amazing feature') - Push to the Branch (
git push origin feature/amazing-feature) - Open a Pull Request
- Follow the existing code style (ES Modules, CSS Custom Properties)
- Ensure accessibility is maintained (test with screen readers)
- Add appropriate comments for complex logic
- Test with multiple STT/TTS providers
Distributed under the MIT License. See LICENSE for more information.
- OpenAI for Whisper and GPT models
- Deepgram for real-time speech recognition
- ElevenLabs for natural voice synthesis
- LibreTranslate for open-source translation
- The Chrome Extensions team for Manifest V3 documentation
AsizweAI — Let Us Hear
Built with purpose. Designed for connection.