AI Companion Video Call & Streaming Platform

A full-stack web application for real-time video calls with AI companions, featuring WebRTC peer-to-peer streaming, intelligent conversations powered by Google Gemini, voice synthesis via ElevenLabs, lifelike avatars from D-ID, and conversation memory via LangMem.

Architecture

Frontend: React + TypeScript + Vite + Tailwind CSS (deployed on Vercel)
Backend: Python + FastAPI + Socket.IO (deployed on Render)
Database: Supabase (PostgreSQL + Auth + Storage)
AI Services: Google Gemini, ElevenLabs, D-ID, LangMem
Real-time: WebRTC + Socket.IO for signaling

Features

User authentication with Supabase Auth
Browse and select AI companions
Real-time video calls with WebRTC
AI-powered conversations with LangMem context retention
Voice synthesis for AI responses via ElevenLabs
D-ID animated talking avatars
Real-time text chat during calls
Call recording and playback
Responsive design for mobile and desktop
WebRTC signaling via Socket.IO
Row-level security for data protection

Prerequisites

Node.js 18+ and npm
Python 3.11+
Supabase account
Google Gemini API key
ElevenLabs API key
D-ID API key
LangMem (auto-initializes)
Redis instance (optional, for caching)
Twilio account (optional, for TURN server)

Setup

Frontend Setup

Navigate to the frontend directory:

cd frontend

Install dependencies:

npm install

Copy .env.example to .env and fill in your values:

cp .env.example .env

Start the development server:

npm run dev

Backend Setup

Navigate to the backend directory:

cd backend

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Copy .env.example to .env and fill in your values:

cp .env.example .env

Start the server:

python main.py

Database Setup

Run the migration file to create all required tables:

Go to your Supabase project dashboard
Navigate to SQL Editor
Copy the contents of supabase/migrations/001_initial_schema.sql
Execute the SQL

This creates the following tables with Row Level Security:

profiles - User profiles
companions - AI companion data
video_rooms - Video call rooms
messages - Chat messages
call_recordings - Recording metadata
conversation_contexts - Conversation history

See docs/architecture.md for detailed schema documentation.

Environment Variables

Frontend (.env)

VITE_SUPABASE_URL=your_supabase_url
VITE_SUPABASE_ANON_KEY=your_supabase_anon_key
VITE_BACKEND_URL=http://localhost:8000
VITE_WS_URL=http://localhost:8000

Backend (.env)

SUPABASE_URL=your_supabase_url
SUPABASE_SERVICE_KEY=your_supabase_service_key
GEMINI_API_KEY=your_gemini_api_key
ELEVENLABS_API_KEY=your_elevenlabs_api_key
DID_API_KEY=your_did_api_key
REDIS_URL=redis://localhost:6379
TWILIO_ACCOUNT_SID=your_twilio_account_sid
TWILIO_AUTH_TOKEN=your_twilio_auth_token
FRONTEND_URL=http://localhost:5173
PORT=8000

Deployment

Frontend (Vercel)

Push your code to a Git repository
Import the project in Vercel
Set the root directory to frontend
Add environment variables in Vercel dashboard
Deploy

Backend (Render)

Push your code to a Git repository
Create a new Web Service in Render
Set the root directory to backend
Add environment variables in Render dashboard
Deploy

Render will use the render.yaml configuration file automatically.

API Documentation

Once the backend is running, visit http://localhost:8000/docs for interactive API documentation.

Key Endpoints

REST API

GET /api/companions - List all companions
POST /api/video/rooms - Create a video room
GET /api/webrtc/config - Get WebRTC configuration
POST /api/did/streams - Create D-ID avatar stream
POST /api/video/recordings - Upload call recording

WebSocket Events

join, offer, answer, candidate - WebRTC signaling
chat_message - Real-time chat
end_call - End video session

Technologies Used

Frontend

React 18
TypeScript
Vite
Tailwind CSS
React Router
Zustand (state management)
Socket.IO Client
Supabase JS Client
date-fns

Backend

FastAPI
Python Socket.IO
Supabase Python Client
Google Generative AI (Gemini)
ElevenLabs
D-ID
LangMem
Redis
Pydantic

Project Structure

project/
├── frontend/
│   ├── src/
│   │   ├── components/    # React components
│   │   ├── pages/         # Page components
│   │   ├── hooks/         # Custom React hooks
│   │   ├── services/      # API and WebSocket services
│   │   ├── stores/        # Zustand stores
│   │   ├── contexts/      # React contexts
│   │   ├── types/         # TypeScript types
│   │   └── utils/         # Utility functions
│   └── package.json
└── backend/
    ├── routes/            # API routes
    ├── services/          # Business logic services
    ├── websocket/         # WebSocket handlers
    ├── models/            # Pydantic models
    ├── utils/             # Utility functions
    ├── config/            # Configuration
    └── requirements.txt

Development

Frontend

cd frontend
npm run dev          # Start dev server
npm run build        # Build for production
npm run lint         # Run linter
npm run typecheck    # Type checking

Backend

cd backend
python main.py    # Start dev server

License

MIT

Documentation

Architecture Documentation - Detailed system architecture, data flows, and technical specifications
Setup Guide - Comprehensive setup and deployment instructions
API Documentation - Interactive API docs (when backend is running)

Recent Improvements

LangMem Integration

Conversation memory across sessions
Context-aware AI responses
User interaction history tracking

D-ID Avatar Streaming

Real-time animated talking avatars
WebRTC-based avatar streaming
Text-to-avatar speech synthesis

Enhanced Security

Row Level Security on all tables
JWT-based authentication
Secure WebSocket connections
Environment-specific configurations

Improved Architecture

Modular service layer
Proper error handling
Production-ready configuration
Comprehensive documentation

Support

For issues and questions:

Check the documentation in docs/
Review the setup guide
Open an issue on the repository

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
backend		backend
frontend		frontend
supabase/migrations		supabase/migrations
.gitignore		.gitignore
README.md		README.md
SETUP_INSTRUCTIONS.md		SETUP_INSTRUCTIONS.md
package.json		package.json

apoorvmaurya/companion

Folders and files

Latest commit

History

Repository files navigation

AI Companion Video Call & Streaming Platform

Architecture

Features

Prerequisites

Setup

Frontend Setup

Backend Setup

Database Setup

Environment Variables

Frontend (.env)

Backend (.env)

Deployment

Frontend (Vercel)

Backend (Render)

API Documentation

Key Endpoints

REST API

WebSocket Events

Technologies Used

Frontend

Backend

Project Structure

Development

Frontend

Backend

License

Documentation

Recent Improvements

LangMem Integration

D-ID Avatar Streaming

Enhanced Security

Improved Architecture

Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages