Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Melvinjayson/suna

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Atlas AI - Voice-Enabled Personal Assistant

(Your intelligent voice companion)

Atlas AI Screenshot

Atlas AI is a fully open source voice-enabled personal assistant that combines the power of advanced AI with natural speech interaction. Through voice commands and conversations, Atlas AI becomes your intelligent companion for productivity, research, task management, and everyday challenges—featuring comprehensive speech capabilities and modular assistant features.

Atlas AI's powerful toolkit includes voice-to-text and text-to-speech capabilities, intent recognition for personal assistant tasks, calendar and email management, reminder systems, weather and news queries, seamless browser automation, file management, web crawling, command-line execution, and integration with various APIs and services. These capabilities work together harmoniously through natural voice interaction, allowing Atlas AI to understand your spoken requests and respond with both text and speech!

License GitHub Repo stars Issues

Table of Contents

Project Architecture

Architecture Diagram

Atlas AI consists of five main components:

Backend API

Python/FastAPI service that handles REST endpoints, thread management, and LLM integration with Anthropic, and others via LiteLLM.

Frontend

Next.js/React application providing a responsive UI with voice-enabled chat interface, dashboard, and personal assistant controls.

Voice Integration

Comprehensive speech capabilities including Speech-to-Text (STT) for voice input, Text-to-Speech (TTS) for audio responses, voice activity detection, and intent recognition for personal assistant tasks.

Agent Docker

Isolated execution environment for every agent - with browser automation, code interpreter, file system access, tool integration, and security features.

Supabase Database

Handles data persistence with authentication, user management, conversation history, file storage, agent state, analytics, and real-time subscriptions.

Voice Features

Atlas AI provides comprehensive voice interaction capabilities:

  • Speech-to-Text (STT): Real-time voice recognition with continuous listening and voice activity detection
  • Text-to-Speech (TTS): Natural voice responses with configurable voice options, rate, and pitch
  • Voice Controls: Hands-free operation with voice commands for all assistant functions
  • Multi-language Support: Voice recognition and synthesis in multiple languages
  • Noise Handling: Advanced audio processing for clear voice recognition in various environments

Personal Assistant Capabilities

Atlas AI includes modular assistant features for productivity and daily tasks:

  • Calendar Management: Create, read, update, and delete calendar events through voice commands
  • Email Operations: Compose, read, and send emails with voice dictation
  • Reminders & Tasks: Set and manage reminders, to-do lists, and task scheduling
  • Weather & News: Get current weather conditions, forecasts, and latest news updates
  • Web Search: Intelligent web search with voice queries and spoken results
  • Smart Integrations: Connect with popular services like Google Calendar, Gmail, and more

Use Cases

  1. Competitor Analysis (Watch) - "Analyze the market for my next company in the healthcare industry, located in the UK. Give me the major players, their market size, strengths, and weaknesses, and add their website URLs. Once done, generate a PDF report."

  2. VC List (Watch) - "Give me the list of the most important VC Funds in the United States based on Assets Under Management. Give me website URLs, and if possible an email to reach them out."

  3. Looking for Candidates (Watch) - "Go on LinkedIn, and find me 10 profiles available - they are not working right now - for a junior software engineer position, who are located in Munich, Germany. They should have at least one bachelor's degree in Computer Science or anything related to it, and 1-year of experience in any field/role."

  4. Planning Company Trip (Watch) - "Generate me a route plan for my company. We should go to California. We'll be in 8 people. Compose the trip from the departure (Paris, France) to the activities we can do considering that the trip will be 7 days long - departure on the 21st of Apr 2025. Check the weather forecast and temperature for the upcoming days, and based on that, you can plan our activities (outdoor vs indoor)."

  5. Working on Excel (Watch) - "My company asked me to set up an Excel spreadsheet with all the information about Italian lottery games (Lotto, 10eLotto, and Million Day). Based on that, generate and send me a spreadsheet with all the basic information (public ones)."

  6. Automate Event Speaker Prospecting (Watch) - "Find 20 AI ethics speakers from Europe who've spoken at conferences in the past year. Scrapes conference sites, cross-references LinkedIn and YouTube, and outputs contact info + talk summaries."

  7. Summarize and Cross-Reference Scientific Papers (Watch) - "Research and compare scientific papers talking about Alcohol effects on our bodies during the last 5 years. Generate a report about the most important scientific papers talking about the topic I wrote before."

  8. Research + First Contact Draft (Watch) - "Research my potential customers (B2B) on LinkedIn. They should be in the clean tech industry. Find their websites and their email addresses. After that, based on the company profile, generate a personalized first contact email where I present my company which is offering consulting services to cleantech companies to maximize their profits and reduce their costs."

  9. SEO Analysis (Watch) - "Based on my website suna.so, generate an SEO report analysis, find top-ranking pages by keyword clusters, and identify topics I'm missing."

  10. Generate a Personal Trip (Watch) - "Generate a personal trip to London, with departure from Bangkok on the 1st of May. The trip will last 10 days. Find an accommodation in the center of London, with a rating on Google reviews of at least 4.5. Find me interesting outdoor activities to do during the journey. Generate a detailed itinerary plan."

  11. Recently Funded Startups (Watch) - "Go on Crunchbase, Dealroom, and TechCrunch, filter by Series A funding rounds in the SaaS Finance Space, and build a report with company data, founders, and contact info for outbound sales."

  12. Scrape Forum Discussions (Watch) - "I need to find the best beauty centers in Rome, but I want to find them by using open forums that speak about this topic. Go on Google, and scrape the forums by looking for beauty center discussions located in Rome. Then generate a list of 5 beauty centers with the best comments about them."

Self-Hosting

Atlas AI can be self-hosted on your own infrastructure using our setup wizard. For a comprehensive guide to self-hosting Atlas AI, please refer to our Self-Hosting Guide.

The setup process includes:

  • Setting up a Supabase project for database and authentication
  • Configuring Redis for caching and session management
  • Setting up Daytona for secure agent execution
  • Integrating with LLM providers (Anthropic, OpenAI, Groq, etc.)
  • Configuring web search and scraping capabilities
  • Setting up speech services for voice capabilities
  • Configuring personal assistant integrations (Calendar, Email, etc.)

Quick Start

  1. Clone the repository:
git clone https://github.com/Melvinjayson/atlas-ai.git
cd atlas-ai
  1. Run the setup wizard:
python setup.py
  1. Start or stop the containers:
python start.py

Manual Setup

See the Self-Hosting Guide for detailed manual setup instructions.

The wizard will guide you through all necessary steps to get your Atlas AI instance up and running. For detailed instructions, troubleshooting tips, and advanced configuration options, see the Self-Hosting Guide.

Contributing

We welcome contributions from the community! Please see our Contributing Guide for more details.

Acknowledgements

Main Contributors

Technologies

License

Atlas AI is licensed under the Apache License, Version 2.0. See LICENSE for the full license text.

About

Suna - Open Source Generalist AI Agent

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • TypeScript 64.1%
  • Python 32.7%
  • PLpgSQL 2.3%
  • CSS 0.5%
  • Dockerfile 0.2%
  • HTML 0.1%
  • Other 0.1%