Thanks to visit codestin.com
Credit goes to github.com

Skip to content

SohamRatnaparkhi/navigator-ai

Navigator AI

Intelligent Automation Within the Browser

Open-source browser automation powered by Gemini 2.5 Pro

powered by Devin

Have a doubt? Ask Devin - This repository is indexed on DeepWiki

Overview

Navigator AI empowers users and developers to seamlessly automate tasks within web browsers (with app support coming in the future). Unlike traditional browser automation tools, Navigator AI will offer:

  • A component library for direct integration into web applications
  • A browser extension for automating tasks in your browser

Consider it as Cursor/Windsurf for websites and applications. Big shoutout to Browser-Use as Navigator AI is inspired by them and currently is a kind of Chrome extension version that does what they do inside but inside YOUR browser while you retain full control to pause, stop, or restart.

⭐️ P.s. Drop a star if you want a completely FREE and open-source alternative to Manus AI. I aim to make Navigator AI on par with Manus AI. Your support will motivate me a lot. Thank you!

Key Features

  • Direct Web Integration: Embeddable React/framework components allow developers to add browser automation capabilities directly within their web/mobile applications. Users can easily use voice/text to automatically execute workflows on your app.
  • User-Friendly Extension: A browser extension that allows users to create, manage, and run repeatable workflows directly in their browser.
  • Knowledge Base Integration: Add custom rules, documentation, and knowledge bases that the agent will prioritize over its LLM-based workflow, making the agent specific to YOUR application.
  • Self-Improvement: The agent improves over time based on how you interact with different pages, especially when not actively using the agent.

Technology Stack

  • Frontend:
    • React, Vite, TypeScript (for both component library and extension)
    • Packaged in a Turborepo for efficient management
  • Backend:
    • Python, FastAPI
  • Database:
    • PostgreSQL (primary data storage)
    • Redis (caching)
    • Weaviate (vector database, deployed via Docker)

Setup Instructions

Prerequisites

Installation

  1. Clone the repository

    git clone https://github.com/SohamRatnaparkhi/navigator-ai.git
    cd navigator-ai
  2. Install dependencies

    # Install Python dependencies
    cd apps/server
    poetry install
    
    # Install Node dependencies
    cd ../extension
    pnpm install
    # OR
    npm install
  3. Run the development server

    pnpm run dev:server
    # OR
    npm run dev:server
  4. Run Redis

    cd apps/server
    docker compose up -d
  5. Build and install the extension

    # Build the extension
    pnpm run build
    # OR
    npm run build

    Then:

    • Open Chrome and navigate to chrome://extensions
    • Enable "Developer mode" (toggle in the top-right corner)
    • Click "Load unpacked" and select the /apps/extension/dist directory
    • The extension should now appear in your browser toolbar

Demos

1. Prompt: "Help me buy an m4 macbook pro with student discount. I want 24 gb/1 tb variant."

nai-1.mp4

2. Prompt: "Find the Navigator AI repo and create an issue."

Nai-2.mp4

Roadmap

Core Functionality

  • Deep agentic workflows

    • Planner at the top/each step
    • Support for multiple actions like switching tabs and copy-pasting
  • Visual Task Builder (Extension)

    • Develop a drag-and-drop interface for creating automation workflows
    • Add support for conditional logic and branching
  • Advanced DOM Interaction

    • Implement sophisticated element selection methods using vision LLMs
    • Add support for handling dynamic content

Intelligence & Learning

  • Self-Improving Agents

    • Implement feedback loops to learn from user corrections
    • Track user activity patterns (with permission) to improve automation
    • Develop metrics for measuring and reporting agent improvement
  • Knowledge Base Enhancement

    • Create an interface for managing custom rules and documentation
    • Implement priority weighting for different knowledge sources
    • Add support for importing existing documentation

Integration & Expansion

  • Third-party Integrations

    • Website-specific integrations (AWS, GCP, Amazon, etc.)
    • Multiple LLM provider support
    • API connections to popular services
  • Complex Web Interactions

    • Support for iframes and shadow DOM
    • Handling authentication and user sessions
    • Intelligent error recovery and pause mechanisms

User Experience

  • Workflow Management

    • Record and replay functionality for capturing user workflows
    • Scheduled tasks with time/interval specifications
    • Workflow sharing and importing capabilities
  • Notification System

    • Alert users when automation encounters obstacles
    • Provide detailed reporting on automation performance
    • Suggest improvements based on execution patterns

Contributing

Contributions are welcome! This project has areas for improvement and we appreciate your help. Please feel free to submit a Pull Request or create an Issue if you find a bug.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Star History

Star History Chart

About

Cursor for Vibe Browsing. Bring Comet and Dia into your default browser for free!

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published