Build, deploy, and scale production-ready AI agents
Features • Quick Start • Architecture • Deployment • Contributing
DATA is a production-grade LLM application development platform designed for teams building AI-powered products. From simple chatbots to complex multi-agent workflows, DATA provides the visual tools, infrastructure, and runtime to go from prototype to production in hours, not weeks.
- No-Code to Pro-Code — Design workflows visually, extend with code when you need more power
- Production-Ready — Built-in observability, error handling, and performance monitoring
- Model Agnostic — 100+ LLM providers, embed, and rerank models out of the box
- Enterprise Secure — RBAC, SSO, audit logs, and data isolation
| Feature | Description |
|---|---|
| Visual Workflow Builder | Drag-and-drop pipeline design with real-time preview |
| Multi-Agent Orchestration | Chain, route, and parallel-execute AI agents |
| RAG Pipeline | Built-in document ingestion, chunking, embedding, and retrieval |
| 50+ Built-in Tools | Web search, code execution, image generation, API connectors |
| Plugin Ecosystem | Install community plugins or build your own |
| Human-in-the-Loop | Pause workflows for human approval or input |
| Auto-Scaling | Horizontal scaling for production workloads |
- 100+ Providers: OpenAI, Anthropic, Google, AWS Bedrock, Azure, local LLMs (Ollama/Llama.cpp)
- Model Fallbacks: Define fallback chains for reliability
- Load Balancing: Distribute requests across model instances
- Usage Tracking: Monitor token consumption and costs
- Real-time Monitoring: Live workflow execution traces
- Logging: Structured logs with search and filtering
- Analytics: Usage patterns, response times, error rates
- Integration: OpenTelemetry, Langfuse, custom exporters
- REST API: Full API access for custom integrations
- SDKs: Python, Node.js, and CLI toolkit
- DSL Export: Version-controlled app definitions
- CI/CD Ready: Docker-based deployment pipeline
┌─────────────────────────────────────────────────────────┐
│ Web Frontend (Next.js) │
│ ┌──────────┐ ┌──────────┐ ┌──────────┐ ┌──────────┐ │
│ │ Workflow │ │ Agent │ │ Dataset │ │ Settings │ │
│ │ Studio │ │ Config │ │ Manager │ │ Page │ │
│ └──────────┘ └──────────┘ └──────────┘ └──────────┘ │
└──────────────────────┬──────────────────────────────────┘
│ HTTP/WebSocket
┌──────────────────────▼──────────────────────────────────┐
│ API Server (FastAPI) │
│ ┌─────────────┐ ┌────────────┐ ┌──────────────────┐ │
│ │ Workflow │ │ Agent │ │ Model Runtime │ │
│ │ Engine │ │ Runner │ │ Manager │ │
│ ├─────────────┤ ├────────────┤ ├──────────────────┤ │
│ │ RAG Service │ │ Tool System│ │ Plugin Registry │ │
│ └─────────────┘ └────────────┘ └──────────────────┘ │
└──────────────────────┬──────────────────────────────────┘
│
┌──────────────────────▼──────────────────────────────────┐
│ Data & Cache Layer │
│ ┌────────────┐ ┌────────────┐ ┌──────────────────┐ │
│ │ PostgreSQL │ │ Redis │ │ Vector DB │ │
│ │ │ │ │ │ (multi-engine) │ │
│ └────────────┘ └────────────┘ └──────────────────┘ │
└──────────────────────────────────────────────────────────┘
User Request → API Gateway → Auth Check → Workflow Engine
↓
Agent Selection → Context Building → LLM Invocation
↓
Tools Execution ← RAG Retrieval ← Knowledge Base
↓
Response Assembly → Streaming → User
- Docker & Docker Compose
- 4GB+ RAM recommended
# Clone the repository
git clone https://github.com/your-org/data-agent
cd data-agent
# Start all services
cd docker
docker compose up -d
# Access the platform
# Web UI: http://localhost:3000
# API: http://localhost:5001
# Swagger: http://localhost:5001/docs# Backend
cd api
cp .env.example .env
uv sync
uv run flask db upgrade
uv run python app.py
# Frontend
cd web
cp .env.example .env.local
pnpm install
pnpm dev| Variable | Default | Description |
|---|---|---|
DATA_BIND_ADDRESS |
0.0.0.0 |
API server bind address |
DATA_PORT |
5001 |
API server port |
SECRET_KEY |
(required) | App encryption key |
DB_USERNAME |
data |
PostgreSQL username |
DB_PASSWORD |
data123456 |
PostgreSQL password |
DB_HOST |
localhost |
PostgreSQL host |
DB_PORT |
5432 |
PostgreSQL port |
REDIS_HOST |
localhost |
Redis host |
REDIS_PORT |
6379 |
Redis port |
- User Guide — Getting started, tutorials, best practices
- API Reference — Full REST API documentation
- Plugin Development — Building custom plugins
- Contributor Guide — How to contribute
DATA supports a rich plugin ecosystem:
| Type | Examples |
|---|---|
| Model Providers | OpenAI, Anthropic, Ollama, AWS Bedrock, Azure |
| Tools | Web search, Calculator, Code execution, Image gen |
| Vector Stores | Qdrant, Milvus, Weaviate, Pinecone |
| Document Loaders | PDF, HTML, Notion, Confluence, S3 |
| Observability | Langfuse, OpenTelemetry, Sentry |
# Backend tests
cd api
uv run pytest tests/unit_tests/ -v
# Frontend tests
cd web
pnpm test
# E2E tests
pnpm test:e2eWe welcome contributions! See our Contributing Guide for details.
- Fork the repo
- Create your feature branch (
git checkout -b feature/amazing) - Commit changes (
git commit -m 'Add amazing feature') - Push to branch (
git push origin feature/amazing) - Open a Pull Request
Please read our Code of Conduct.
This project is licensed under the Apache License 2.0 — see the LICENSE file for details.