DATAGEN is an advanced document analysis platform powered by AI agents. It allows users to upload documents for AI-powered analysis and visualization.
- Backend: Python-based multi-agent system using LangChain and LangGraph with OpenAI LLMs
- API Server: FastAPI server that bridges the frontend with the backend agent system
- Frontend: Next.js application with TypeScript, shadcn/UI components, and built-in API routes
- Python 3.10+
- Node.js 18+
- npm or pnpm
-
Clone the repository:
git clone https://github.com/karlotimmerman/DATAGEN.git cd DATAGEN
-
Install backend dependencies:
pip install -r requirements.txt
-
Install frontend dependencies:
cd frontend npm install # or if using pnpm pnpm install
-
Start the FastAPI backend server:
python api_server.py
The API server will be available at http://localhost:8000
-
Start the frontend development server:
cd frontend npm run dev # or if using pnpm pnpm dev
The frontend will be available at http://localhost:3000
- Open your browser and navigate to http://localhost:3000
- Upload documents for analysis
- Provide analysis instructions
- View real-time analysis progress and results
- Document upload and processing
- Real-time analysis progress tracking
- Interactive visualizations
- Code generation for data analysis
- Comprehensive analysis reports
NEXT_PUBLIC_API_BASE_URL=http://localhost:8000
NEXT_PUBLIC_SOCKET_URL=ws://localhost:8000
Configure your OpenAI API key and other settings as needed.
/frontend
- Next.js frontend application/api_server.py
- FastAPI server/main.py
- Multi-agent system implementation
-
Build the frontend:
cd frontend npm run build # or if using pnpm pnpm build
-
For production deployment, consider using:
- Docker containers
- Nginx as a reverse proxy
- PM2 for process management