This project demonstrates how to build AI-powered agents using the Agent Development Kit (ADK) with the Gemini API and other LLM APIs.
- Modular multi-agent architecture
- LLM-powered agents
- Visual content enhancement
- Auto-formatting with narration, colors, and transitions
- Optional looping agent for refinement
- Built with Google ADK, Gemini API and DeepSeek API
Parent Agent (LLM Chatbot)
│
├── 📝 Script Writer Agent
│ └─ Uses Gemini API to fetch & generate content
│
├── 👁️ Visualizer Agent
│ └─ Enhances script with visual cues and layout instructions
│
├── 🎨 Formatter Agent
│ └─ Formats the final output with narration, styles, timing
│
└── 🔁 (Optional) LoopAgent
  └─ Repeats the workflow for improved results
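The data flow in the tree above can be sketched in plain Python. This is only an illustration of the idea, not ADK code: each agent is modeled as a function that reads from and writes to a shared state dictionary, and the optional loop simply reruns the chain. The key `script_draft_output` is taken from the agent configuration in this README; `visual_script`, `final_output`, `run_pipeline`, and the placeholder strings are hypothetical names chosen for the example, and no LLM is actually called.

```python
from typing import Callable, Dict, List

State = Dict[str, str]
Agent = Callable[[State], None]


def script_writer(state: State) -> None:
    # Stand-in for the Gemini-backed drafting step (no API call here).
    state["script_draft_output"] = f"Draft script about {state['topic']}"


def visualizer(state: State) -> None:
    # Adds a visual cue to the draft; the cue text is a placeholder.
    state["visual_script"] = state["script_draft_output"] + " [visual: title card]"


def formatter(state: State) -> None:
    # Wraps the enhanced script in a narration label.
    state["final_output"] = "NARRATION: " + state["visual_script"]


def run_pipeline(topic: str, agents: List[Agent], loops: int = 1) -> State:
    """Run the agents in order, repeating the whole chain `loops` times."""
    state: State = {"topic": topic}
    for _ in range(loops):
        for agent in agents:
            agent(state)
    return state


result = run_pipeline("solar power", [script_writer, visualizer, formatter])
print(result["final_output"])
```

Passing `loops=2` mimics the optional looping agent. With these deterministic placeholders the output does not change between passes; a real refinement loop would feed the previous result back into the model.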
name: script_writer_agent
model: gemini-1.5-flash
instruction: generate_script_steps.yaml
description: Collects topic-related data and drafts a video script.
output_key: script_draft_output
steps:
- Search Google using Gemini API for topic keyword
- Collect 5–7 unique content pieces
- Remove duplicates and unrelated data
- Save result to output key

- Google ADK – Agent framework
- Gemini API – LLM for content generation
- Instruction Files (YAML) – Agent guidance
- Session & Memory – Persistent agent communication
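The script writer's cleanup steps listed earlier (collect 5–7 unique pieces, remove duplicates and unrelated data, save to the output key) could look roughly like the sketch below. It assumes the content pieces have already been fetched as plain strings, treats "unrelated" as "does not mention the keyword" (a deliberately crude rule), and uses an ordinary dictionary in place of ADK session state. The function name `clean_pieces` and the sample data are hypothetical.

```python
from typing import Dict, List


def clean_pieces(pieces: List[str], keyword: str, limit: int = 7) -> List[str]:
    """Keep up to `limit` unique pieces that mention `keyword`."""
    seen = set()
    kept: List[str] = []
    for piece in pieces:
        # Normalize case and whitespace so near-identical copies match.
        normalized = " ".join(piece.lower().split())
        if keyword.lower() not in normalized:
            continue  # assumed relevance rule: the keyword must appear
        if normalized in seen:
            continue  # duplicate of a piece already kept
        seen.add(normalized)
        kept.append(piece.strip())
        if len(kept) == limit:
            break
    return kept


raw = [
    "Solar panels convert light to power.",
    "solar panels  convert light to power.",
    "Wind turbines spin.",
    "Solar farms need land.",
]

# A plain dict stands in for the shared session state.
state: Dict[str, List[str]] = {}
state["script_draft_output"] = clean_pieces(raw, "solar")
print(state["script_draft_output"])
```

Writing the result under the agent's output key is what lets a downstream agent, such as the visualizer, pick it up from the shared state.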
⚠️ This is a conceptual/experimental project. Setup instructions below are generalized.
- Clone the repo:
    git clone https://github.com/kuahoong/adk.git
- Install dependencies. Make sure Python and the required SDKs are installed:
    pip install google-adk
- Configure the agents. Review the agent files and edit them if any changes are needed.
- Run the parent agent:
    adk web
- Automated content pipelines
- Marketing video scripting
- Explainers and educational video drafts
- Creative prototyping with AI