This is a tutorial project from Boot.dev focused on building an AI agent from scratch using Google's Gemini API.
This project implements a functional AI coding agent that can autonomously perform file operations, execute Python scripts, and interact with a sandboxed working directory. The agent uses natural language understanding to interpret user requests and execute appropriate functions.
The agent uses Google's Gemini 2.0 Flash model with function declarations to understand user intent and automatically call the appropriate functions.
- **List Files and Directories** (`get_files_info`)
  - Lists contents of a directory with file sizes and type information
  - Constrained to the working directory for security
- **Read File Contents** (`get_file_content`)
  - Reads and returns file contents
  - Automatically truncates files larger than 10,000 characters
  - Works with files in subdirectories
- **Write/Overwrite Files** (`write_file`)
  - Creates new files or overwrites existing ones
  - Automatically creates parent directories if needed
  - Returns a confirmation with the character count
- **Execute Python Files** (`run_python_file`)
  - Runs Python scripts with optional command-line arguments
  - 30-second timeout for safety
  - Captures both stdout and stderr
  - Returns exit codes for error handling
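The execution feature above can be sketched with the standard library; this is a simplified version for illustration, and the actual `run_python_file` in this repo may differ:

```python
import os
import subprocess
import sys

def run_python_file(working_directory, file_path, args=None):
    """Run a Python script inside the working directory with a 30-second timeout.

    A simplified sketch; the repo's real run_python_file may differ.
    """
    abs_working = os.path.abspath(working_directory)
    abs_target = os.path.abspath(os.path.join(abs_working, file_path))

    # Refuse to touch anything outside the sandboxed working directory.
    if not abs_target.startswith(abs_working + os.sep):
        return f'Error: Cannot execute "{file_path}": outside the working directory'
    if not abs_target.endswith(".py"):
        return f'Error: "{file_path}" is not a Python file'

    try:
        completed = subprocess.run(
            [sys.executable, abs_target] + list(args or []),
            capture_output=True,
            text=True,
            timeout=30,          # 30-second safety timeout
            cwd=abs_working,
        )
    except subprocess.TimeoutExpired:
        return "Error: execution timed out after 30 seconds"

    # Surface stdout, stderr, and the exit code so the agent can react to errors.
    return (
        f"STDOUT: {completed.stdout}"
        f"STDERR: {completed.stderr}"
        f"Exit code: {completed.returncode}"
    )
```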
- **Sandboxed Working Directory**: All operations are restricted to the `./calculator` directory
- **Path Validation**: Security checks prevent access outside permitted directories
- **File Size Limits**: 10,000-character limit on file reads to prevent memory issues
- **Execution Timeout**: 30-second timeout on Python script execution
- **Input Validation**: Verifies file types and paths before operations
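The path-validation guardrail boils down to resolving the requested path and checking that it stays under the sandbox root. A minimal sketch of the idea (the helper name is illustrative, not taken from the repo):

```python
import os

def validate_path(working_directory, file_path):
    """Return the absolute target path if it stays inside the sandbox, else None.

    Illustrative sketch of the path-validation guardrail, not the repo's exact code.
    """
    abs_working = os.path.abspath(working_directory)
    abs_target = os.path.abspath(os.path.join(abs_working, file_path))
    # commonpath catches "../" traversal as well as absolute paths that escape.
    if os.path.commonpath([abs_working, abs_target]) != abs_working:
        return None
    return abs_target
```

Resolving with `os.path.abspath` before the check matters: a naive string comparison on the raw input would miss `..` segments.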
Every interaction with the AI agent is automatically logged in timestamped session folders:
```
data/
└── session_20251102_074629/
    ├── user_prompt.txt          # Original user request
    ├── function_call_1.txt      # First function call details
    ├── function_call_2.txt      # Second function call (if any)
    └── ai_response_summary.txt  # Summary of all operations
```
For text responses:
```
data/
└── session_20251102_074733/
    ├── user_prompt.txt          # Original user request
    └── ai_response.txt          # AI's text response
```
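Session folders of this shape need only a few lines of stdlib code. A sketch, assuming logs are plain text files (the repo's actual logging code may differ):

```python
import os
from datetime import datetime

def create_session_dir(base="data"):
    """Create a timestamped session folder like data/session_20251102_074629/."""
    stamp = datetime.now().strftime("%Y%m%d_%H%M%S")
    path = os.path.join(base, f"session_{stamp}")
    os.makedirs(path, exist_ok=True)
    return path

def log_text(session_dir, filename, text):
    """Write one log entry (e.g. user_prompt.txt) into the session folder."""
    with open(os.path.join(session_dir, filename), "w", encoding="utf-8") as f:
        f.write(text)
```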
- Python 3.x
- Google Gemini API (`gemini-2.0-flash-001`)
- `google-genai` library
- `python-dotenv` for environment variable management
```
tutorial.build-ai-agent/
├── main.py                  # Main AI agent script
├── functions/
│   ├── config.py            # Configuration constants
│   ├── get_files_info.py    # Directory listing function + schema
│   ├── get_file_content.py  # File reading function + schema
│   ├── write_file.py        # File writing function + schema
│   └── run_python_file.py   # Python execution function + schema
├── calculator/              # Sandboxed working directory
│   ├── main.py              # Calculator application
│   ├── tests.py             # Unit tests
│   └── pkg/
│       ├── calculator.py    # Calculator logic
│       └── render.py        # Output formatting
└── data/                    # Session logs
    └── session_[timestamp]/ # Individual session folders
```
```
uv run main.py "your prompt here"
uv run main.py "your prompt here" --verbose
```

Verbose mode shows:
- Detailed function call information (name and arguments)
- Function results
- Token usage statistics (prompt and response tokens)
**List Files:**
```
uv run main.py "list files in the root directory"
uv run main.py "what files are in the pkg directory?"
```

**Read Files:**
```
uv run main.py "read the contents of main.py"
uv run main.py "show me calculator/pkg/calculator.py"
```

**Write Files:**
```
uv run main.py "create a file called test.txt with content 'Hello World'"
uv run main.py "write 'def hello(): print(\"Hi\")' to utils.py"
```

**Execute Python:**
```
uv run main.py "run the tests.py file"
uv run main.py "run main.py with arguments 10 + 5"
```

**Complex Operations:**
```
uv run main.py "create a file hello.py that prints 'Hello World' and then run it"
```

- Install dependencies:
  ```
  uv sync
  ```
- Create a `.env` file with your Gemini API key:
  ```
  GEMINI_API_KEY=your_api_key_here
  ```
- Run the agent:
  ```
  uv run main.py "your request"
  ```

Each function has a schema that describes it to the LLM:
```python
schema_get_files_info = types.FunctionDeclaration(
    name="get_files_info",
    description="Lists files in the specified directory...",
    parameters=types.Schema(
        type=types.Type.OBJECT,
        properties={
            "directory": types.Schema(
                type=types.Type.STRING,
                description="The directory to list files from..."
            ),
        },
    ),
)
```

- User provides a natural language prompt
- Gemini model analyzes the request
- AI decides which function(s) to call
- Agent executes functions with security checks
- Results are returned to user and logged
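The "agent executes functions" step typically reduces to a name-to-callable dispatch table keyed by the names declared in the schemas. A minimal sketch (the function names come from this project; the dispatch code and stand-in body are illustrative):

```python
def get_files_info(directory="."):
    # Stand-in for the real implementation in functions/get_files_info.py.
    return f"listing of {directory}"

# Name-to-callable dispatch table; one entry per declared function schema.
FUNCTION_MAP = {
    "get_files_info": get_files_info,
    # "get_file_content": ..., "write_file": ..., "run_python_file": ...
}

def call_function(name, arguments):
    """Execute the function the model requested, or report an unknown name."""
    func = FUNCTION_MAP.get(name)
    if func is None:
        return f"Error: unknown function {name}"
    return func(**arguments)
```

Returning an error string (rather than raising) lets the agent feed the failure back to the model so it can try a different call.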
The agent is guided by a system prompt that defines its role and capabilities:
```
You are a helpful AI coding agent.

When a user asks a question or makes a request, make a function call plan.
You can perform the following operations:

- List files and directories
- Read file contents
- Execute Python files with optional arguments
- Write or overwrite files

All paths you provide should be relative to the working directory.
```
- Function Calling with LLMs: How to declare functions that AI can understand and call
- Security in AI Agents: Implementing guardrails to prevent unauthorized access
- Tool Integration: Connecting LLMs to real-world capabilities
- Error Handling: Graceful error management in AI systems
- Session Management: Tracking and logging AI interactions
- Prompt Engineering: Crafting system prompts for desired behavior
This is a learning project. Do not:
- Use it on production systems
- Give it access to sensitive directories
- Share it with untrusted users without additional security measures
For production use, implement additional security layers:
- Container isolation (Docker)
- Resource limits
- Code review before execution
- User authentication
- Audit logging
- Rate limiting
- The agent uses the `gemini-2.0-flash-001` model
- The working directory is hardcoded to `./calculator`
- Function results are wrapped in `types.Content` with `from_function_response`
- All file operations use UTF-8 encoding
- Session logs include timestamps for easy tracking
Potential improvements:
- Multi-turn conversations with context
- Web search integration
- Git operations support
- Database query capabilities
- File upload/download
- Interactive mode
- Configuration file for working directory
- Multi-language support beyond Python