Apex AI Proxy

A powerful Cloudflare Workers-based AI service aggregator that provides unified API endpoints compatible with both OpenAI and Anthropic formats. Built on top of Cloudflare AI Gateway for enhanced reliability, monitoring, and cost optimization.

Features

Unified API Interface: Single endpoint supporting both OpenAI and Anthropic API formats
Multi-Provider Support: Seamlessly integrate with multiple AI providers through Cloudflare AI Gateway
Streaming Support: Real-time streaming responses for both API formats
CORS Enabled: Ready for web applications with proper CORS handling
Type Safety: Full TypeScript implementation with comprehensive type definitions
Tool/Function Calling: Support for function calling in both OpenAI and Anthropic formats
Azure OpenAI Integration: Native support for Azure OpenAI deployments
Request Validation: Robust input validation and error handling
Image Support: Full support for vision models and image analysis
BYOK Integration: Secure API key storage using Cloudflare's BYOK feature

Supported Providers

Via Cloudflare AI Gateway:

OpenAI (GPT-4, GPT-3.5, GPT-4V)
Anthropic (Claude 3.5, Claude 3 Haiku/Sonnet/Opus)
Google AI Studio (Gemini Pro, Gemini Flash)
Groq (Mixtral, Llama models)
Mistral (Mistral Large, Codestral)
Grok (xAI models)
DeepSeek (DeepSeek Coder, DeepSeek Chat)
Cerebras (Llama models)
Perplexity AI (Sonar models)
Azure OpenAI
Cohere
Workers AI

Quick Deploy

Prerequisites

Cloudflare Account: Sign up at Cloudflare
AI Gateway Setup: Follow the AI Gateway Getting Started Guide
Provider API Keys: Obtain API keys from your chosen AI providers
Node.js: Version 16 or higher
pnpm: Package manager (or npm/yarn)

Setup Instructions

1. Clone and Install

git clone https://github.com/loadchange/apex-ai-proxy.git
cd apex-ai-proxy
pnpm install

2. Configure Cloudflare AI Gateway

Step 1: Create AI Gateway

Navigate to your Cloudflare Dashboard
Go to AI → AI Gateway
Select Create Gateway
Enter your Gateway name (e.g., apex-ai-proxy)
Select Create
Note your Account ID and Gateway ID from the gateway settings

Step 2: Enable Authentication (Recommended)

In your AI Gateway dashboard, go to Settings
Enable Authentication to secure your gateway
Generate a Gateway Token for API access
Store this token securely - you'll need it for deployment

3. Configure Provider API Keys

You have three options for provider authentication:

Option A: BYOK (Bring Your Own Keys) - Recommended

Store your API keys securely in Cloudflare:

In your AI Gateway dashboard, go to Provider Keys section
Click Add API Key
Select your AI provider from the dropdown
Enter your API key and optionally provide a description
Click Save
Repeat for all providers you want to use

Benefits of BYOK:

Secure storage with Cloudflare Secrets Store
Easy key rotation without code changes
Enhanced security - keys never exposed in requests
Rate limiting and budget controls per provider

Option B: Request Headers

Include provider API keys in request headers (traditional approach):

curl -H "Authorization: Bearer YOUR_PROVIDER_API_KEY" \
     -H "cf-aig-authorization: Bearer YOUR_GATEWAY_TOKEN"

Option C: Unified Billing

Use Cloudflare's billing system (where available) without managing individual provider keys.

4. Update Configuration

Edit wrangler.jsonc:

{
  "$schema": "node_modules/wrangler/config-schema.json",
  "name": "apex-ai-proxy",
  "main": "src/index.ts",
  "compatibility_date": "2024-09-01",
  "vars": {
    "ACCOUNT_ID": "your-cloudflare-account-id",
    "GATEWAY_ID": "your-gateway-id",
    "AZURE_RESOURCE": "your-azure-resource-name", // Optional: for Azure OpenAI
    "AZURE_API_VERSION": "2024-02-01"              // Optional: for Azure OpenAI
  }
}

How to find your IDs:

Account ID: Available in the right sidebar of any Cloudflare dashboard page
Gateway ID: Found in your AI Gateway settings page URL or dashboard

5. Set Up Authentication

Create a secret for your gateway token:

# Set your gateway authentication token
wrangler secret put GatewayToken
# Enter your Gateway Token when prompted (from step 2)

6. Deploy

# Deploy to Cloudflare Workers
pnpm run deploy

After deployment, your worker will be available at: https://your-worker-name.your-subdomain.workers.dev

API Usage

Model Specification Format

Important: This proxy uses the format model#provider (note the # separator):

OpenAI: gpt-4#openai, gpt-3.5-turbo#openai
Anthropic: claude-3-5-haiku-20241022#anthropic, claude-3-5-sonnet-20241022#anthropic
Google: gemini-2.0-flash#google-ai-studio
Groq: mixtral-8x7b-32768#groq
Mistral: mistral-large-latest#mistral
Azure OpenAI: gpt-4#azure-openai

OpenAI Compatible Endpoint

curl -X POST "https://your-worker.your-subdomain.workers.dev/v1/chat/completions" \
  -H "Authorization: Bearer your-gateway-token" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-3-5-haiku-20241022#anthropic",
    "messages": [
      {
        "role": "user",
        "content": "Hello, world!"
      }
    ],
    "stream": true
  }'

Anthropic Compatible Endpoint

curl -X POST "https://your-worker.your-subdomain.workers.dev/v1/messages" \
  -H "Authorization: Bearer your-gateway-token" \
  -H "Content-Type: application/json" \
  -H "anthropic-version: 2023-06-01" \
  -d '{
    "model": "claude-3-5-sonnet-20241022#anthropic",
    "max_tokens": 1000,
    "messages": [
      {
        "role": "user",
        "content": "Hello, world!"
      }
    ],
    "stream": true
  }'

Image Analysis Support

Both endpoints support vision models for image analysis:

curl -X POST "https://your-worker.your-subdomain.workers.dev/v1/chat/completions" \
  -H "Authorization: Bearer your-gateway-token" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4-vision-preview#openai",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "What do you see in this image?"
          },
          {
            "type": "image_url",
            "image_url": {
              "url": "data:image/jpeg;base64,/9j/4AAQSkZJRgABAQAAAQ..."
            }
          }
        ]
      }
    ]
  }'

Client Integration Examples

OpenAI Python Client

from openai import OpenAI

client = OpenAI(
    api_key="your-gateway-token",
    base_url="https://your-worker.your-subdomain.workers.dev/v1"
)

# Use Claude via OpenAI client
response = client.chat.completions.create(
    model="claude-3-5-haiku-20241022#anthropic",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

Anthropic Python Client

from anthropic import Anthropic

client = Anthropic(
    api_key="your-gateway-token",
    base_url="https://your-worker.your-subdomain.workers.dev"
)

response = client.messages.create(
    model="claude-3-5-sonnet-20241022#anthropic",
    max_tokens=1000,
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

Cursor/VS Code Integration

Configure your IDE to use the adapter:

Cursor Settings (Settings → Cursor Settings):

{
  "cursor.general.apiKey": "your-gateway-token",
  "cursor.general.baseURL": "https://your-worker.your-subdomain.workers.dev/v1",
  "cursor.general.model": "claude-3-5-haiku-20241022#anthropic"
}

VS Code with Continue Extension:

{
  "models": [
    {
      "title": "Claude 3.5 Haiku",
      "provider": "openai",
      "model": "claude-3-5-haiku-20241022#anthropic",
      "apiKey": "your-gateway-token",
      "apiBase": "https://your-worker.your-subdomain.workers.dev/v1"
    }
  ]
}

Environment Variables

Variable	Description	Required	Example
`ACCOUNT_ID`	Your Cloudflare Account ID	✅	`1234567890abcdef1234567890abcdef`
`GATEWAY_ID`	Your AI Gateway ID	✅	`my-gateway`
`AZURE_RESOURCE`	Azure OpenAI resource name	❌	`my-azure-openai`
`AZURE_API_VERSION`	Azure OpenAI API version	❌	`2024-02-01`

Secrets

Secret	Description	Required	How to Set
`GatewayToken`	Authentication token for API access	✅	`wrangler secret put GatewayToken`

Development

Local Development

# Start local development server
pnpm run dev

# Your proxy will be available at http://localhost:8787

Testing

# Run tests
pnpm run test

# Test with specific provider
curl -X POST "http://localhost:8787/v1/chat/completions" \
  -H "Authorization: Bearer test-token" \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-3.5-turbo#openai", "messages": [{"role": "user", "content": "test"}]}'

Type Generation

# Generate Cloudflare Workers types
pnpm run cf-typegen

Architecture

┌─────────────────┐ ┌──────────────────┐ ┌─────────────────┐ │ AI Clients │───▶│ AI Gateway │───▶│ Cloudflare AI │ │ │ │ Adapter │ │ Gateway │ │ • Codex │ │ (Workers) │ │ │ │ • Claude Code │ │ │ │ • Rate Limiting │ │ │ │ • Protocol Conv. │ │ • Monitoring │ │ │ │ • Validation │ │ • Cost Control │ │ │ │ • Error Handling │ │ • BYOK Storage │ │ │ │ • Image Support │ │ • Caching │ └─────────────────┘ └──────────────────┘ └─────────────────┘

                                                        │
                                                        ▼
                                                ┌─────────────────┐
                                                │  AI Providers   │
                                                │                 │
                                                │ • OpenAI        │
                                                │ • Anthropic     │
                                                │ • Google        │
                                                │ • Groq          │
                                                │ • Mistral       │
                                                │ • Others        │
                                                └─────────────────┘

Key Features

Format Conversion

The proxy automatically converts between OpenAI and Anthropic message formats:

System Messages: Properly handled for each provider's requirements
Tool Calls: Function calling support across both formats
Streaming: Real-time streaming with proper Server-Sent Events formatting
Error Handling: Consistent error responses across providers
Image Content: Full support for vision models and multi-modal inputs

Request Validation

Input validation for both API formats
Model name validation with model#provider format
Authentication verification via Gateway Token
CORS handling for web applications
Content-type validation and parsing

Provider Integration

Dynamic provider routing based on model specification
Support for Cloudflare AI Gateway's unified endpoints
Azure OpenAI integration with deployment management
Automatic error handling and fallback responses
Special handling for provider-specific features (e.g., Mistral streaming options)

Monitoring and Analytics

Leverage Cloudflare AI Gateway's built-in monitoring:

Request Analytics: Track usage patterns, costs, and performance metrics
Rate Limiting: Configure per-provider and per-user limits
Caching: Reduce costs with intelligent response caching
Logs: Detailed request and response logging with full audit trail
Cost Tracking: Monitor spending across all providers
Error Analytics: Track error rates and failure patterns

Access analytics at: https://dash.cloudflare.com/[account-id]/ai/ai-gateway/[gateway-id]

Security Best Practices

Use BYOK: Store API keys securely in Cloudflare rather than in code
Enable Gateway Authentication: Protect your proxy with authentication tokens
Set Rate Limits: Configure appropriate rate limits to prevent abuse
Monitor Usage: Regularly review analytics for unusual patterns
Rotate Keys: Periodically rotate both gateway tokens and provider API keys
Use HTTPS: Always use HTTPS endpoints in production

Troubleshooting

Common Issues

Invalid Model Format: Ensure you're using model#provider format (not provider/model)
Authentication Errors: Verify your Gateway Token is set correctly as a secret
Provider Errors: Check that your provider API keys are valid and have sufficient credits
CORS Issues: Ensure your client is sending proper headers for cross-origin requests

Debug Mode

Enable debug logging by setting the DEBUG environment variable:

wrangler secret put DEBUG
# Enter "true" when prompted

Contributing

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

License

This project is licensed under the MIT License. See the LICENSE file for details.

Support

Documentation: Cloudflare AI Gateway Docs
Issues: GitHub Issues
Discussions: GitHub Discussions

Acknowledgments

Cloudflare Workers for the serverless platform
Cloudflare AI Gateway for AI provider management
All the AI providers for their excellent APIs

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
src		src
.editorconfig		.editorconfig
.gitignore		.gitignore
.prettierrc.js		.prettierrc.js
LICENSE		LICENSE
README.md		README.md
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
tsconfig.json		tsconfig.json
vitest.config.mts		vitest.config.mts
worker-configuration.d.ts		worker-configuration.d.ts
wrangler.jsonc		wrangler.jsonc

License

loadchange/apex-ai-proxy

Folders and files

Latest commit

History

Repository files navigation

Apex AI Proxy

Features

Supported Providers

Quick Deploy

Prerequisites

Setup Instructions

1. Clone and Install

2. Configure Cloudflare AI Gateway

Step 1: Create AI Gateway

Step 2: Enable Authentication (Recommended)

3. Configure Provider API Keys

Option A: BYOK (Bring Your Own Keys) - Recommended

Option B: Request Headers

Option C: Unified Billing

4. Update Configuration

5. Set Up Authentication

6. Deploy

API Usage

Model Specification Format

OpenAI Compatible Endpoint

Anthropic Compatible Endpoint

Image Analysis Support

Client Integration Examples

OpenAI Python Client

Anthropic Python Client

Cursor/VS Code Integration

Environment Variables

Secrets

Development

Local Development

Testing

Type Generation

Architecture

Key Features

Format Conversion

Request Validation

Provider Integration

Monitoring and Analytics

Security Best Practices

Troubleshooting

Common Issues

Debug Mode

Contributing

License

Support

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages