AI Pipe

AI Pipe lets you build web apps that can access LLM APIs (e.g. OpenRouter, OpenAI, Gemini) without a back-end.

An instance is hosted at https://aipipe.org/. You can host your own on Cloudflare. Licensed under MIT.

User Guide


AI Pipe Token

You can use the AI Pipe Token from aipipe.org/login in any OpenAI API-compatible application by setting:

  • OPENAI_API_KEY as your AI Pipe Token
  • OPENAI_BASE_URL as https://aipipe.org/openai/v1

For example:

export OPENAI_API_KEY=$AIPIPE_TOKEN
export OPENAI_BASE_URL=https://aipipe.org/openai/v1

Now you can run:

uvx openai api chat.completions.create -m gpt-4.1-nano -g user "Hello"

... or:

uvx llm 'Hello' -m gpt-4o-mini --key $AIPIPE_TOKEN

This will print something like "Hello! How can I assist you today?"

Developer Guide

Paste this code into index.html, open it in a browser, and check your DevTools Console:

<script type="module">
  import { getProfile } from "https://aipipe.org/aipipe.js";

  const { token, email } = getProfile();

  const response = await fetch("https://aipipe.org/openrouter/v1/chat/completions", {
    method: "POST",
    headers: { Authorization: `Bearer ${token}`, "Content-Type": "application/json" },
    body: JSON.stringify({
      model: "openai/gpt-4o-mini",
      messages: [{ role: "user", content: "What is 2 + 2?" }],
    }),
  }).then((r) => r.json());
  console.log(response);
</script>

This app will:

  1. Prompt the user for their AI Pipe token.
    • getProfile() prompts the user to enter their AI Pipe token if one is not already stored
    • It validates the token by decoding the JWT to extract the email
    • An invalid token prompts the user to enter a valid one
  2. Store the token for future use.
    • The token and email are stored in localStorage for subsequent visits
    • getProfile() returns the stored token and email from the validated JWT
  3. Make an LLM API call to OpenRouter or OpenAI and log the response.
    • You can replace any call to https://openrouter.ai/api/v1 with https://aipipe.org/openrouter/v1 and provide Authorization: Bearer $AIPIPE_TOKEN as a header.
    • Similarly, you can replace https://api.openai.com/v1 with https://aipipe.org/openai/v1 and provide the same header.
    • AI Pipe replaces the token and proxies the request to the provider.
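
To make the decoding step concrete, here is a minimal Node.js sketch of how a helper like getProfile() can read the email claim out of a token. This is an illustration, not AI Pipe's actual code: it only decodes the JWT payload and performs no signature verification (only the server, which holds AIPIPE_SECRET, can verify).

```javascript
// A JWT is three base64url segments; the middle one is the JSON payload.
function decodeJwtPayload(token) {
  const payloadB64 = token.split(".")[1];
  return JSON.parse(Buffer.from(payloadB64, "base64url").toString("utf8"));
}

// Hand-built, unsigned stand-in token for demonstration only:
const enc = (obj) => Buffer.from(JSON.stringify(obj)).toString("base64url");
const demoToken = [enc({ alg: "HS256", typ: "JWT" }), enc({ email: "user@example.com" }), "sig"].join(".");
console.log(decodeJwtPayload(demoToken).email); // user@example.com
```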

API

GET /usage: Returns usage data for the specified email and time period

Example: Get usage for a user

curl https://aipipe.org/usage -H "Authorization: $AIPIPE_TOKEN"

Response:

{
  "email": "[email protected]",
  "days": 7,
  "cost": 0.000137,
  "usage": [
    {
      "date": "2025-04-16",
      "cost": 0.000137
    }
  ],
  "limit": 0.1
}
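
A client can turn this response into a budget check by summing the per-day entries against limit. A small sketch (the helper name is ours, not part of AI Pipe):

```javascript
// Sum per-day costs from a /usage response and compare against the limit.
function remainingBudget({ usage, limit }) {
  const spent = usage.reduce((sum, day) => sum + day.cost, 0);
  return { spent, remaining: limit - spent, overLimit: spent >= limit };
}

const res = {
  email: "user@example.com",
  days: 7,
  cost: 0.000137,
  usage: [{ date: "2025-04-16", cost: 0.000137 }],
  limit: 0.1,
};
console.log(remainingBudget(res).overLimit); // false
```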

GET /proxy/[URL]: Proxies requests to the specified URL, bypassing CORS restrictions. No authentication required.

Example: Get contents of a URL

curl "https://aipipe.org/proxy/https://httpbin.org/get?x=1"

Response:

{
  "args": {
    "x": "1"
  },
  "headers": {
    "Accept": "*/*",
    "Host": "httpbin.org",
    "User-Agent": "curl/8.5.0"
  },
  "origin": "45.123.26.54",
  "url": "https://httpbin.org/get?x=1"
}

Notes:

  • The response includes the original URL in the X-Proxy-URL header
  • URLs must begin with http or https
  • Requests timeout after 30 seconds
  • All HTTP methods (GET, POST, etc.) and headers are preserved
  • CORS headers are added for browser compatibility
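
In front-end code, pointing a request at this endpoint is plain string concatenation. A hypothetical helper (our naming, not an AI Pipe API):

```javascript
// Prefix a target URL with the CORS-bypassing proxy endpoint.
function proxied(url, base = "https://aipipe.org") {
  if (!/^https?:\/\//.test(url)) throw new Error("URL must begin with http or https");
  return `${base}/proxy/${url}`;
}

console.log(proxied("https://httpbin.org/get?x=1"));
// https://aipipe.org/proxy/https://httpbin.org/get?x=1
```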

GET /token?credential=...: Converts a Google Sign-In credential into an AI Pipe token:

  • When a user clicks "Sign in with Google" on the login page, Google's client library returns a JWT credential
  • The login page sends this credential to /token?credential=...
  • AI Pipe verifies the credential using Google's public keys
  • If valid, AI Pipe signs a new token containing the user's email (and optional salt) using AIPIPE_SECRET
  • Returns: { token, email ... } where additional fields come from Google's profile

OpenRouter API

GET /openrouter/*: Proxies requests to OpenRouter

Example: List OpenRouter models

curl https://aipipe.org/openrouter/v1/models -H "Authorization: $AIPIPE_TOKEN"

Response:

{
  "data": [
    {
      "id": "google/gemini-2.5-pro-preview-03-25",
      "name": "Google: Gemini 2.5 Pro Preview",
      // ...
    },
  ],
}

Example: Make a chat completion request

curl https://aipipe.org/openrouter/v1/chat/completions -H "Authorization: $AIPIPE_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"model": "google/gemini-2.0-flash-lite-001", "messages": [{ "role": "user", "content": "What is 2 + 2?" }] }'

Response:

{
  "id": "gen-...",
  "provider": "Google",
  "model": "google/gemini-2.0-flash-lite-001",
  "object": "chat.completion",
  // ...
}

OpenAI API

GET /openai/*: Proxies requests to OpenAI

Example: List OpenAI models

curl https://aipipe.org/openai/v1/models -H "Authorization: $AIPIPE_TOKEN"

Response:

{
  "object": "list",
  "data": [
    {
      "id": "gpt-4o-audio-preview-2024-12-17",
      "object": "model",
      "created": 1734034239,
      "owned_by": "system",
    },
    // ...
  ],
}

Example: Make a responses request

curl https://aipipe.org/openai/v1/responses -H "Authorization: $AIPIPE_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4.1-nano", "input": "What is 2 + 2?" }'

Response:

{
  "id": "resp_...",
  "object": "response",
  "model": "gpt-4.1-nano-2025-04-14",
  // ...
  "output": [
    {
      "role": "assistant",
      "content": [{ "text": "2 + 2 equals 4." }],
      // ...
    },
  ],
}

Example: Create embeddings

curl https://aipipe.org/openai/v1/embeddings -H "Authorization: $AIPIPE_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"model": "text-embedding-3-small", "input": "What is 2 + 2?" }'

Response:

{
  "object": "list",
  "data": [
    {
      "object": "embedding",
      "index": 0,
      "embedding": [
        0.010576399, -0.037246477,
        // ...
      ],
    },
  ],
  "model": "text-embedding-3-small",
  "usage": {
    "prompt_tokens": 8,
    "total_tokens": 8,
  },
}

Gemini API

GET /geminiv1beta/*: Proxies requests to Google's Gemini API

Example: Make a generateContent request

curl https://aipipe.org/geminiv1beta/models/gemini-1.5-flash:generateContent -H "Authorization: $AIPIPE_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"contents":[{"parts":[{"text":"What is 2 + 2?"}]}]}'

Response:

{
  "candidates": [
    {
      "content": { "parts": [{ "text": "2 + 2 is 4." }] },
    },
  ],
  "modelVersion": "gemini-1.5-flash",
  "usageMetadata": { "promptTokenCount": 8, "candidatesTokenCount": 8, "totalTokenCount": 16 },
}

Example: Create embeddings

curl https://aipipe.org/geminiv1beta/models/gemini-embedding-001:embedContent -H "Authorization: $AIPIPE_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"model":"gemini-embedding-001","content":{"parts":[{"text":"What is 2 + 2?"}]}}'

Response:

{
  "embedding": { "values": [0.01, -0.02] },
  "usageMetadata": { "tokenCount": 8 },
}

Similarity API

POST /similarity: Calculates semantic similarity between documents and topics using embeddings.

Example: Calculate similarity between documents and topics

curl https://aipipe.org/similarity -H "Authorization: $AIPIPE_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "docs": ["The quick brown fox jumps over the lazy dog", "A fast orange fox leaps over a sleepy canine"],
    "topics": ["fox jumping", "dog sleeping"],
    "model": "text-embedding-3-small",
    "precision": 5
  }'

Response:

{
  "model": "text-embedding-3-small",
  "similarity": [
    [0.82345, 0.12345], // Similarity scores for first doc against each topic
    [0.81234, 0.23456], // Similarity scores for second doc against each topic
  ],
  "tokens": 42,
}

Example: Calculate similarity between all documents (self-similarity matrix)

curl https://aipipe.org/similarity -H "Authorization: $AIPIPE_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "docs": [
      "The quick brown fox jumps over the lazy dog",
      "A fast orange fox leaps over a sleepy canine",
      "The lazy dog sleeps while the fox jumps"
    ],
    "model": "text-embedding-3-small"
  }'

Response:

{
  "model": "text-embedding-3-small",
  "similarity": [
    [1.0, 0.82345, 0.71234], // First doc's similarity with all docs
    [0.82345, 1.0, 0.6789], // Second doc's similarity with all docs
    [0.71234, 0.6789, 1.0], // Third doc's similarity with all docs
  ],
  "tokens": 63,
}

Example: Using structured input format

curl https://aipipe.org/similarity -H "Authorization: $AIPIPE_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "docs": [
      { "type": "text", "value": "The quick brown fox jumps over the lazy dog" },
      { "type": "text", "value": "A fast orange fox leaps over a sleepy canine" }
    ],
    "topics": [
      { "type": "text", "value": "fox jumping" },
      { "type": "text", "value": "dog sleeping" }
    ]
  }'

Parameters:

  • docs: Array of strings or objects with {type, value}. Required.
  • topics: Optional array of strings or objects with {type, value}. If not provided, calculates similarity between all documents.
  • model: Optional embedding model name. Defaults to "text-embedding-3-small".
  • precision: Optional number of decimal places in similarity scores. Defaults to 5.
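
Conceptually, the endpoint embeds every doc and topic and then takes cosine similarities, rounded to precision decimal places. The underlying math, sketched locally with toy vectors (illustrative only; the service computes this server-side from real embeddings):

```javascript
// Cosine similarity between two vectors.
function cosine(a, b) {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / Math.sqrt(na * nb);
}

// Pairwise doc x topic matrix, rounded like the `precision` parameter.
function similarityMatrix(docVecs, topicVecs, precision = 5) {
  const round = (x) => Number(x.toFixed(precision));
  return docVecs.map((d) => topicVecs.map((t) => round(cosine(d, t))));
}

const docs = [[1, 0], [0.6, 0.8]];
const topics = [[1, 0], [0, 1]];
console.log(similarityMatrix(docs, topics)); // [ [ 1, 0 ], [ 0.6, 0.8 ] ]
```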

Admin Guide

To self-host AI Pipe:

  1. Clone and install:
git clone https://github.com/sanand0/aipipe.git
cd aipipe
npm install
  2. Configure budgets and security in src/config.js. For example:
// Set a budget limit for specific email IDs or domains
const budget = {
  "*": { limit: 0.1, days: 7 }, // Default fallback: low limits for unknown users. Use 0.001 to limit to free models.
  "[email protected]": { limit: 0, days: 1 }, // Blocked user: zero limit stops all operations
  "[email protected]": { limit: 10.0, days: 30 }, // Premium user with monthly high-volume allocation
  "@example.com": { limit: 1.0, days: 7 }, // Domain-wide policy: moderate weekly quota for organization
};

// If a user reports their key as stolen, add/change their salt to new random text.
// That will invalidate their token.
const salt = {
  "[email protected]": "random-text",
};
  3. Create .dev.vars (which is .gitignored) with your secrets:
# Required: Your JWT signing key
AIPIPE_SECRET=$(openssl rand -base64 12)

# Optional: add email IDs of admin users separated by comma and/or whitespace.
ADMIN_EMAILS="[email protected], [email protected], ..."

# Optional: Add only the APIs you need
OPENROUTER_API_KEY=sk-or-v1-...  # via openrouter.ai/settings
OPENAI_API_KEY=sk-...            # via platform.openai.com/api-keys
GEMINI_API_KEY=AI...             # via aistudio.google.com/app/apikey
  4. Test your deployment:

Ensure that .dev.vars has all keys set (including optional ones). Then run:

npm run dev   # Runs at http://localhost:8787
[email protected] npm test
curl http://localhost:8787/usage -H "Authorization: $AIPIPE_TOKEN"

Or run specific tests, e.g. only OpenAI tests, via:

npm test -- --grep 'OpenAI'
  5. Deploy to Cloudflare:
# Add secrets to production
npx wrangler secret put AIPIPE_SECRET
npx wrangler secret put ADMIN_EMAILS
npx wrangler secret put OPENROUTER_API_KEY
npx wrangler secret put OPENAI_API_KEY
npx wrangler secret put GEMINI_API_KEY

# Deploy
npm run deploy

# Test
BASE_URL=https://aipipe.org [email protected] npm test
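
The budget table in src/config.js keys on exact emails, @domain suffixes, and a "*" fallback. One plausible resolution order, sketched with hypothetical entries (our illustration, not the worker's actual lookup code):

```javascript
const budget = {
  "*": { limit: 0.1, days: 7 },                 // default fallback
  "blocked@example.com": { limit: 0, days: 1 }, // zero limit blocks the user
  "@example.org": { limit: 1.0, days: 7 },      // domain-wide policy
};

// Resolve the most specific entry: exact email, then domain, then "*".
function budgetFor(email) {
  if (budget[email]) return budget[email];
  const domain = email.slice(email.indexOf("@"));
  return budget[domain] ?? budget["*"];
}

console.log(budgetFor("anyone@example.org").limit); // 1
console.log(budgetFor("stranger@elsewhere.net").limit); // 0.1
```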

Admin API

GET /admin/usage: Returns historical usage for all users. Only for admins.

curl https://aipipe.org/admin/usage -H "Authorization: $AIPIPE_TOKEN"

Response:

{
  "data": [
    {
      "email": "[email protected]",
      "date": "2025-04-18",
      "cost": 25.5,
    },
    // ...
  ],
}

GET /admin/[email protected]: Generate a JWT token for any user. Only for admins.

curl "https://aipipe.org/admin/[email protected]" -H "Authorization: $AIPIPE_TOKEN"

Response:

{
  "token": "eyJhbGciOiJIUzI1NiI..."
}

POST /admin/cost: Overwrites the recorded cost for a user on a specific date. Only for admins.

curl https://aipipe.org/admin/cost -X POST -H "Authorization: $AIPIPE_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"email": "[email protected]", "date": "2025-04-18", "cost": 1.23}'

Response:

{
  "message": "Cost for [email protected] on 2025-04-18 set to 1.23"
}

Architecture

File Structure

  • src/worker.js: Main entry point. Handles authentication, proxying with streaming, cost tracking
  • src/providers.js: Defines parameters for each LLM provider, e.g. endpoints, API keys, cost calculation
  • src/cost.js: Tracks daily cost per user via Durable Objects
  • src/config.js: Configuration for budget limits by user/domain, token invalidation
  • src/utils.js: Utilities to manage headers, etc.

Database Schema

The cost table in Durable Objects stores:

CREATE TABLE cost (
  email TEXT,      -- User's email address
  date TEXT,       -- YYYY-MM-DD in UTC
  cost NUMBER,     -- Cumulative cost for the day
  PRIMARY KEY (email, date)
);
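
The primary key means every request folds into a running daily total per user (in SQL terms, an upsert that adds to cost). Sketched here against a plain Map instead of the actual Durable Object storage:

```javascript
// In-memory stand-in for the cost table: (email, date) -> cumulative cost.
const costs = new Map();

function addCost(email, date, cost) {
  const key = `${email}|${date}`;
  const total = (costs.get(key) ?? 0) + cost;
  costs.set(key, total);
  return total;
}

addCost("user@example.com", "2025-04-18", 0.25);
console.log(addCost("user@example.com", "2025-04-18", 0.5)); // 0.75
```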

Provider Interface

Each provider in providers.js implements:

{
  base: "https://api.provider.com",     // Base URL to proxy to
  key: "PROVIDER_API_KEY",             // Environment variable with API key
  cost: async ({ model, usage }) => {  // Calculate cost for a request
    return {
      cost: /* Calculate cost based on prompt & completion tokens */
    }
  }
}

Add new providers by implementing this interface and adding routing in worker.js.
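
For instance, a new provider entry might look like the following. The endpoint, env var name, and per-million-token prices are made up for illustration; real providers need real pricing tables:

```javascript
// Hypothetical per-1M-token prices for a made-up model.
const PRICES = { "demo-model": { prompt: 0.15, completion: 0.6 } };

const demoProvider = {
  base: "https://api.example.com", // where requests are proxied
  key: "DEMO_API_KEY",             // env var holding the upstream key
  cost: async ({ model, usage }) => {
    const p = PRICES[model];
    if (!p) return { cost: 0 }; // unknown model: nothing billed (a policy choice)
    const cost =
      (usage.prompt_tokens * p.prompt + usage.completion_tokens * p.completion) / 1e6;
    return { cost };
  },
};

demoProvider
  .cost({ model: "demo-model", usage: { prompt_tokens: 1000, completion_tokens: 500 } })
  .then(({ cost }) => console.log(cost)); // ≈ 0.00045
```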

Alternatives

AI Pipe is for light, widespread use, e.g. public demos and student assignments, where cost is low, frequency is low, and access is wide.

If you need production features, explore LLM Routers like:

  • litellm 21,852 ⭐ May 2025 (BerriAI). 100+ providers (OpenAI, Bedrock, Vertex, Groq, …). Auth: per-key, per-user, BYO provider keys, JWT or Basic for multi-tenant dashboards. Rate-limit: token/req budget per model/project, burst ceilings, fallback queue.
  • RouteLLM 3,886 ⭐ Aug 2024 (LM-Sys). Custom providers (template: OpenAI, Anyscale). Auth: BYO provider keys via env vars. Rate-limit: none (relies on upstream or external proxy).
  • helicone 3,715 ⭐ May 2025. 15+ providers (OpenAI, Anthropic, Bedrock, Groq, Gemini, …). Auth: Helicone org key + BYO provider keys. Rate-limit: soft limits via dashboard alerts, no enforced throttling (observability focus).
  • FastChat 38,506 ⭐ Apr 2025. Local/remote self-hosted models (e.g., Mixtral, Llama). Auth: Bearer key pass-through. Rate-limit: none (use external proxy).
  • apisix 15,076 ⭐ Apr 2025. 100+ providers via plugins (OpenAI, Claude, Gemini, Mistral, …). Auth: JWT, Key-Auth, OIDC, HMAC. Rate-limit: token/request per consumer/route, distributed leaky-bucket.
  • envoy 25,916 ⭐ May 2025. Provider-agnostic (define clusters manually). Auth: mTLS, API key, OIDC via filters. Rate-limit: global/local via Envoy's rate-limit service.
  • openllmetry 5,752 ⭐ Apr 2025. Configurable providers (OpenAI, Azure, Anthropic, local vLLM). Auth: OpenAI-style key, BYO keys. Rate-limit: Redis-backed token/RPS optional.
  • kong 40,746 ⭐ Apr 2025. Multi-provider via "ai-llm-route" plugin. Auth: Key-Auth, ACL, OIDC via plugins. Rate-limit: per-key, per-route, cost-aware token limits.
  • semantic-router 2,569 ⭐ Apr 2025 (experimental). Embedding-based routing within apps (no external provider integration). Auth: n/a. Rate-limit: n/a.
  • unify 298 ⭐ May 2025. Providers wrapped via LiteLLM. Auth: Unify project key, BYO provider keys. Rate-limit: soft budget alerts; no enforced throttling yet.
  • OpenRouter (SaaS). 300+ models, 30+ providers. Auth: OpenRouter key, OAuth2, BYO provider keys. Rate-limit: credit-based (1 req/credit/s, 20 rpm free tier), DDOS protection.
  • Portkey Gateway (SaaS). 250+ providers & guard-rail plugins. Auth: Portkey API key, BYO keys, OAuth for teams. Rate-limit: sliding-window tokens, cost caps, programmable policy engine.
  • Martian Model Router (SaaS, private). Dozens of commercial/open models (Accenture's "Switchboard"). Auth: Martian API key, BYO keys planned. Rate-limit: undisclosed; SLA-based dynamic throttling.

About

Gives anyone access to an OpenAI/OpenRouter API key free at 10 cents/week. Self-hostable. Useful as a backend if you're building pure front-end LLM apps
