AI Gateway — One API, Every Model

Supported Models

Access every major foundation model through one API. Switch providers in one line of code.

OpenAI

GPT-4o · GPT-4o-mini · o1 · o3-mini

Anthropic

Claude Opus 4.6 · Claude Sonnet 4.6 · Claude Haiku 4.5

Google

Gemini 2.5 Pro · Gemini 2.5 Flash · Gemma 3

Mistral

Mistral Large · Mistral Medium · Codestral

DeepSeek

DeepSeek-R1 · DeepSeek-V3 · DeepSeek-R1-Lite

Qwen

Qwen3 · Qwen3-Coder · QwQ-32B

Zhipu

GLM-4-Plus · GLM-4-Long · GLM-Z1

90+ More

Cohere · Yi · MiniMax · Moonshot · Baichuan · and more

One Endpoint. One SDK.

Drop-in compatible with OpenAI SDK. Switch models by changing a single parameter.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.your-gateway.com/v1",
    api_key="your-api-key"
)

# Use any model — just change the name
response = client.chat.completions.create(
    model="gpt-4o",        # or "claude-opus-4-6", "gemini-2.5-pro"
    messages=[{
        "role": "user",
        "content": "Hello, how are you?"
    }]
)

print(response.choices[0].message.content)

curl https://api.your-gateway.com/v1/chat/completions \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

import OpenAI from 'openai';

const client = new OpenAI({
  baseURL: 'https://api.your-gateway.com/v1',
  apiKey: 'your-api-key',
});

// Route to any model instantly
const response = await client.chat.completions.create({
  model: 'gpt-4o',
  messages: [{ role: 'user', content: 'Hello!' }],
});

console.log(response.choices[0].message.content);

Why AI Gateway

Enterprise-grade infrastructure for production AI applications.

Smart Routing

Automatically route requests to the best model based on cost, latency, and quality requirements.

Unified Auth

One API key to access all providers. Manage credentials, permissions, and budgets in one place.

Real-time Analytics

Track usage, costs, latency, and error rates across all models with granular dashboards.

Fallback & Retry

Automatic failover between providers. If one model is down, requests route to alternatives instantly.

Rate Limiting

Per-user, per-model rate limits with token bucket algorithms. Protect your budget and infrastructure.

Streaming First

Full SSE streaming support with consistent response format across all providers and models.

Simple Pricing

Pay only for what you use. No hidden fees. No minimums.

Free

$0/month

Up to 1M tokens/month
5 models
Basic analytics
Community support

Get Started

Pro

$29/month

Unlimited tokens
All 100+ models
Advanced analytics
Smart routing
Fallback & retry
Priority support

Get Started

Enterprise

Custom

Everything in Pro
SSO & SAML
Dedicated infrastructure
SLA guarantee
Custom integrations
Dedicated support

Contact Sales

Start building with AI Gateway today.

Get your API key in 30 seconds. No credit card required.

Get Started Free Read the docs →

One API.
All Models.

Supported Models

OpenAI

Anthropic

Google

Meta

Mistral

DeepSeek

Qwen

Zhipu

90+ More

One Endpoint. One SDK.

Why AI Gateway

Smart Routing

Unified Auth

Real-time Analytics

Fallback & Retry

Rate Limiting

Streaming First

Simple Pricing

Free

Pro

Enterprise

Start building with AI Gateway today.

One API.All Models.

Supported Models

OpenAI

Anthropic

Google

Meta

Mistral

DeepSeek

Qwen

Zhipu

90+ More

One Endpoint. One SDK.

Why AI Gateway

Smart Routing

Unified Auth

Real-time Analytics

Fallback & Retry

Rate Limiting

Streaming First

Simple Pricing

Free

Pro

Enterprise

Start building with AI Gateway today.

One API.
All Models.