
One API.
All Models.

Route requests to GPT-4o, Claude, Gemini, Llama, and the rest of a 100+ model catalog through a single unified endpoint.

100+ Models
99.9% Uptime
<50ms Latency

Supported Models

Access every major foundation model through one API. Switch providers in one line of code.

OpenAI

GPT-4o · GPT-4o-mini · o1 · o3-mini

Anthropic

Claude Opus 4.6 · Claude Sonnet 4.6 · Claude Haiku 4.5

Google

Gemini 2.5 Pro · Gemini 2.5 Flash · Gemma 3

Meta

Llama 4 · Llama 4 Maverick · Llama 3.3 70B

Mistral

Mistral Large · Mistral Medium · Codestral

DeepSeek

DeepSeek-R1 · DeepSeek-V3 · DeepSeek-R1-Lite

Qwen

Qwen3 · Qwen3-Coder · QwQ-32B

Zhipu

GLM-4-Plus · GLM-4-Long · GLM-Z1

90+ More

Cohere · Yi · MiniMax · Moonshot · Baichuan · and more

One Endpoint. One SDK.

Drop-in compatible with the OpenAI SDK. Switch models by changing a single parameter.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.your-gateway.com/v1",
    api_key="your-api-key"
)

# Use any model — just change the name
response = client.chat.completions.create(
    model="gpt-4o",        # or "claude-opus-4-6", "gemini-2.5-pro"
    messages=[{
        "role": "user",
        "content": "Hello, how are you?"
    }]
)

print(response.choices[0].message.content)

Why AI Gateway

Enterprise-grade infrastructure for production AI applications.

Smart Routing

Automatically route requests to the best model based on cost, latency, and quality requirements.
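To make the idea concrete, here is a minimal sketch of constraint-based model selection: pick the cheapest model that meets a latency ceiling and a quality floor. Everything in it is an assumption for illustration. `ModelProfile`, `pick_model`, the placeholder model names, and the numbers are hypothetical and are not the gateway's actual routing logic or API.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ModelProfile:
    """Hypothetical per-model stats; all values below are made up."""
    name: str
    cost_per_1k_tokens: float  # USD, illustrative only
    p50_latency_ms: float      # illustrative only
    quality: float             # score in [0, 1], illustrative only


def pick_model(
    profiles: List[ModelProfile],
    max_latency_ms: float,
    min_quality: float,
) -> Optional[ModelProfile]:
    """Return the cheapest profile meeting both constraints, or None."""
    eligible = [
        p for p in profiles
        if p.p50_latency_ms <= max_latency_ms and p.quality >= min_quality
    ]
    return min(eligible, key=lambda p: p.cost_per_1k_tokens, default=None)


# Placeholder catalog; names and figures are not real measurements.
CATALOG = [
    ModelProfile("large-model", 10.0, 900.0, 0.95),
    ModelProfile("medium-model", 3.0, 500.0, 0.85),
    ModelProfile("small-model", 0.5, 200.0, 0.70),
]
```

With this catalog, a request that tolerates 1000 ms and needs quality of at least 0.8 would land on `medium-model`, since it is the cheaper of the two eligible profiles.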

Unified Auth

One API key to access all providers. Manage credentials, permissions, and budgets in one place.

Real-time Analytics

Track usage, costs, latency, and error rates across all models with granular dashboards.

Fallback & Retry

Automatic failover between providers. If one model is down, requests route to alternatives instantly.
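The same pattern can be sketched on the client side. The snippet below is a minimal illustration of retry-then-fallback, not the gateway's implementation: `complete_with_fallback` and its parameters are hypothetical names, and `call` stands in for whatever function sends one request to one model.

```python
import time
from typing import Callable, Optional, Sequence, Tuple


def complete_with_fallback(
    call: Callable[[str], str],
    models: Sequence[str],
    retries_per_model: int = 2,
    backoff_s: float = 0.0,
) -> Tuple[str, str]:
    """Try each model in order; return (model_used, result) on first success."""
    last_error: Optional[Exception] = None
    for model in models:
        for attempt in range(retries_per_model):
            try:
                return model, call(model)
            except Exception as exc:  # real code should catch only transient errors
                last_error = exc
                # Exponential backoff between attempts (no delay when backoff_s is 0).
                time.sleep(backoff_s * (2 ** attempt))
    raise RuntimeError("all models failed") from last_error
```

In practice `call` could wrap `client.chat.completions.create(model=model, messages=...)` and return the message content, with `models` listing a primary model followed by its alternatives.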

Rate Limiting

Per-user, per-model rate limits with token bucket algorithms. Protect your budget and infrastructure.
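For readers unfamiliar with token buckets, here is a minimal sketch of the algorithm with one bucket per (user, model) pair. The class names, parameters, and the injectable `clock` are assumptions made for illustration; the gateway's own limiter and its configuration are not shown here.

```python
import time
from typing import Callable, Dict, Tuple


class TokenBucket:
    """Refills at `rate_per_s` tokens per second, up to `capacity`."""

    def __init__(self, rate_per_s: float, capacity: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.rate = rate_per_s
        self.capacity = capacity
        self.tokens = capacity  # start full
        self.clock = clock
        self.last = clock()

    def allow(self, cost: float = 1.0) -> bool:
        now = self.clock()
        # Add tokens earned since the last call, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


class RateLimiter:
    """Keeps one bucket per (user, model) pair, created on first use."""

    def __init__(self, rate_per_s: float, capacity: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._rate, self._capacity, self._clock = rate_per_s, capacity, clock
        self._buckets: Dict[Tuple[str, str], TokenBucket] = {}

    def allow(self, user: str, model: str, cost: float = 1.0) -> bool:
        key = (user, model)
        if key not in self._buckets:
            self._buckets[key] = TokenBucket(self._rate, self._capacity, self._clock)
        return self._buckets[key].allow(cost)
```

The capacity sets the burst size and the rate sets the sustained throughput, so `RateLimiter(rate_per_s=1.0, capacity=2.0)` permits a burst of two requests per user and model, then one more per second.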

Streaming First

Full SSE streaming support with consistent response format across all providers and models.
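As an illustration of what a consistent streaming format buys you, the sketch below extracts text from an SSE stream with a single parser. It assumes the stream follows the OpenAI chat-completion chunk shape (`data: {json}` lines carrying `choices[].delta.content`, ending with `data: [DONE]`), which is an assumption based on the OpenAI-SDK compatibility described above. `iter_content` is a hypothetical helper name.

```python
import json
from typing import Iterable, Iterator


def iter_content(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from OpenAI-style SSE lines."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators, comments, and other SSE fields
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream sentinel
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = choice.get("delta", {}).get("content")
            if text:
                yield text
```

When using the OpenAI SDK you would normally not parse SSE by hand: passing `stream=True` to `client.chat.completions.create(...)` returns an iterator of already-parsed chunks.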

Simple Pricing

Pay only for what you use. No hidden fees. No minimums.

Free

$0/month
  • Up to 1M tokens/month
  • 5 models
  • Basic analytics
  • Community support

Enterprise

Custom
  • Everything in Pro
  • SSO & SAML
  • Dedicated infrastructure
  • SLA guarantee
  • Custom integrations
  • Dedicated support

Start building with AI Gateway today.

Get your API key in 30 seconds. No credit card required.