New

Announcing AISIX: The AI-Native AI Gateway for LLMs and AI AgentsLearn More

Learn More

From the original creators of Apache APISIX

Make AI your competitive edge — not your cost center.

AISIX is the open-source AI gateway for LLMs and agents. Put one OpenAI-compatible API in front of every model — ship AI features to production in days, then scale the ones that move your numbers.

python
from openai import OpenAI
client = OpenAI(
    base_url="https://your-aisix-gateway/v1",  # point to AISIX
    api_key="AISIX_API_KEY",
)
# your existing OpenAI code — unchanged
resp = client.chat.completions.create(
    model="gpt-4o",  # or claude, gemini, deepseek… 100+
    messages=[{"role": "user", "content": "Hello"}],
)
Sub-ms
proxy overhead
100+
LLM providers
100%
OpenAI-compatible
Apache-2.0
open-source core

Don't let AI plumbing slow your roadmap

The first prompt is easy. The 50th service, the 10th model, and the first incident in front of customers are not. A gateway is where that complexity goes — so your team keeps shipping.

Direct to providers

Every team reinvents the plumbing

  • A different SDK per provider — every integration is a project
  • No shared limits — one runaway loop takes everyone down
  • Spend you can't see until the invoice lands
  • Provider keys copied across every service
  • No guardrails or audit trail — afraid to ship AI to customers
With AISIX

One platform to ship AI faster

  • One OpenAI-compatible API — integrate once, never rewrite
  • Org-wide limits & budgets — scale without surprises
  • Every call logged — see what's working
  • Keys centralized & encrypted — ship without leaking secrets
  • Guardrails, retries & failover — put AI in front of customers

Switch models without switching code

Point your existing OpenAI SDK at AISIX and reach every major provider — no rewrites, no per-vendor SDKs, no lock-in.

  • OpenAI-compatible chat, streaming (SSE), embeddings, rerank, audio & images — plus an Anthropic-compatible /v1/messages.
  • Virtual / routing models let one model name fan out across providers and regions.
  • Cloud providers — AWS Bedrock, Azure OpenAI & GCP Vertex AI — alongside direct APIs.
Providers, one interface
OpenAIAnthropicGoogle GeminiDeepSeek AWS BedrockAzure OpenAIVertex AIMistral GroqCohereQwenTogetherFireworks 100+ more →

Your whole AI gateway, in one console

Requests, latency, errors, spend, and model health — live across every environment, with no dashboards to build yourself.

AISIX Cloud console overview: request volume, p50/p99 latency, error rate, cost today, a 24-hour request and cost chart, and environment health across instances.

Everything you need to take AI to production

From first prototype to production scale — one platform, every control built in.

Ship and switch models without rewrites. Alias any model to any provider — OpenAI, Anthropic, Bedrock, Vertex, Groq — with weighted load balancing, automatic failover, health checks, and semantic & cost-optimal routing.

AISIX models page aliasing gpt-4o, claude-3-7-sonnet, bedrock-claude-prod, gemini-flash and others to upstream providers and API bases.

Scale to production traffic without surprises. Sliding-window request (RPM / RPD) and token (TPM / TPD) limits plus concurrency caps — scoped per key, team, or provider account, and synchronized across replicas through Redis.

AISIX rate limits: per-key burst, per-team cap, and provider-account protection with request and token budgets per minute.

Put AI in front of customers, confidently. Pre-input and post-output checks — keyword / regex blocklists, PII redaction (Presidio), prompt-injection and moderation (Lakera, OpenAI Moderation, Llama-Guard), and per-key model access control.

AISIX guardrails: competitor-mention blocklist, PII redaction, and a fail-closed prompt-injection shield, scoped to keys or models.

Invest in what's working. Month-to-date spend across every environment, key, and member — with per-key and org budgets, alerts at 75 / 90 / 100%, per-model cost tracking, and hard-stop caps.

AISIX usage and spend: org spend against a monthly cap, requests this month, top model by cost, and per-environment and per-member breakdowns.

A real gateway — not a Python proxy

Most AI gateways are a scripting-language shim in your hot path. AISIX is gateway infrastructure, rebuilt for AI.

Rs

Built in Rust

Sub-millisecond overhead, no garbage-collection pauses, and a stateless data plane that scales horizontally with your traffic.

OS

Open source, Apache-2.0

Self-host the full gateway as a single binary — free, forever, no lock-in. Or run it fully managed on AISIX Cloud.

A6

From the Apache APISIX team

Five-plus years of production gateway engineering from API7.ai, the creators of Apache APISIX — now rebuilt for LLMs and agents.

VPC

Cloud or your own VPC

Start on the managed cloud in minutes, or deploy the control plane and data planes entirely inside your own infrastructure.

Ready when you scale
SOC 2 Type IIISO 27001GDPRHIPAASSO (SAML / OIDC)Audit logs

Ship AI that grows your business

OpenAI-compatible — point your SDK at AISIX and start shipping. Free to start, no credit card.