New

Unlock the future of AI with APISIX, the fully open-source AI gateway for AI agents and LLMs.

From the original creators of Apache APISIX

The AI gateway for LLMs and agents, built in Rust.

Put one OpenAI-compatible API in front of every model — and route, govern, and observe every LLM and AI-agent call at sub-millisecond overhead. Open-source core, managed cloud.

python
from openai import OpenAI
client = OpenAI(
    base_url="https://your-aisix-gateway/v1",  # point to AISIX
    api_key="AISIX_API_KEY",
)
# your existing OpenAI code — unchanged
resp = client.chat.completions.create(
    model="gpt-4o",  # or claude, gemini, deepseek… 100+
    messages=[{"role": "user", "content": "Hello"}],
)

  • Sub-ms proxy overhead
  • 100+ LLM providers
  • 100% OpenAI-compatible
  • Apache-2.0 open-source core

Calling LLM APIs directly doesn't scale

The first prompt is easy. The 50th service, 10th model, and first surprise invoice are not. A gateway is where that complexity goes.

Direct to providers

Every team reinvents the plumbing

  • A different SDK, auth, and quirks per provider
  • No shared rate limits — one loop drains the quota
  • Spend you can't see until the invoice lands
  • Provider keys copied across every service
  • No guardrails, no audit trail, no fallback
With AISIX

One control point for all AI traffic

  • One OpenAI-compatible API for every model
  • Org-wide limits, budgets & per-model cost tracking
  • Every call logged with usage & cost analytics
  • Keys centralized and envelope-encrypted
  • Guardrails, retries & failover on every request

Switch models without switching code

Point your existing OpenAI SDK at AISIX and reach every major provider — no rewrites, no per-vendor SDKs, no lock-in.

  • OpenAI-compatible chat, streaming (SSE), embeddings, rerank, audio & images — plus an Anthropic-compatible /v1/messages.
  • Virtual / routing models let one model name fan out across providers and regions.
  • Cloud providers — AWS Bedrock, Azure OpenAI & GCP Vertex AI — alongside direct APIs.
Providers, one interface
OpenAI, Anthropic, Google Gemini, DeepSeek, AWS Bedrock, Azure OpenAI, Vertex AI, Mistral, Groq, Cohere, Qwen, Together, Fireworks, and 100+ more.
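
The OpenAI SDK handles streaming for you when you pass stream=True. If you read the stream yourself, the sketch below shows roughly what arrives on the wire. It assumes the gateway mirrors OpenAI's chat-completions stream format ("data:" lines carrying JSON chunks, closed by "data: [DONE]"). The function name and sample payloads are illustrative and do not come from AISIX.

```python
import json
from typing import Iterable, Iterator


def iter_stream_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield text deltas from OpenAI-style chat completion SSE lines."""
    for raw in lines:
        line = raw.strip()
        # SSE payload lines start with "data:"; skip blanks and anything else.
        if not line.startswith("data:"):
            continue
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":  # end-of-stream sentinel
            break
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = choice.get("delta", {}).get("content")
            if text:
                yield text


# Hand-written sample lines standing in for a real streamed response.
sample = [
    'data: {"choices": [{"delta": {"role": "assistant"}}]}',
    "",
    'data: {"choices": [{"delta": {"content": "Hel"}}]}',
    "",
    'data: {"choices": [{"delta": {"content": "lo"}}]}',
    "",
    "data: [DONE]",
]
print("".join(iter_stream_text(sample)))  # prints: Hello
```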

Your whole AI gateway, in one console

Requests, latency, errors, spend, and model health — live across every environment, with no dashboards to build yourself.

AISIX Cloud console overview: request volume, p50/p99 latency, error rate, cost today, a 24-hour request and cost chart, and environment health across instances.

Everything between your app and the models

Routing, limits, cost, and safety — configured once, enforced on every request, visible everywhere.

Alias any model, route to any provider. One stable name maps to any upstream — OpenAI, Anthropic, Bedrock, Vertex, Groq — with weighted load balancing, automatic failover, health checks, and semantic & cost-optimal routing.

AISIX models page aliasing gpt-4o, claude-3-7-sonnet, bedrock-claude-prod, gemini-flash and others to upstream providers and API bases.
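
To make weighted load balancing with failover concrete, here is a minimal in-process sketch. It is not AISIX's implementation. The Upstream class, the weights, and the "mark unhealthy on any exception" rule are assumptions chosen for illustration, and semantic or cost-optimal routing is not shown.

```python
import random
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Upstream:
    name: str             # hypothetical upstream label
    weight: int           # relative share of traffic
    healthy: bool = True


def pick_order(upstreams: List[Upstream], rng: random.Random) -> List[Upstream]:
    """Return healthy upstreams in weighted-random order (first = primary)."""
    pool = [u for u in upstreams if u.healthy and u.weight > 0]
    order = []
    while pool:
        chosen = rng.choices(pool, weights=[u.weight for u in pool], k=1)[0]
        order.append(chosen)
        pool.remove(chosen)
    return order


def route(upstreams: List[Upstream],
          call: Callable[[Upstream], str],
          rng: Optional[random.Random] = None) -> str:
    """Try upstreams in weighted order, failing over on any exception."""
    rng = rng or random.Random()
    last_error: Optional[Exception] = None
    for upstream in pick_order(upstreams, rng):
        try:
            return call(upstream)
        except Exception as exc:
            upstream.healthy = False  # crude passive health check
            last_error = exc
    raise RuntimeError("no healthy upstream available") from last_error


def fake_call(upstream: Upstream) -> str:
    # Simulate an outage on one provider so failover is visible.
    if upstream.name == "openai":
        raise TimeoutError("simulated outage")
    return f"served by {upstream.name}"


alias = [Upstream("openai", 80), Upstream("bedrock", 20)]
print(route(alias, fake_call))  # prints: served by bedrock
```

Whichever upstream is drawn first, the request ends up on the one that answers. A production gateway would also re-probe unhealthy upstreams instead of leaving them marked down.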

Rate limits that actually hold. Sliding-window request (RPM / RPD) and token (TPM / TPD) limits plus concurrency caps — scoped per key, team, or provider account, and synchronized across replicas through Redis.

AISIX rate limits: per-key burst, per-team cap, and provider-account protection with request and token budgets per minute.
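
A sliding-window request limit can be sketched in a few lines. The version below keeps timestamps in memory for a single process. The Redis synchronization across replicas mentioned above is deliberately left out, and the class name and injectable clock are assumptions made so the example is easy to test.

```python
import time
from collections import deque
from typing import Callable, Deque, Dict


class SlidingWindowLimiter:
    """Allow at most `limit` requests per key within the last `window` seconds."""

    def __init__(self, limit: int, window: float = 60.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.limit = limit
        self.window = window
        self.clock = clock
        self._hits: Dict[str, Deque[float]] = {}

    def allow(self, key: str) -> bool:
        now = self.clock()
        hits = self._hits.setdefault(key, deque())
        # Drop timestamps that have slid out of the window.
        while hits and now - hits[0] >= self.window:
            hits.popleft()
        if len(hits) >= self.limit:
            return False
        hits.append(now)
        return True


# A fake clock makes the window easy to step through.
now = [0.0]
rpm = SlidingWindowLimiter(limit=2, window=60.0, clock=lambda: now[0])
print(rpm.allow("team-a"), rpm.allow("team-a"), rpm.allow("team-a"))  # True True False
now[0] = 61.0
print(rpm.allow("team-a"))  # True
```

A token limit (TPM / TPD) follows the same shape if each entry stores a token count and the check sums the counts in the window.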

Guardrails on every prompt. Pre-input and post-output checks — keyword / regex blocklists, PII redaction (Presidio), prompt-injection and moderation (Lakera, OpenAI Moderation, Llama-Guard), and per-key model access control.

AISIX guardrails: competitor-mention blocklist, PII redaction, and a fail-closed prompt-injection shield, scoped to keys or models.
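
As a rough picture of a pre-input check, the sketch below combines a keyword blocklist with a single redaction rule. It stands in for the idea only: it does not use Presidio, Lakera, or any moderation API, the email pattern is a simplified assumption, and the function name is hypothetical.

```python
import re
from typing import List, Tuple

# Simplified email pattern; real PII detection covers far more than this.
EMAIL = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def check_input(prompt: str, blocklist: List[str]) -> Tuple[bool, str]:
    """Return (allowed, sanitized_prompt) for a pre-input guardrail."""
    for term in blocklist:
        # Whole-word, case-insensitive match on each blocked term.
        if re.search(rf"\b{re.escape(term)}\b", prompt, re.IGNORECASE):
            return False, ""
    return True, EMAIL.sub("[EMAIL]", prompt)


print(check_input("Mail bob@example.com the report", ["acme"]))
# (True, 'Mail [EMAIL] the report')
print(check_input("Compare us with ACME", ["acme"]))
# (False, '')
```

Returning an empty prompt on a block mirrors a fail-closed policy: nothing reaches the model unless every check passes.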

No surprise invoices. Month-to-date spend across every environment, key, and member — with per-key and org budgets, alerts at 75 / 90 / 100%, per-model cost tracking, and hard-stop caps.

AISIX usage and spend: org spend against a monthly cap, requests this month, top model by cost, and per-environment and per-member breakdowns.
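
The alert thresholds and hard-stop cap reduce to simple arithmetic on month-to-date spend. The sketch below assumes a single budget and treats reaching 100% as the hard stop. The function name and return shape are illustrative, not an AISIX API.

```python
from typing import List, Tuple

ALERT_THRESHOLDS = (75, 90, 100)  # percent of budget, as listed above


def budget_status(spend: float, budget: float) -> Tuple[List[int], bool]:
    """Return (crossed_thresholds, hard_stop) for month-to-date spend."""
    if budget <= 0:
        raise ValueError("budget must be positive")
    used_pct = spend / budget * 100
    crossed = [t for t in ALERT_THRESHOLDS if used_pct >= t]
    return crossed, used_pct >= 100


print(budget_status(460, 500))  # ([75, 90], False)
print(budget_status(500, 500))  # ([75, 90, 100], True)
```

At $460 of a $500 budget (92%), the 75% and 90% alerts have fired and requests still pass. At $500 the cap is reached and further calls would be refused.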

A real gateway — not a Python proxy

Most AI gateways are a scripting-language shim in your hot path. AISIX is gateway infrastructure, rebuilt for AI.

Built in Rust

Sub-millisecond overhead, no garbage-collection pauses, and a stateless data plane that scales horizontally with your traffic.

Open source, Apache-2.0

Self-host the full gateway as a single binary — free, forever, no lock-in. Or run it fully managed on AISIX Cloud.

From the Apache APISIX team

Five-plus years of production gateway engineering from API7.ai, the creators of Apache APISIX — now rebuilt for LLMs and agents.

Cloud or your own VPC

Start on the managed cloud in minutes, or deploy the control plane and data planes entirely inside your own infrastructure.

Enterprise-ready
SOC 2 Type II, ISO 27001, GDPR, HIPAA, SSO (SAML / OIDC), audit logs

Put a gateway in front of your AI today

OpenAI-compatible — point your SDK at AISIX and your existing code just works. Start free, no credit card.