Discover how Apache APISIX serves as an AI gateway with AI proxy, LLMs load balancing, retry and fallback, token rate limiting, and security for efficient and reliable AI agents.
To Keep Up with the Rapid Evolution of AI and LLMs
Powered by Apache APISIX
LLMs and API management features
Powerful and Open-Source Plugins for LLMs Load Balancing and Token Rate Limiting
All AI plugins are fully open-source, including multi-LLM load balancing, retry and fallback mechanisms, token rate limiting, content moderation, AI RAG, prompt decorator and auditing.
Supports multiple LLM providers (OpenAI, DeepSeek, Claude, Mistral, Gemini, etc.) to prevent vendor lock-in, while dynamically adjusting LLM weights based on latency, cost, and stability.
Token usage can be rate-limited and throttled based on various dimensions such as Route, Service, Consumer, Consumer Group, or custom parameters. Supports both single-node and cluster-level rate limiting. Additionally, different rate-limiting strategies can be configured for each LLM.
Through RAG, LLMs can leverage the enterprise knowledge base to answer questions or generate content, improving the professionalism and accuracy of the generated output while avoiding LLM hallucinations.
By utilizing access logs and observability components, track token usage to prevent API abuse and avoid excessive billing.
Supports configurable LLM health checks, with automatic retries and fallback to other LLM services, ensuring service stability and quality.
Utilize plugins such as Prompt Guard, Prompt Decorator, Prompt Template, Content Moderation, and Logging & Auditing to ensure the security and compliance of user inputs and LLM responses.
API7 AI Gateway supports multiple LLMs, including but not limited to OpenAI, DeepSeek, Claude, Mistral, and Gemini, ensuring your AI applications are adaptable to diverse scenarios.
Learn More“Airwallex has made a smooth transition to multi-cloud and microservices architectures thanks to APISIX's highly optimized and scalable platform and the support of our developer community!”
Ryan Cao
Chief Software Architect
“API7 solution performs surprisingly well in its practice in production scenarios. We love its high availability, high performance, and rich functionality, allowing us to build and grow our business in a cloud-native way.”
Xu Zhao
Infrastructure Architect