All posts tagged "Cost Optimization"

How AI Gateways Cut Costs by 75% with Smart Caching: Lessons from DeepSeek

Technology

May 26, 2026

Learn how intelligent caching in AI Gateways reduces API costs by 75% while maintaining performance, inspired by DeepSeek's approach.

BitNet 100B Models on CPU: The Case for Intelligent LLM Routing

Technology

March 12, 2026

Learn how an AI Gateway with multi-LLM routing optimizes cost, latency, and reliability across cloud, local, and edge models.