<?xml version="1.0" encoding="UTF-8" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>BUZZ AI Gateway Blog</title>
    <link>https://buzzai.cc/</link>
    <description>Engineering writing on AI gateways, transparent forwarding, prompt caching, multi-model routing, and pricing strategy. From the team that runs https://buzzai.cc.</description>
    <language>en</language>
    <atom:link href="https://buzzai.cc/feed.xml" rel="self" type="application/rss+xml" />
    <lastBuildDate>Fri, 22 May 2026 18:50:00 +0000</lastBuildDate>
    <pubDate>Fri, 22 May 2026 09:00:00 +0000</pubDate>
    <generator>buzzai.cc</generator>
    <image>
      <url>https://buzzai.cc/og/home.png</url>
      <title>BUZZ AI Gateway</title>
      <link>https://buzzai.cc/</link>
      <width>1200</width>
      <height>630</height>
    </image>

    <item>
      <title>Choosing an AI Gateway: BUZZ vs OpenRouter vs Helicone vs LiteLLM</title>
      <link>https://buzzai.cc/blog/ai-gateway-comparison-buzz-openrouter-helicone</link>
      <guid isPermaLink="true">https://buzzai.cc/blog/ai-gateway-comparison-buzz-openrouter-helicone</guid>
      <pubDate>Fri, 22 May 2026 18:30:00 +0000</pubDate>
      <description>A decision framework for choosing between BUZZ, OpenRouter, Helicone, and LiteLLM, covering model breadth, pricing, privacy, observability, and self-hosting trade-offs.</description>
    </item>

    <item>
      <title>Tool Use With Claude Through a Gateway: Streaming, Errors, and Cost Patterns</title>
      <link>https://buzzai.cc/blog/claude-tool-use-gateway</link>
      <guid isPermaLink="true">https://buzzai.cc/blog/claude-tool-use-gateway</guid>
      <pubDate>Fri, 22 May 2026 18:00:00 +0000</pubDate>
      <description>How tool_use and tool_result round trips work through a transparent gateway, including streaming combined with tool calls, retry boundaries, cost math, and a working agent loop.</description>
    </item>

    <item>
      <title>Anthropic Prompt Caching in Production: A Practical Cost-Reduction Playbook</title>
      <link>https://buzzai.cc/blog/anthropic-prompt-caching-playbook</link>
      <guid isPermaLink="true">https://buzzai.cc/blog/anthropic-prompt-caching-playbook</guid>
      <pubDate>Fri, 22 May 2026 17:30:00 +0000</pubDate>
      <description>cache_control patterns, 5-minute vs 1-hour break-even math, three production code patterns, and the gateway-side guarantee that cache directives pass through unchanged.</description>
    </item>

    <item>
      <title>Using the OpenAI SDK to Talk to Claude (and Gemini, and Grok)</title>
      <link>https://buzzai.cc/blog/openai-sdk-claude-compatibility</link>
      <guid isPermaLink="true">https://buzzai.cc/blog/openai-sdk-claude-compatibility</guid>
      <pubDate>Fri, 22 May 2026 17:00:00 +0000</pubDate>
      <description>The OpenAI-compatible Chat Completions adapter at /v1, what works out of the box, what doesn't translate cleanly, and when to fall back to the native Anthropic SDK.</description>
    </item>

    <item>
      <title>Cutting Claude Code Costs Without Losing Capability: Routing Through a Gateway</title>
      <link>https://buzzai.cc/blog/claude-code-cheaper-with-gateway</link>
      <guid isPermaLink="true">https://buzzai.cc/blog/claude-code-cheaper-with-gateway</guid>
      <pubDate>Fri, 22 May 2026 09:15:00 +0000</pubDate>
      <description>How to route Claude Code through BUZZ to reduce per-session cost without losing extended thinking, tool use, or prompt caching.</description>
    </item>

    <item>
      <title>One API Key for Claude, GPT, Gemini, and Grok: A Multi-Model Gateway in Practice</title>
      <link>https://buzzai.cc/blog/one-key-claude-gpt-gemini</link>
      <guid isPermaLink="true">https://buzzai.cc/blog/one-key-claude-gpt-gemini</guid>
      <pubDate>Fri, 22 May 2026 09:10:00 +0000</pubDate>
      <description>The operational cost of N model providers, what a unified gateway gives you, and when not to use one.</description>
    </item>

    <item>
      <title>Zero-Retention LLM Gateways: Why Enterprises Need Forwarders That Forget</title>
      <link>https://buzzai.cc/blog/zero-retention-llm-gateway</link>
      <guid isPermaLink="true">https://buzzai.cc/blog/zero-retention-llm-gateway</guid>
      <pubDate>Fri, 22 May 2026 09:05:00 +0000</pubDate>
      <description>What zero retention should actually mean, the architecture that makes it possible, why retention is dangerous in production, and how to verify a vendor's claim.</description>
    </item>

    <item>
      <title>Claude API Through a Gateway: A Practical Guide to Reliability, Pricing, and Zero-Retention Forwarding</title>
      <link>https://buzzai.cc/blog/claude-api-gateway-guide</link>
      <guid isPermaLink="true">https://buzzai.cc/blog/claude-api-gateway-guide</guid>
      <pubDate>Fri, 22 May 2026 09:00:00 +0000</pubDate>
      <description>How to use the Claude API through a gateway, comparing direct integration with gateway routing on reliability, pricing, transparent forwarding, and operational concerns.</description>
    </item>

  </channel>
</rss>
