Question 1

What is BUZZ?

Accepted Answer

BUZZ is an AI gateway for developers. A single API key gives you access to the Claude (Anthropic), GPT (OpenAI), Gemini (Google), and Grok (xAI) model families through one unified endpoint. BUZZ forwards each request unmodified to the upstream provider, returns the response unmodified to the caller, and never persists request bodies.

Question 2

How is BUZZ different from a traditional API relay or middleman?

Accepted Answer

Traditional relays often modify prompts, inject system messages, silently downgrade models, or log request bodies. BUZZ does none of these. Requests are forwarded byte-for-byte to the upstream provider, and responses are returned byte-for-byte to the caller. Models are never silently swapped or downgraded. No request body is written to disk or to a database. Only billing metadata, such as token counts and model name, is retained for invoicing.

Question 3

Which models does BUZZ support?

Accepted Answer

BUZZ supports the Anthropic Claude family (Opus 4.5, 4.6, 4.7; Sonnet 4.5, 4.6; Haiku 4.5), the OpenAI GPT-5 family (gpt-5, gpt-5.1-codex, gpt-5.1-codex-max, gpt-5.2, gpt-5.2-codex, gpt-5.3-codex, gpt-5.4, gpt-5.4-mini, gpt-5.5), Google Gemini, and xAI Grok. New models are added shortly after upstream release. The live model and pricing list is available at https://buzzai.cc/api/pricing.

Question 4

How is BUZZ priced?

Accepted Answer

Pay-as-you-go. No monthly fees, no subscriptions, no minimums. BUZZ pricing sits significantly below first-party rates from Anthropic and OpenAI across all supported models. Specific per-model pricing is published live at https://buzzai.cc/api/pricing and on the Model Pricing page. Pricing is subject to change, so integrators should read rates from the live pricing endpoint rather than hard-coding them.
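Reading rates at runtime instead of hard-coding them could look like the sketch below. It assumes the pricing endpoint returns JSON and needs no authentication; neither is confirmed here, and no particular response schema is assumed. The names fetch_pricing and parse_pricing are illustrative helpers, not part of any BUZZ SDK.

```python
import json
import urllib.request

# URL taken from the answer above.
PRICING_URL = "https://buzzai.cc/api/pricing"


def parse_pricing(raw: bytes):
    """Decode a pricing payload.

    Assumption: the payload is UTF-8 encoded JSON. The structure of the
    decoded object is not assumed; inspect it before relying on any field.
    """
    return json.loads(raw.decode("utf-8"))


def fetch_pricing(url: str = PRICING_URL, timeout: float = 10.0):
    """Fetch the live pricing list on demand rather than hard-coding rates.

    Assumption: the endpoint is reachable without an API key.
    """
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_pricing(resp.read())
```

Calling fetch_pricing() at startup, or on a periodic refresh, keeps cost estimates aligned with the published rates.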

Question 5

What does zero data retention mean at BUZZ?

Accepted Answer

Request bodies and model responses are not written to disk, not written to a database, and not persisted to logs. After a request completes, the request body is dropped from memory. Only billing metadata (model name, input and output token counts, timestamp, user ID) is retained, since this is required to charge accurately. Operators of BUZZ cannot retrieve the prompt or response of any past request, because that data does not exist after the request completes.

Question 6

What does transparent forwarding mean?

Accepted Answer

BUZZ does not rewrite, augment, or filter the request body before sending it upstream. The system prompt, user messages, tool definitions, model parameters, and any other field arrive at the upstream provider exactly as the developer sent them. The response stream is forwarded back unmodified. The model selected by the developer is the model that runs the request; BUZZ never silently substitutes a cheaper model.

Question 7

How do I integrate BUZZ in my code?

Accepted Answer

Replace the base URL in the Anthropic SDK or OpenAI SDK with the BUZZ endpoint. With the Anthropic SDK, set base_url to https://buzzai.cc and send your BUZZ API key in the Authorization header. With the OpenAI SDK, set base_url to https://buzzai.cc/v1 and pass your BUZZ API key as the api_key parameter. No other code changes are required.
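A minimal sketch of both setups with the official Python SDKs follows. The base URLs come from the answer above. Passing the key as auth_token makes the Anthropic SDK send it as an Authorization: Bearer header; that BUZZ expects the Bearer form is an assumption. The helper names and the BUZZ_API_KEY environment variable are illustrative only.

```python
import os

# Base URLs as stated in the answer above.
BUZZ_ANTHROPIC_BASE_URL = "https://buzzai.cc"
BUZZ_OPENAI_BASE_URL = "https://buzzai.cc/v1"


def make_anthropic_client(api_key: str):
    """Return an Anthropic SDK client pointed at BUZZ.

    Requires `pip install anthropic`. auth_token is sent as
    `Authorization: Bearer <key>` (assumed to be the form BUZZ expects).
    """
    from anthropic import Anthropic

    return Anthropic(base_url=BUZZ_ANTHROPIC_BASE_URL, auth_token=api_key)


def make_openai_client(api_key: str):
    """Return an OpenAI SDK client pointed at BUZZ.

    Requires `pip install openai`.
    """
    from openai import OpenAI

    return OpenAI(base_url=BUZZ_OPENAI_BASE_URL, api_key=api_key)


def read_buzz_key() -> str:
    """Read the key from a hypothetical BUZZ_API_KEY environment variable."""
    return os.environ.get("BUZZ_API_KEY", "")
```

With a client from make_anthropic_client(read_buzz_key()), calls such as client.messages.create(...) work as they do against the first-party API; the OpenAI client is used the same way through client.chat.completions.create(...).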

Question 8

Does BUZZ support Claude Code?

Accepted Answer

Yes. Claude Code works with BUZZ end-to-end, including extended thinking, tool use, and prompt caching. A one-line install script at https://buzzai.cc/sh/claudecode.sh configures the Claude Code CLI to point at BUZZ.

Question 9

Does BUZZ support prompt caching?

Accepted Answer

Yes. BUZZ forwards Anthropic prompt caching directives unchanged. Cache writes and cache reads are billed at the upstream multipliers of the base input rate (5-minute cache write at 1.25x, 1-hour cache write at 2x, cache read at 0.1x); BUZZ then applies its standard discount on top. Per-model cache pricing is shown on the Model Pricing page.
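The multiplier arithmetic can be worked through with a small helper. The three multipliers are the ones quoted above. The base rate and the discount_factor argument are placeholders: the actual BUZZ discount is not stated here, so it defaults to 1.0 (no discount).

```python
# Cache multipliers relative to the base input rate, as quoted above.
CACHE_MULTIPLIERS = {
    "write_5m": 1.25,  # 5-minute cache write
    "write_1h": 2.0,   # 1-hour cache write
    "read": 0.1,       # cache read
}


def cache_cost(tokens: int, base_input_per_mtok: float, kind: str,
               discount_factor: float = 1.0) -> float:
    """Estimate the cost of cached tokens.

    tokens              -- number of cached tokens written or read
    base_input_per_mtok -- base input price per million tokens (placeholder)
    kind                -- one of the CACHE_MULTIPLIERS keys
    discount_factor     -- hypothetical stand-in for the BUZZ discount,
                           e.g. 0.5 would mean half of the upstream price
    """
    multiplier = CACHE_MULTIPLIERS[kind]
    return tokens / 1_000_000 * base_input_per_mtok * multiplier * discount_factor
```

For example, at a placeholder base rate of 3.00 per million input tokens, writing 1,000,000 tokens to the 5-minute cache comes to 3.75 before any discount, and reading them back comes to about 0.30.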

Question 10

Where is BUZZ hosted? Can I use it from outside China?

Accepted Answer

BUZZ is built for global developers. The service is reachable worldwide and exposes documentation and a dashboard in English by default. Sign-up does not require a credit card; payment is supported via multiple methods. The platform is operated independently of any single region's infrastructure.
