Skip to main content
The Muxx Gateway is a proxy that sits between your application and LLM providers. It intercepts requests, logs them, and can apply caching and rate limiting—all with minimal code changes.

How It Works

Your App → Muxx Gateway → LLM Provider (OpenAI, Anthropic, etc.)

         Logging & Metrics

         Muxx Dashboard
  1. Your app sends requests to the Muxx Gateway instead of directly to the provider
  2. The gateway forwards requests to the appropriate provider
  3. Responses are logged and returned to your app
  4. View everything in the Muxx dashboard

Key Features

Request Logging

Every request and response is logged with full payloads, token counts, and timing.

Response Caching

Cache identical requests to reduce latency and costs.

Rate Limiting

Protect your budget with requests-per-minute limits.

Cost Tracking

Real-time cost calculation for every request.

Supported Endpoints

The gateway supports OpenAI-compatible and Anthropic-compatible endpoints:
ProviderEndpointGateway URL
OpenAI/v1/chat/completionshttps://gateway.muxx.dev/v1/chat/completions
Anthropic/v1/messageshttps://gateway.muxx.dev/v1/messages

When to Use Gateway

The gateway is ideal when you want:
  • Zero code changes - Just update the base URL
  • Caching - Reduce costs and latency for repeated queries
  • Rate limiting - Protect against runaway costs
  • Quick integration - Get logging in minutes
For deeper tracing with spans and custom instrumentation, combine the gateway with the SDK.

Next Steps

Gateway Setup

Configure the gateway for your application