5 Providers · 1 Endpoint · Open Source

One gateway for
every frontier model.

Supergate is a self-hosted LLM proxy that routes to OpenAI, Anthropic, Google, Cohere, and Mistral through a single OpenAI-compatible API — with semantic caching, multi-tenancy, BYOK, and per-key cost tracking built in.

example.ts
// One client. Any model. Zero provider SDKs.
const client = new OpenAI({
  baseURL: 'https://your-gateway/v1',
  apiKey: 'gw_your_api_key',
})

// Route to GPT-4o
await client.chat.completions.create({ model: 'gpt-4o' })
// Route to Claude Opus 4.8 — same client
await client.chat.completions.create({ model: 'claude-opus-4-8' })
5
LLM Providers
15
Current Models
~40%
Avg. Cost Saved via Cache
MIT
License · Self-Hostable

Built different

Everything you need.
Nothing you don't.

Supergate covers the hard parts of production LLM infrastructure — so you can focus on building your product.

Unified API

OpenAI-Compatible Endpoint

A single `/v1/chat/completions` endpoint handles all 5 providers. Change one string and your app routes to any model — no SDK changes, no adapter layers.

Cost Reduction

Semantic Caching via pgvector

Vector-similarity caching means near-duplicate prompts serve cached responses instantly. Cut your token spend by 30–50% with zero application changes.

Multi-Tenancy

Row-Level Tenant Isolation

Every query is scoped to a tenant ID enforced at the database level. Tenants cannot see each other's data, keys, or usage — ever.

Observability

Per-Key Cost Attribution

Every request logs model, provider, input/output tokens, latency, and dollar cost — attributed to the specific API key that made it.

Security

Hybrid Key Management

Use our managed gateway keys for instant access, or store your own provider credentials with AES-256-GCM encryption. BYOK keys take precedence automatically.

Open Source

Self-Hosted & Open Source

Deploy on your own infrastructure with a single `docker compose up`. MIT licensed. No vendor lock-in, no egress fees, no surprise bills.

Current model lineup

All frontier models.
One endpoint.

Models are kept up to date with provider deprecation schedules. Currently reflecting the June 2026 lineup.

OpenAI

The most capable frontier models for complex reasoning and generation.

GPT-4o
GPT-4.1
GPT-4o Mini
GPT-4.1 Nano
Anthropic

Safety-first models with exceptional instruction following and analysis.

Claude Opus 4.8
Claude Sonnet 4.6
Claude Haiku 4.5
Google

Multimodal-first models with massive context windows at low cost.

Gemini 2.5 Pro
Gemini 2.5 Flash
Gemini 2.5 Flash Lite
Cohere

RAG-optimised models built for enterprise search and retrieval.

Command A (03-2025)
Command R7B
Mistral

European open-weight models with strong multilingual performance.

Mistral Large 3
Mistral Medium 3.5
Mistral Small 4

How routing works

Supergate reads the model field in your request and routes to the correct provider automatically. BYOK keys take precedence over gateway keys per provider.

gpt-4o → OpenAI
claude-opus-4-8 → Anthropic
gemini-2.5-flash → Google
command-a-03-2025 → Cohere
mistral-large-latest → Mistral

Flexible authentication

Managed keys or your own. Your choice.

Supergate supports a hybrid model — start with managed gateway keys, move to BYOK when you need compliance. Mix and match per provider.

Gateway Keys

Use Supergate's managed provider keys. Zero configuration — just generate a gateway key and start calling any model immediately. Ideal for prototypes and early-stage teams.

  • No provider accounts needed
  • Instant access to all 5 providers
  • Unified billing through one gateway
  • Great for rapid prototyping

Bring Your Own Key (BYOK)

Store your own provider credentials inside Supergate, encrypted with AES-256-GCM. Your keys are used directly — Supergate never proxies billing. Available per provider, per tenant.

  • AES-256-GCM encrypted at rest
  • Direct provider billing relationship
  • Per-provider granularity
  • Enterprise-Independent mode (BYOK-only tenants)

Transparent pricing

Start free. Scale when ready.

Or self-host entirely for free — all tiers are available on your own infrastructure.

Free

$0/mo

Get started instantly with managed gateway keys. No credit card required.

  • All 5 providers via gateway keys
  • 50K tokens / month
  • Semantic caching included
  • Usage dashboard
  • 1 API key
  • Community support
Most Popular

Pro

$29/mo

For teams building real products. Generous limits, full analytics.

  • Everything in Free
  • 5M tokens / month
  • Priority routing
  • BYOK per provider
  • Up to 10 API keys
  • Full usage telemetry
  • Cost attribution per key
  • Email support

Enterprise

Custom

Full isolation, compliance controls, and dedicated infrastructure.

  • Everything in Pro
  • Unlimited tokens
  • Enterprise-Independent mode (BYOK-only)
  • Private VPC deployment
  • SSO / SAML
  • SLA guarantee
  • Dedicated Slack channel
  • Custom data retention policies

All plans include semantic caching, usage telemetry, and multi-tenancy. Self-hosting is always free.

Self-hosted in minutes

One command.
Fully running.

Supergate ships as a Turborepo monorepo with Docker Compose. Bring your own PostgreSQL and Redis, or let the compose file spin them up. Copy your .env, run the stack, seed the database, and you're live.

Docker Compose stack (gateway + dashboard + docs + postgres + redis)
Turborepo monorepo — build only what changed
Database migrations with Drizzle ORM — no manual SQL
GitHub Actions CI/CD template included
bash
$ git clone github.com/shivanshshekhar11/supergate
$ cp .env.example .env # add your keys
$ docker compose -f docker-compose.prod.yml up -d
Pulling gateway ... done
Pulling dashboard ... done
Starting postgres ... done
Starting redis ... done
$ pnpm db:migrate && pnpm db:seed
✅ Supergate is live on :3000

Ship faster with a
gateway that just works.

Open source, MIT licensed, and ready to run in your own infrastructure in under five minutes.

Open Source · MIT License · Docker Ready · No Vendor Lock-in