🚀 modelrelay
Join our Discord for discussions, feature requests, and community support.
<div align="center">
  <img src="docs/assets/dashboard.png" alt="ModelRelay Dashboard" width="100%">
  <br/>
  <p><i>The smartest, fastest, and completely free local router for your AI coding needs.</i></p>
</div>

🔥 100% Free • Auto-Routing • 80+ Models • 11+ Providers • OpenAI-Compatible
modelrelay is an OpenAI-compatible local router that benchmarks free coding models across top providers and automatically forwards your requests to the best available model.
✨ Why use modelrelay?
- 💸 Completely Free: Stop paying for API usage. We seamlessly provide access to robust free models.
- 🧠 State-of-the-Art (SOTA) Models: Out-of-the-box availability for top-tier models including Kimi K2.5, Minimax M2.5, GLM 5, Deepseek V3.2, and more.
- 🏢 Reliable Providers: We route requests securely through trusted, high-performance platforms like NVIDIA, Groq, OpenRouter, OpenCode Zen, Ollama, and Google.
- ⚡ Lightning Fast: The built-in benchmark continually evaluates metrics to pick the fastest and most capable LLM for your request.
- 🔄 OpenAI-Compatible: A perfect drop-in replacement that works seamlessly with your existing tools, scripts, and workflows.
🚀 Install via NPM
```sh
npm install -g modelrelay

# Start it
modelrelay
```
Once started, modelrelay is accessible at http://localhost:7352/.
Router endpoint:
- Base URL: `http://127.0.0.1:7352/v1`
- API key: any string
- Model: `auto-fastest` (the router picks the actual backend)
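Because the router speaks the OpenAI chat completions protocol, any OpenAI-style client can talk to it using the base URL, key, and model above. The sketch below builds such a request with only Python's standard library; the `build_chat_request` helper is illustrative, not part of modelrelay.

```python
import json
import urllib.request

BASE_URL = "http://127.0.0.1:7352/v1"  # modelrelay's default local endpoint


def build_chat_request(prompt: str, model: str = "auto-fastest") -> urllib.request.Request:
    """Build an OpenAI-style chat completion request for the local router.

    Illustrative helper, not part of modelrelay itself.
    """
    body = json.dumps({
        "model": model,  # "auto-fastest" lets the router pick the backend
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": "Bearer any-string",  # the router accepts any key
        },
        method="POST",
    )


req = build_chat_request("Write a haiku about routers")
# With the router running, send it with:
#   resp = json.load(urllib.request.urlopen(req))
#   print(resp["choices"][0]["message"]["content"])
```

The same base URL and dummy key work unchanged with the official OpenAI SDKs, since modelrelay is a drop-in replacement.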
🚀 Install via Docker
Prerequisites
- Docker Engine
- Docker Compose (the `docker compose` command)
```sh
mkdir modelrelay
cd modelrelay
curl -fsSL -o Dockerfile https://raw.githubusercontent.com/ellipticmarketing/modelrelay/master/Dockerfile
curl -fsSL -o docker-compose.yml https://raw.githubusercontent.com/ellipticmarketing/modelrelay/master/docker-compose.yml
docker compose up -d --build
```
Once running, modelrelay is accessible at http://localhost:7352/.
🔌 Installing Integrations
Use `modelrelay onboard` to save provider keys and auto-configure integrations for OpenClaw or OpenCode.

```sh
modelrelay onboard
```
If you prefer manual setup, use the examples below.
OpenCode Integration
`modelrelay onboard` can auto-configure OpenCode.
For manual setup, put this in `~/.config/opencode/opencode.json`:
```json
{
  "$schema": "https://opencode.ai/config.json",
  "provider": {
    "router": {
      "npm": "@ai-sdk/openai-compatible",
      "name": "modelrelay",
      "options": {
        "baseURL": "http://127.0.0.1:7352/v1",
        "apiKey": "dummy-key"
      },
      "models": {
        "auto-fastest": {
          "name": "Auto Fastest"
        }
      }
    }
  },
  "model": "router/auto-fastest"
}
```
OpenClaw Integration
`modelrelay onboard` can auto-configure OpenClaw.
For manual setup, merge this into `~/.openclaw/openclaw.json`:
```json
{
  "models": {
    "providers": {
      "modelrelay": {
        "baseUrl": "http://127.0.0.1:7352/v1",
        "api": "openai-completions",
        "apiKey": "no-key",
        "models": [
          { "id": "auto-fastest", "name": "Auto Fastest" }
        ]
      }
    }
  },
  "agents": {
    "defaults": {
      "model": {
        "primary": "modelrelay/auto-fastest"
      },
      "models": {
        "modelrelay/auto-fastest": {}
      }
    }
  }
}
```
CLI
```sh
modelrelay [--port <number>] [--log] [--ban <model1,model2>]
modelrelay onboard [--port <number>]
modelrelay install --autostart
modelrelay start --autostart
modelrelay uninstall --autostart
modelrelay status --autostart
modelrelay update
modelrelay autoupdate [--enable|--disable|--status] [--interval <hours>]
modelrelay autostart [--install|--start|--uninstall|--status]
modelrelay config export
modelrelay config import <token>
```
Request terminal logging is disabled by default. Use `--log` to enable it.
`modelrelay install --autostart` also triggers an immediate start attempt, so no separate command is needed after install.
During modelrelay onboard, you will also be prompted to enable auto-start on login.
modelrelay update upgrades the global npm package and, when autostart is configured, stops the background service first and starts it again after the update.
Auto-update is enabled by default. While the router is running, modelrelay checks npm periodically (default: every 24 hours) and applies updates automatically.
Use modelrelay autoupdate --status to inspect state, modelrelay autoupdate --disable to turn it off, and modelrelay autoupdate --enable --interval 12 to re-enable with a custom interval.
Use modelrelay config export to print a transferable config token (base64url-encoded JSON), and modelrelay config import <token> to load it on another machine.
You can also import by stdin:
```sh
modelrelay config export | modelrelay config import
```
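The exact token layout is internal to modelrelay; the sketch below only illustrates the base64url-encoded-JSON shape the docs describe, using stdlib Python. `encode_token`/`decode_token` are illustrative helpers, not modelrelay's actual implementation.

```python
import base64
import json


def encode_token(config: dict) -> str:
    """Encode a config dict as a base64url token (illustrative, not modelrelay's exact layout)."""
    raw = json.dumps(config, separators=(",", ":")).encode("utf-8")
    return base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")


def decode_token(token: str) -> dict:
    """Reverse of encode_token, restoring any stripped '=' padding before decoding."""
    padded = token + "=" * (-len(token) % 4)
    return json.loads(base64.urlsafe_b64decode(padded))


# Round-trip a hypothetical config fragment:
config = {"providers": {"groq": {"apiKey": "gsk-example"}}, "autoUpdate": True}
token = encode_token(config)
restored = decode_token(token)
```

Because the token is just an encoding, not encryption, this also shows why it must be treated as a secret: anyone holding it can recover the embedded API keys.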
Endpoints
/v1/chat/completions
POST /v1/chat/completions is an OpenAI-compatible chat completions endpoint.
- Use `model: "auto-fastest"` to route to the best model overall
- Use a grouped model ID such as `minimax-m2.5`, `kimi-k2.5`, or `glm4.7` to route within that model group
- For grouped IDs, modelrelay selects the provider with the best current QoS for that group
- In the Web UI, pinned models can now use either `Canonical Group` mode (default, pins the same model across providers) or `Exact Provider Row` mode from `Settings`
- Streaming and non-streaming requests are both supported
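Streamed responses follow the usual OpenAI server-sent-events shape: `data: {...}` chunk lines terminated by `data: [DONE]`. A minimal stdlib sketch of reassembling the assistant text from such a stream, assuming that standard format:

```python
import json


def collect_stream(lines):
    """Reassemble assistant text from OpenAI-style SSE chunk lines.

    Assumes the standard chat-completions streaming format; not modelrelay-specific code.
    """
    text = []
    for line in lines:
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank keep-alives and comment lines
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream sentinel
        chunk = json.loads(payload)
        delta = chunk["choices"][0]["delta"]
        text.append(delta.get("content", ""))  # role-only deltas carry no content
    return "".join(text)


# Example with two streamed chunks:
sample = [
    'data: {"choices":[{"delta":{"role":"assistant","content":"Hel"}}]}',
    'data: {"choices":[{"delta":{"content":"lo"}}]}',
    "data: [DONE]",
]
print(collect_stream(sample))  # Hello
```

In practice an OpenAI SDK handles this parsing for you; the sketch just shows what travels over the wire.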
/v1/models
GET /v1/models returns the models exposed by the router.
- Model IDs are grouped slugs such as `minimax-m2.5`, `kimi-k2.5`, and `glm4.7`
- Each grouped ID can represent the same model across multiple providers
- When you select one of these IDs in `/v1/chat/completions`, modelrelay routes the request to the provider with the best current QoS for that model group
- `auto-fastest` is also exposed and routes to the best model overall
Example:
```json
{
  "object": "list",
  "data": [
    { "id": "auto-fastest", "object": "model", "owned_by": "router" },
    { "id": "minimax-m2.5", "object": "model", "owned_by": "relay" },
    { "id": "kimi-k2.5", "object": "model", "owned_by": "relay" },
    { "id": "glm4.7", "object": "model", "owned_by": "relay" }
  ]
}
```
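To pick a model programmatically, a client can fetch this listing and read the IDs. The sketch below just parses the example payload above with stdlib Python; `model_ids` is an illustrative helper, not a modelrelay API.

```python
import json

# The example /v1/models payload from above, embedded for illustration:
models_response = json.loads("""
{
  "object": "list",
  "data": [
    { "id": "auto-fastest", "object": "model", "owned_by": "router" },
    { "id": "minimax-m2.5", "object": "model", "owned_by": "relay" },
    { "id": "kimi-k2.5", "object": "model", "owned_by": "relay" },
    { "id": "glm4.7", "object": "model", "owned_by": "relay" }
  ]
}
""")


def model_ids(listing: dict) -> list[str]:
    """Extract model IDs from an OpenAI-style /v1/models listing (illustrative helper)."""
    return [m["id"] for m in listing["data"]]


print(model_ids(models_response))
# ['auto-fastest', 'minimax-m2.5', 'kimi-k2.5', 'glm4.7']
```

Any of these IDs can then be passed as `model` in a `/v1/chat/completions` request.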
Config
- Router config file: `~/.modelrelay.json`
- API key env overrides: `NVIDIA_API_KEY`, `GROQ_API_KEY`, `CEREBRAS_API_KEY`, `SAMBANOVA_API_KEY`, `OPENROUTER_API_KEY`, `OPENCODE_API_KEY`, `OLLAMA_API_KEY`, `OLLAMA_BASE_URL`, `OLLAMA_MODEL`, `CODESTRAL_API_KEY`, `HYPERBOLIC_API_KEY`, `SCALEWAY_API_KEY`, `QWEN_CODE_API_KEY` (or `DASHSCOPE_API_KEY`), `GOOGLE_API_KEY`
For Qwen Code, modelrelay supports both API keys and Qwen OAuth cached credentials (~/.qwen/oauth_creds.json).
If OAuth credentials exist, modelrelay will use them and refresh access tokens automatically.
You can also start OAuth directly from the Web UI Providers tab using Login with Qwen Code.
For hosted Ollama, set OLLAMA_API_KEY and optionally override OLLAMA_BASE_URL / OLLAMA_MODEL.
If you leave the Ollama base URL blank in the UI, modelrelay defaults to https://ollama.com/v1.
With a valid Ollama API key, modelrelay will discover available Ollama models automatically.
If you point Ollama at a local host such as http://127.0.0.1:11434, modelrelay will also auto-discover models and does not require an API key.
Config migration (CLI + Web UI)
- In the Web UI, open `Settings` -> `Configuration Transfer` to export, copy, or import a token.
- The token includes your full config (API keys, provider toggles, pinning mode, bans, filter rules, and auto-update settings).
- Treat tokens as secrets. Anyone with the token can import your keys and settings.
- Alternative: copy `~/.modelrelay.json` directly to the same path on the other machine.
Troubleshooting
Clicking the update button or running modelrelay won't perform an update
To trigger a manual npm update and restart the service, run:
```sh
npm i -g modelrelay@latest
modelrelay autostart --start
```
Testing updates locally without publishing to npm
You can point the updater at a local tarball instead of the npm registry:
```sh
npm pack
MODELRELAY_UPDATE_TARBALL=./modelrelay-1.8.3.tgz pnpm start
```
If you want the Web UI to always show an update while testing, set a higher forced version:
```sh
MODELRELAY_FORCE_UPDATE_VERSION=9.9.9
```
If the tarball filename does not contain a semantic version, also set:
```sh
MODELRELAY_UPDATE_VERSION=1.8.3
```
When `MODELRELAY_UPDATE_TARBALL` is set, the Web UI update flow and `modelrelay update` install from that tarball and bypass the normal Git-checkout update block. This is for local testing only. `MODELRELAY_FORCE_UPDATE_VERSION` only affects version detection; the actual install still comes from the tarball path.
⭐️ If you find modelrelay useful, please consider starring the repo!