mirror of
https://github.com/diegosouzapw/OmniRoute.git
synced 2026-04-28 14:29:54 +00:00
* fix(minimax): switch auth from x-api-key to Authorization Bearer (#1076) Integrated into release/v3.5.6 — MiniMax auth fix with authHeader consistency normalization * feat(CI,i18n): autogenerate language files + Add missing strings (#1071) Integrated into release/v3.5.6 — i18n translations for memory, skills, and missing keys across 31 languages * fix(ci): restore i18n continue-on-error, remove auto-commit race condition * fix(husky): load nvm in hooks for VS Code compatibility * fix(husky): gracefully skip hooks when npm is not in PATH * fix: convert OpenAI function tool_choice to Claude tool format (#1072) * fix: prevent EPIPE feedback loop filling logs at GB/s (#1006) * fix: fallback to native fetch when undici dispatcher fails (#1054) * fix: improve Qoder PAT validation with actionable error messages (#966) - Add QODER_PERSONAL_ACCESS_TOKEN env var fallback for both validation and execution - Pre-flight ping check to diagnose connectivity issues (Docker/proxy) - Detect encrypted auth blobs from ~/.qoder/.auth/user and guide to website PAT - Clear error messages for auth failures with link to integrations page - Treat non-auth 4xx as auth-pass (request format issue, not token issue) - Update tests to cover new validation paths (23 tests, all passing) * feat: Improve the Chinese translation (#1079) Integrated into release/v3.5.6 * chore(release): v3.5.6 — i18n updates and credential security fixes * fix(ci): resolve e2e and docs-sync pipeline failures * fix(security): bump next to 16.2.3 to resolve SNYK-JS-NEXT-15954202 * fix: guard Memory/Cache UI against null toLocaleString crash (#1083) * fix: translate OpenAI tool_choice type 'function' to Claude 'tool' format (#1072) * fix: pass custom baseUrl in provider API key validation (#1078) * docs: update CHANGELOG with v3.5.6 bug fixes and security patches * docs: rewrite implement-features workflow with 5-phase harvest-research-report-plan-execute pipeline * docs: organize _ideia/ into viable/defer/notfit + add Phase 2.5 auto-response workflow * docs: implementation plans for #1025, #750, #960, #1046 + close already-implemented #833, #973, #982 * feat: mask email addresses in dashboard for privacy (#1025) * feat: add OpenRouter and GitHub to embedding/image provider registries (#960) * feat: add model visibility toggle and search filter to provider page (#750) * docs: move implemented features to notfit, update task plans status * chore: untrack _ideia/ and _tasks/ from git — private/internal only * chore(release): bump to v3.5.6 — changelog, docs, version sync & any-budget fix * fix: remove explicit .ts extension in qoderCli import that caused 500 error in production build --------- Co-authored-by: Jean Brito <jeanfbrito@gmail.com> Co-authored-by: zenobit <zenobit@disroot.org> Co-authored-by: diegosouzapw <diegosouzapw@users.noreply.github.com> Co-authored-by: Ethan Hunt <136065060+only4copilot@users.noreply.github.com>
463 lines
17 KiB
Markdown
463 lines
17 KiB
Markdown
# API Reference
|
|
|
|
🌐 **Languages:** 🇺🇸 [English](API_REFERENCE.md) | 🇧🇷 [Português (Brasil)](i18n/pt-BR/API_REFERENCE.md) | 🇪🇸 [Español](i18n/es/API_REFERENCE.md) | 🇫🇷 [Français](i18n/fr/API_REFERENCE.md) | 🇮🇹 [Italiano](i18n/it/API_REFERENCE.md) | 🇷🇺 [Русский](i18n/ru/API_REFERENCE.md) | 🇨🇳 [中文 (简体)](i18n/zh-CN/API_REFERENCE.md) | 🇩🇪 [Deutsch](i18n/de/API_REFERENCE.md) | 🇮🇳 [हिन्दी](i18n/in/API_REFERENCE.md) | 🇹🇭 [ไทย](i18n/th/API_REFERENCE.md) | 🇺🇦 [Українська](i18n/uk-UA/API_REFERENCE.md) | 🇸🇦 [العربية](i18n/ar/API_REFERENCE.md) | 🇯🇵 [日本語](i18n/ja/API_REFERENCE.md) | 🇻🇳 [Tiếng Việt](i18n/vi/API_REFERENCE.md) | 🇧🇬 [Български](i18n/bg/API_REFERENCE.md) | 🇩🇰 [Dansk](i18n/da/API_REFERENCE.md) | 🇫🇮 [Suomi](i18n/fi/API_REFERENCE.md) | 🇮🇱 [עברית](i18n/he/API_REFERENCE.md) | 🇭🇺 [Magyar](i18n/hu/API_REFERENCE.md) | 🇮🇩 [Bahasa Indonesia](i18n/id/API_REFERENCE.md) | 🇰🇷 [한국어](i18n/ko/API_REFERENCE.md) | 🇲🇾 [Bahasa Melayu](i18n/ms/API_REFERENCE.md) | 🇳🇱 [Nederlands](i18n/nl/API_REFERENCE.md) | 🇳🇴 [Norsk](i18n/no/API_REFERENCE.md) | 🇵🇹 [Português (Portugal)](i18n/pt/API_REFERENCE.md) | 🇷🇴 [Română](i18n/ro/API_REFERENCE.md) | 🇵🇱 [Polski](i18n/pl/API_REFERENCE.md) | 🇸🇰 [Slovenčina](i18n/sk/API_REFERENCE.md) | 🇸🇪 [Svenska](i18n/sv/API_REFERENCE.md) | 🇵🇭 [Filipino](i18n/phi/API_REFERENCE.md) | 🇨🇿 [Čeština](i18n/cs/API_REFERENCE.md)
|
|
|
|
Complete reference for all OmniRoute API endpoints.
|
|
|
|
---
|
|
|
|
## Table of Contents
|
|
|
|
- [Chat Completions](#chat-completions)
|
|
- [Embeddings](#embeddings)
|
|
- [Image Generation](#image-generation)
|
|
- [List Models](#list-models)
|
|
- [Compatibility Endpoints](#compatibility-endpoints)
|
|
- [Semantic Cache](#semantic-cache)
|
|
- [Dashboard & Management](#dashboard--management)
|
|
- [Request Processing](#request-processing)
|
|
- [Authentication](#authentication)
|
|
|
|
---
|
|
|
|
## Chat Completions
|
|
|
|
```bash
|
|
POST /v1/chat/completions
|
|
Authorization: Bearer your-api-key
|
|
Content-Type: application/json
|
|
|
|
{
|
|
"model": "cc/claude-opus-4-6",
|
|
"messages": [
|
|
{"role": "user", "content": "Write a function to..."}
|
|
],
|
|
"stream": true
|
|
}
|
|
```
|
|
|
|
### Custom Headers
|
|
|
|
| Header | Direction | Description |
|
|
| ------------------------ | --------- | ------------------------------------------------ |
|
|
| `X-OmniRoute-No-Cache` | Request | Set to `true` to bypass cache |
|
|
| `X-OmniRoute-Progress` | Request | Set to `true` for progress events |
|
|
| `X-Session-Id` | Request | Sticky session key for external session affinity |
|
|
| `x_session_id` | Request | Underscore variant also accepted (direct HTTP) |
|
|
| `Idempotency-Key` | Request | Dedup key (5s window) |
|
|
| `X-Request-Id` | Request | Alternative dedup key |
|
|
| `X-OmniRoute-Cache` | Response | `HIT` or `MISS` (non-streaming) |
|
|
| `X-OmniRoute-Idempotent` | Response | `true` if deduplicated |
|
|
| `X-OmniRoute-Progress` | Response | `enabled` if progress tracking on |
|
|
| `X-OmniRoute-Session-Id` | Response | Effective session ID used by OmniRoute |
|
|
|
|
> Nginx note: if you rely on underscore headers (for example `x_session_id`), enable `underscores_in_headers on;`.
|
|
|
|
---
|
|
|
|
## Embeddings
|
|
|
|
```bash
|
|
POST /v1/embeddings
|
|
Authorization: Bearer your-api-key
|
|
Content-Type: application/json
|
|
|
|
{
|
|
"model": "nebius/Qwen/Qwen3-Embedding-8B",
|
|
"input": "The food was delicious"
|
|
}
|
|
```
|
|
|
|
Available providers: Nebius, OpenAI, Mistral, Together AI, Fireworks, NVIDIA, **OpenRouter**, **GitHub Models**.
|
|
|
|
```bash
|
|
# List all embedding models
|
|
GET /v1/embeddings
|
|
```
|
|
|
|
---
|
|
|
|
## Image Generation
|
|
|
|
```bash
|
|
POST /v1/images/generations
|
|
Authorization: Bearer your-api-key
|
|
Content-Type: application/json
|
|
|
|
{
|
|
"model": "openai/dall-e-3",
|
|
"prompt": "A beautiful sunset over mountains",
|
|
"size": "1024x1024"
|
|
}
|
|
```
|
|
|
|
Available providers: OpenAI (DALL-E, GPT Image 1), xAI (Grok Image), Together AI (FLUX), Fireworks AI, Nebius (FLUX), Hyperbolic, NanoBanana, **OpenRouter**, SD WebUI (local), ComfyUI (local).
|
|
|
|
```bash
|
|
# List all image models
|
|
GET /v1/images/generations
|
|
```
|
|
|
|
---
|
|
|
|
## List Models
|
|
|
|
```bash
|
|
GET /v1/models
|
|
Authorization: Bearer your-api-key
|
|
|
|
→ Returns all chat, embedding, and image models + combos in OpenAI format
|
|
```
|
|
|
|
---
|
|
|
|
## Compatibility Endpoints
|
|
|
|
| Method | Path | Format |
|
|
| ------ | --------------------------- | ---------------------- |
|
|
| POST | `/v1/chat/completions` | OpenAI |
|
|
| POST | `/v1/messages` | Anthropic |
|
|
| POST | `/v1/responses` | OpenAI Responses |
|
|
| POST | `/v1/embeddings` | OpenAI |
|
|
| POST | `/v1/images/generations` | OpenAI |
|
|
| GET | `/v1/models` | OpenAI |
|
|
| POST | `/v1/messages/count_tokens` | Anthropic |
|
|
| GET | `/v1beta/models` | Gemini |
|
|
| POST | `/v1beta/models/{...path}` | Gemini generateContent |
|
|
| POST | `/v1/api/chat` | Ollama |
|
|
|
|
### Dedicated Provider Routes
|
|
|
|
```bash
|
|
POST /v1/providers/{provider}/chat/completions
|
|
POST /v1/providers/{provider}/embeddings
|
|
POST /v1/providers/{provider}/images/generations
|
|
```
|
|
|
|
The provider prefix is auto-added if missing. Mismatched models return `400`.
|
|
|
|
---
|
|
|
|
## Semantic Cache
|
|
|
|
```bash
|
|
# Get cache stats
|
|
GET /api/cache/stats
|
|
|
|
# Clear all caches
|
|
DELETE /api/cache/stats
|
|
```
|
|
|
|
Response example:
|
|
|
|
```json
|
|
{
|
|
"semanticCache": {
|
|
"memorySize": 42,
|
|
"memoryMaxSize": 500,
|
|
"dbSize": 128,
|
|
"hitRate": 0.65
|
|
},
|
|
"idempotency": {
|
|
"activeKeys": 3,
|
|
"windowMs": 5000
|
|
}
|
|
}
|
|
```
|
|
|
|
---
|
|
|
|
## Dashboard & Management
|
|
|
|
### Authentication
|
|
|
|
| Endpoint | Method | Description |
|
|
| ----------------------------- | ------- | --------------------- |
|
|
| `/api/auth/login` | POST | Login |
|
|
| `/api/auth/logout` | POST | Logout |
|
|
| `/api/settings/require-login` | GET/PUT | Toggle login required |
|
|
|
|
### Provider Management
|
|
|
|
| Endpoint | Method | Description |
|
|
| ---------------------------- | --------------------- | ---------------------------------------------- |
|
|
| `/api/providers` | GET/POST | List / create providers |
|
|
| `/api/providers/[id]` | GET/PUT/DELETE | Manage a provider |
|
|
| `/api/providers/[id]/test` | POST | Test provider connection |
|
|
| `/api/providers/[id]/models` | GET | List provider models |
|
|
| `/api/providers/validate` | POST | Validate provider config |
|
|
| `/api/provider-nodes*` | Various | Provider node management |
|
|
| `/api/provider-models` | GET/POST/PATCH/DELETE | Custom models (add, update, hide/show, delete) |
|
|
|
|
### OAuth Flows
|
|
|
|
| Endpoint | Method | Description |
|
|
| -------------------------------- | ------- | ----------------------- |
|
|
| `/api/oauth/[provider]/[action]` | Various | Provider-specific OAuth |
|
|
|
|
### Routing & Config
|
|
|
|
| Endpoint | Method | Description |
|
|
| --------------------- | -------- | ----------------------------- |
|
|
| `/api/models/alias` | GET/POST | Model aliases |
|
|
| `/api/models/catalog` | GET | All models by provider + type |
|
|
| `/api/combos*` | Various | Combo management |
|
|
| `/api/keys*` | Various | API key management |
|
|
| `/api/pricing` | GET | Model pricing |
|
|
|
|
### Usage & Analytics
|
|
|
|
| Endpoint | Method | Description |
|
|
| --------------------------- | ------ | -------------------- |
|
|
| `/api/usage/history` | GET | Usage history |
|
|
| `/api/usage/logs` | GET | Usage logs |
|
|
| `/api/usage/request-logs` | GET | Request-level logs |
|
|
| `/api/usage/[connectionId]` | GET | Per-connection usage |
|
|
|
|
### Settings
|
|
|
|
| Endpoint | Method | Description |
|
|
| ------------------------------- | ------------- | ---------------------- |
|
|
| `/api/settings` | GET/PUT/PATCH | General settings |
|
|
| `/api/settings/proxy` | GET/PUT | Network proxy config |
|
|
| `/api/settings/proxy/test` | POST | Test proxy connection |
|
|
| `/api/settings/ip-filter` | GET/PUT | IP allowlist/blocklist |
|
|
| `/api/settings/thinking-budget` | GET/PUT | Reasoning token budget |
|
|
| `/api/settings/system-prompt` | GET/PUT | Global system prompt |
|
|
|
|
### Monitoring
|
|
|
|
| Endpoint | Method | Description |
|
|
| ------------------------ | ---------- | ---------------------------------------------------------------------------------------------------- |
|
|
| `/api/sessions` | GET | Active session tracking |
|
|
| `/api/rate-limits` | GET | Per-account rate limits |
|
|
| `/api/monitoring/health` | GET | Health check + provider summary (`catalogCount`, `configuredCount`, `activeCount`, `monitoredCount`) |
|
|
| `/api/cache/stats` | GET/DELETE | Cache stats / clear |
|
|
|
|
### Backup & Export/Import
|
|
|
|
| Endpoint | Method | Description |
|
|
| --------------------------- | ------ | --------------------------------------- |
|
|
| `/api/db-backups` | GET | List available backups |
|
|
| `/api/db-backups` | PUT | Create a manual backup |
|
|
| `/api/db-backups` | POST | Restore from a specific backup |
|
|
| `/api/db-backups/export` | GET | Download database as .sqlite file |
|
|
| `/api/db-backups/import` | POST | Upload .sqlite file to replace database |
|
|
| `/api/db-backups/exportAll` | GET | Download full backup as .tar.gz archive |
|
|
|
|
### Cloud Sync
|
|
|
|
| Endpoint | Method | Description |
|
|
| ---------------------- | ------- | --------------------- |
|
|
| `/api/sync/cloud` | Various | Cloud sync operations |
|
|
| `/api/sync/initialize` | POST | Initialize sync |
|
|
| `/api/cloud/*` | Various | Cloud management |
|
|
|
|
### Tunnels
|
|
|
|
| Endpoint | Method | Description |
|
|
| -------------------------- | ------ | ----------------------------------------------------------------------- |
|
|
| `/api/tunnels/cloudflared` | GET | Read Cloudflare Quick Tunnel install/runtime status for the dashboard |
|
|
| `/api/tunnels/cloudflared` | POST | Enable or disable the Cloudflare Quick Tunnel (`action=enable/disable`) |
|
|
|
|
### CLI Tools
|
|
|
|
| Endpoint | Method | Description |
|
|
| ---------------------------------- | ------ | ------------------- |
|
|
| `/api/cli-tools/claude-settings` | GET | Claude CLI status |
|
|
| `/api/cli-tools/codex-settings` | GET | Codex CLI status |
|
|
| `/api/cli-tools/droid-settings` | GET | Droid CLI status |
|
|
| `/api/cli-tools/openclaw-settings` | GET | OpenClaw CLI status |
|
|
| `/api/cli-tools/runtime/[toolId]` | GET | Generic CLI runtime |
|
|
|
|
CLI responses include: `installed`, `runnable`, `command`, `commandPath`, `runtimeMode`, `reason`.
|
|
|
|
### ACP Agents
|
|
|
|
| Endpoint | Method | Description |
|
|
| ----------------- | ------ | -------------------------------------------------------- |
|
|
| `/api/acp/agents` | GET | List all detected agents (built-in + custom) with status |
|
|
| `/api/acp/agents` | POST | Add custom agent or refresh detection cache |
|
|
| `/api/acp/agents` | DELETE | Remove a custom agent by `id` query param |
|
|
|
|
GET response includes `agents[]` (id, name, binary, version, installed, protocol, isCustom) and `summary` (total, installed, notFound, builtIn, custom).
|
|
|
|
### Resilience & Rate Limits
|
|
|
|
| Endpoint | Method | Description |
|
|
| ----------------------- | --------- | ------------------------------- |
|
|
| `/api/resilience` | GET/PATCH | Get/update resilience profiles |
|
|
| `/api/resilience/reset` | POST | Reset circuit breakers |
|
|
| `/api/rate-limits` | GET | Per-account rate limit status |
|
|
| `/api/rate-limit` | GET | Global rate limit configuration |
|
|
|
|
### Evals
|
|
|
|
| Endpoint | Method | Description |
|
|
| ------------ | -------- | --------------------------------- |
|
|
| `/api/evals` | GET/POST | List eval suites / run evaluation |
|
|
|
|
### Policies
|
|
|
|
| Endpoint | Method | Description |
|
|
| --------------- | --------------- | ----------------------- |
|
|
| `/api/policies` | GET/POST/DELETE | Manage routing policies |
|
|
|
|
### Compliance
|
|
|
|
| Endpoint | Method | Description |
|
|
| --------------------------- | ------ | ----------------------------- |
|
|
| `/api/compliance/audit-log` | GET | Compliance audit log (last N) |
|
|
|
|
### v1beta (Gemini-Compatible)
|
|
|
|
| Endpoint | Method | Description |
|
|
| -------------------------- | ------ | --------------------------------- |
|
|
| `/v1beta/models` | GET | List models in Gemini format |
|
|
| `/v1beta/models/{...path}` | POST | Gemini `generateContent` endpoint |
|
|
|
|
These endpoints mirror Gemini's API format for clients that expect native Gemini SDK compatibility.
|
|
|
|
### Internal / System APIs
|
|
|
|
| Endpoint | Method | Description |
|
|
| --------------- | ------ | ---------------------------------------------------- |
|
|
| `/api/init` | GET | Application initialization check (used on first run) |
|
|
| `/api/tags` | GET | Ollama-compatible model tags (for Ollama clients) |
|
|
| `/api/restart` | POST | Trigger graceful server restart |
|
|
| `/api/shutdown` | POST | Trigger graceful server shutdown |
|
|
|
|
> **Note:** These endpoints are used internally by the system or for Ollama client compatibility. They are not typically called by end users.
|
|
|
|
---
|
|
|
|
## Audio Transcription
|
|
|
|
```bash
|
|
POST /v1/audio/transcriptions
|
|
Authorization: Bearer your-api-key
|
|
Content-Type: multipart/form-data
|
|
```
|
|
|
|
Transcribe audio files using Deepgram or AssemblyAI.
|
|
|
|
**Request:**
|
|
|
|
```bash
|
|
curl -X POST http://localhost:20128/v1/audio/transcriptions \
|
|
-H "Authorization: Bearer your-api-key" \
|
|
-F "file=@recording.mp3" \
|
|
-F "model=deepgram/nova-3"
|
|
```
|
|
|
|
**Response:**
|
|
|
|
```json
|
|
{
|
|
"text": "Hello, this is the transcribed audio content.",
|
|
"task": "transcribe",
|
|
"language": "en",
|
|
"duration": 12.5
|
|
}
|
|
```
|
|
|
|
**Supported providers:** `deepgram/nova-3`, `assemblyai/best`.
|
|
|
|
**Supported formats:** `mp3`, `wav`, `m4a`, `flac`, `ogg`, `webm`.
|
|
|
|
---
|
|
|
|
## Ollama Compatibility
|
|
|
|
For clients that use Ollama's API format:
|
|
|
|
```bash
|
|
# Chat endpoint (Ollama format)
|
|
POST /v1/api/chat
|
|
|
|
# Model listing (Ollama format)
|
|
GET /api/tags
|
|
```
|
|
|
|
Requests are automatically translated between Ollama and internal formats.
|
|
|
|
---
|
|
|
|
## Telemetry
|
|
|
|
```bash
|
|
# Get latency telemetry summary (p50/p95/p99 per provider)
|
|
GET /api/telemetry/summary
|
|
```
|
|
|
|
**Response:**
|
|
|
|
```json
|
|
{
|
|
"providers": {
|
|
"claudeCode": { "p50": 245, "p95": 890, "p99": 1200, "count": 150 },
|
|
"github": { "p50": 180, "p95": 620, "p99": 950, "count": 320 }
|
|
}
|
|
}
|
|
```
|
|
|
|
---
|
|
|
|
## Budget
|
|
|
|
```bash
|
|
# Get budget status for all API keys
|
|
GET /api/usage/budget
|
|
|
|
# Set or update a budget
|
|
POST /api/usage/budget
|
|
Content-Type: application/json
|
|
|
|
{
|
|
"keyId": "key-123",
|
|
"limit": 50.00,
|
|
"period": "monthly"
|
|
}
|
|
```
|
|
|
|
---
|
|
|
|
## Model Availability
|
|
|
|
```bash
|
|
# Get real-time model availability across all providers
|
|
GET /api/models/availability
|
|
|
|
# Check availability for a specific model
|
|
POST /api/models/availability
|
|
Content-Type: application/json
|
|
|
|
{
|
|
"model": "claude-sonnet-4-5-20250929"
|
|
}
|
|
```
|
|
|
|
---
|
|
|
|
## Request Processing
|
|
|
|
1. Client sends request to `/v1/*`
|
|
2. Route handler calls `handleChat`, `handleEmbedding`, `handleAudioTranscription`, or `handleImageGeneration`
|
|
3. Model is resolved (direct provider/model or alias/combo)
|
|
4. Credentials selected from local DB with account availability filtering
|
|
5. For chat: `handleChatCore` — format detection, translation, cache check, idempotency check
|
|
6. Provider executor sends upstream request
|
|
7. Response translated back to client format (chat) or returned as-is (embeddings/images/audio)
|
|
8. Usage/logging recorded
|
|
9. Fallback applies on errors according to combo rules
|
|
|
|
Full architecture reference: [`ARCHITECTURE.md`](ARCHITECTURE.md)
|
|
|
|
---
|
|
|
|
## Authentication
|
|
|
|
- Dashboard routes (`/dashboard/*`) use `auth_token` cookie
|
|
- Login uses saved password hash; fallback to `INITIAL_PASSWORD`
|
|
- `requireLogin` toggleable via `/api/settings/require-login`
|
|
- `/v1/*` routes optionally require Bearer API key when `REQUIRE_API_KEY=true`
|