* fix(minimax): switch auth from x-api-key to Authorization Bearer (#1076) Integrated into release/v3.5.6 — MiniMax auth fix with authHeader consistency normalization * feat(CI,i18n): autogenerate language files + Add missing strings (#1071) Integrated into release/v3.5.6 — i18n translations for memory, skills, and missing keys across 31 languages * fix(ci): restore i18n continue-on-error, remove auto-commit race condition * fix(husky): load nvm in hooks for VS Code compatibility * fix(husky): gracefully skip hooks when npm is not in PATH * fix: convert OpenAI function tool_choice to Claude tool format (#1072) * fix: prevent EPIPE feedback loop filling logs at GB/s (#1006) * fix: fallback to native fetch when undici dispatcher fails (#1054) * fix: improve Qoder PAT validation with actionable error messages (#966) - Add QODER_PERSONAL_ACCESS_TOKEN env var fallback for both validation and execution - Pre-flight ping check to diagnose connectivity issues (Docker/proxy) - Detect encrypted auth blobs from ~/.qoder/.auth/user and guide to website PAT - Clear error messages for auth failures with link to integrations page - Treat non-auth 4xx as auth-pass (request format issue, not token issue) - Update tests to cover new validation paths (23 tests, all passing) * feat: Improve the Chinese translation (#1079) Integrated into release/v3.5.6 * chore(release): v3.5.6 — i18n updates and credential security fixes * fix(ci): resolve e2e and docs-sync pipeline failures * fix(security): bump next to 16.2.3 to resolve SNYK-JS-NEXT-15954202 * fix: guard Memory/Cache UI against null toLocaleString crash (#1083) * fix: translate OpenAI tool_choice type 'function' to Claude 'tool' format (#1072) * fix: pass custom baseUrl in provider API key validation (#1078) * docs: update CHANGELOG with v3.5.6 bug fixes and security patches * docs: rewrite implement-features workflow with 5-phase harvest-research-report-plan-execute pipeline * docs: organize _ideia/ into viable/defer/notfit + add Phase 2.5 auto-response workflow * docs: implementation plans for #1025, #750, #960, #1046 + close already-implemented #833, #973, #982 * feat: mask email addresses in dashboard for privacy (#1025) * feat: add OpenRouter and GitHub to embedding/image provider registries (#960) * feat: add model visibility toggle and search filter to provider page (#750) * docs: move implemented features to notfit, update task plans status * chore: untrack _ideia/ and _tasks/ from git — private/internal only * chore(release): bump to v3.5.6 — changelog, docs, version sync & any-budget fix * fix: remove explicit .ts extension in qoderCli import that caused 500 error in production build --------- Co-authored-by: Jean Brito <jeanfbrito@gmail.com> Co-authored-by: zenobit <zenobit@disroot.org> Co-authored-by: diegosouzapw <diegosouzapw@users.noreply.github.com> Co-authored-by: Ethan Hunt <136065060+only4copilot@users.noreply.github.com>
178 KiB
Changelog
[Unreleased]
[3.5.6] — 2026-04-09
✨ New Features
- Email Privacy Masking: OAuth account emails are now masked in the provider dashboard (e.g.
di*****@g****.com) to prevent accidental exposure when sharing screenshots. Full address visible on hover viatitleattribute (#1025). - OpenRouter & GitHub in Embedding/Image Registries: OpenRouter (3 embedding models, 4 image models) and GitHub Models (2 embedding models via Azure inference) are now first-class entries in the provider registries, enabling their use for
/v1/embeddingsand/v1/images/generations(#960). - Model Visibility Toggle & Search Filter: The provider page model list now includes a real-time search/filter bar and a per-model visibility toggle (👁 icon). Hidden models are grayed out and excluded from the
/v1/modelscatalog. An active-count badge (N/M active) shows at a glance how many models are enabled (#750). - Chinese Localization (zh-CN): Added missing translations for Context Relay, Memory, LKGP, and Models.dev sync features, while standardizing terminology across the application (#1079).
- Environment Auto-Sync: Added
sync-env.mjsto auto-generate and append.envfrom.env.exampleduring installation, automatically generating cryptographic secrets on first run. - Source Mode Dashboard Update: Fixed real-time Source (git-checkout) updating in the dashboard, enabling secure, real-time update pipelines for non-NPM installations.
🐛 Bug Fixes & Security
- Hardcoded Secret Cleanup: Removed 12 hardcoded OAuth credential fallbacks from the source code, forcing secure reliance on environment variables and resolving static analysis security alerts.
- Next.js Security Patch: Bumped
nextfrom 16.2.2 to 16.2.3 to resolve critical RSC deserialization RCE vulnerability (SNYK-JS-NEXT-15954202). - Memory/Cache UI Crash: Added null-safety guards (
?? 0) to.toLocaleString()calls in Memory and Cache dashboard pages, preventingTypeErrorcrashes when database tables are empty or contain null numeric values (#1083). - WebSearch tool_choice Translation: Fixed OpenAI-to-Claude translator dropping
tool_choiceobjects withtype: "function"as-is, which Claude rejects. Now properly maps all OpenAItool_choicevariants (function,required,none) to Claude-compatible format (tool,any,auto), fixing "Did 0 searches" in Claude Code WebSearch (#1072). - Provider Validation baseUrl Override: Added
baseUrlpassthrough from frontend validation requests to the backend validation endpoint. Chinese-site users of Alibaba Coding Plan (bailian-coding-plan) can now validate API keys against their custom Base URL instead of always hitting the international endpoint (#1078). - Minimax Auth Header: Switched Minimax provider from
x-api-keytoAuthorization: Bearerheader format, matching the current API spec (#1076). - Native Fetch Fallback: Added graceful fallback to native
fetchwhen theundicidispatcher fails, improving resilience in environments where undici is unavailable (#1054). - EPIPE Flood Fix: Added circuit-breaker logic to prevent EPIPE errors from creating a feedback loop that fills logs at GB/s (#1006).
- Qoder PAT Validation: Improved Qoder Personal Access Token validation with actionable error messages that guide users to the correct token format (#966).
- CI/CD Pipeline: Fixed
check:docs-syncfailure by syncing OpenAPI version to 3.5.6 and finalizing CHANGELOG release heading. Commented outDATA_DIRin.env.exampleto prevent E2E test failures in CI runners lacking root permissions.
🌍 i18n
- Auto Language Generation (CI): Added CI pipeline to auto-generate missing language files and strings via
feat(CI,i18n)workflow, covering 30+ locales (#1071).
[3.5.5] — 2026-04-08
✨ New Features
- Node.js 24 Compatibility Warning: Added a proactive version incompatibility warning on the login page to guide users to the stable Node.js 22 LTS, preventing native sqlite binding crashes.
- Context Relay Combo Strategy: Added the new
context-relaycombo strategy with priority-style routing, structured handoff summary generation once quota usage reaches the warning threshold, and handoff injection after the next real account switch. - Global Context Relay Defaults: Added global Settings defaults plus combo-level configuration for
handoffThreshold,handoffModel, andhandoffProviders, so new or unconfigured combos can inherit the feature consistently.
🐛 Bug Fixes
- Proxy Connection Healthchecks: Applied proxy resolution per connection in the sweeping loop (
tokenHealthCheck.ts) and global provider validation sweeps, resolving Node 22 bypass and improving proxy stability (#1051, #1056, #1061). - Security Vulnerability Remediation: Resolved multiple CodeQL scanning alerts including SSRF in model sync, insecure randomness in web crypto (
generateSessionId), and incomplete URL sanitization. - Context Relay Typing & Synchronization: Reverted out-of-scope test breakages and resolved
handoffProviderand responseinputextraction payload typing. - Legacy OpenAI-Compatible Responses Routing: Fixed legacy/imported OpenAI-compatible providers (for example
openai-compatible-sp-openai) incorrectly routing Chat Completions traffic to/chat/completionswhen the real provider node was configured asapiType: "responses". OmniRoute now treatsproviderSpecificData.apiTypeas authoritative across routing, executors, and translator tools, avoiding false empty-content failures during combo/provider smoke tests (#1069). - Gemini PDF Attachment Integration: Fixed payload generation and format for parsing
inline_dataand generic base64 sources for deep Gemini PDF routing (#993, #1021). - Vercel AI SDK Fallbacks: Mapped
max_output_tokenstomax_tokensfor strict OpenAI-compatible providers, resolving errors from standard AI agents and frameworks (#994). - External Auth & UI Reliability: Handled null
statefailures in Cline OAuth exchange (#1016), added 3rd-party 400 error patterns to combo fallback (#1024), and resolved desktop sidebar layout and popover overflows (#1039, #1001). - Context Relay In-Flight Deduplication: Prevented duplicate handoff generation for the same session/combo while an earlier summary request is still in flight.
- Context Relay Provider Gating: Aligned runtime behavior with configuration so explicit
handoffProvidersexclusions, including an empty array, now disable handoff generation as expected.
🛠️ Maintenance & Dependabot
- Updated Sub-dependencies: Bumped
honoto4.12.12and@hono/node-serverto1.19.13to patch critical security gaps (#1063, #1064, #1067, #1068).
📚 Documentation
- Documentation Synchronization: Updated system documentation (README, Architecture, Features, Tools, Troubleshooting) and synced
i18nconfigurations to match the v3.5.5 context relay patterns and proxy troubleshooting steps. - Context Relay Delivery Notes: Documented the current architecture, runtime flow, and Codex-focused scope in the feature docs, changelog, and agent guidance.
[3.5.4] — 2026-04-07
✨ New Features
- Detailed Token Tracking: Added granular token breakdown columns (cache read, cache write, reasoning) to call logs with proper null vs zero distinction. Includes DB migration 018 and 5-label UI display per provider capability (#1017 — thanks @rdself).
- Legacy JSON Config Import/Export: Restored JSON-based settings export and import for migration from legacy 9router configurations. Security-hardened with Zero-Trust redaction of passwords and
requireLoginfields, and automatic pre-import database backups (#1012 — thanks @luandiasrj). - Non-Stream Aliases: Added API support for explicit non-streaming aliases (
non_stream,disable_stream,disable_streaming,streaming=false), normalized at the boundary before provider translation (#1036 — thanks @wlfonseca). - Russian Dashboard Localization: Comprehensive Russian translation for the dashboard UI, including fixes for 2 Ukrainian locale keys (#1003 — thanks @mercs2910).
🐛 Bug Fixes
- Anthropic Streaming Input Undercount: Fixed a critical bug where Anthropic streaming
prompt_tokensonly reported non-cached tokens (e.g.,in=3when actual total was 113,616). Cache tokens are now summed into prompt_tokens during streaming (#1017). - Built-in Responses API Tool Types: Preserved built-in Responses API tools (
web_search,file_search,computer,code_interpreter,image_generation) from being silently stripped by the empty-name tool filter — these tools carry no.namefield (#1014 — thanks @rdself). - Cursor/Codex Responses Compatibility: Fixed empty output in Cursor when using Codex models by hoisting system input items to
instructions, sanitizing invalid tool names, and detecting Responses-format payloads on chat/completions endpoint (#1002 — thanks @mercs2910). - OAuth Token Expiry Display: Fixed OAuth connections showing "expired" badge even with valid tokens by reading
tokenExpiresAt(updated on refresh) instead ofexpiresAt(original grant timestamp) (#1032 — thanks @tombii). - Codex Fast-Tier Copy: Corrected dashboard settings copy from
service_tier=fasttoservice_tier=priority, matching the actual Codex wire format (#1045 — thanks @kfiramar). - macOS Desktop App Startup: Stabilized packaged macOS app launch by excluding desktop artifacts from the standalone bundle and improving launch path detection (#1004 — thanks @mercs2910).
- macOS Sidebar Layout: Fixed macOS traffic light overlap, sidebar spacing, and button overflow in the Electron desktop app (#1001 — thanks @mercs2910).
⚡ Performance
- Analytics Page Load: Dramatically reduced analytics page load times (30s→1-2s for 50K entries) via date-filtered DB queries, parallel
Promise.all()cost calculations, and merged 6 COUNT queries into a single CASE WHEN aggregate (#1038 — thanks @oyi77).
🔒 Security & Dependencies
- Node Base Image: Upgraded Docker base from
22-bookworm-slimto22.22.2-trixie-slim(#1011 — Snyk). - Production Dependencies: Bumped 5 production dependencies (#1044 — Dependabot).
- Vite: Bumped from 8.0.3 to 8.0.5 (#1031 — Dependabot).
- Development Dependencies: Bumped 4 development dependencies (#1030 — Dependabot).
🧪 Tests
- Token Accounting Tests: Added 18 new unit tests covering detailed token breakdown, null vs zero semantics, per-provider token extraction, and Anthropic streaming input fix (#1017).
- Built-in Tool Tests: Added 3 new test cases for built-in Responses API tool type preservation (#1014).
- ChatCore Sanitization: Updated sanitization tests to accommodate Responses format detection (PR #1002) and built-in tool preservation (PR #1014).
🛠️ Maintenance
- PR Workflow: Updated
/review-prsworkflow to merge PRs into the release branch (release/vX.Y.Z) instead of directly intomain, ensuring proper pre-release staging.
Coverage
- 2537 tests, 2532 passing — Statement coverage: 91.95%, Branch coverage: 78.79%, Function coverage: 93.19%
[3.5.3] - 2026-04-07
Security
- Vulnerabilities: Fully remediated 12 High-Severity CodeQL vulnerabilities by migrating from Math.random to
crypto.randomUUID(), wrapping SSE injection points with aggressive backslash escaping, sanitizing trailing HTTP fragments, and enforcing rigid SSRF HTTP verification schemes across internal routes. - Dependencies: Upgraded Next.js to
^16.2.2and Vite to>=8.0.5resolving critical DoS, arbitrary file reads and CSRF vectors in the build/server environments.
Fixed
- E2E Stability: Eliminated extreme CI unreliability and transient test timeouts (Playwright) by propagating internal standalone
_next/staticassets properly and refactoring deep UI interactions inside defensiveexpect().toPass()loops. - Middleware: Resolved infinite redirect loop on dashboard for fresh instances when requireLogin is disabled.
- Core Fallbacks: Preserved primary failure contexts and enhanced Edge-case error handling pipelines across chat and fallback loops.
- Proxy/Hooks: Optimized local git hooks, normalized token coverage endpoints into
/coverage, and guarded GLM region lookups.
🛠️ Maintenance
- CI/CD Stabilization: Prevented random GitHub Runner freezes by decoupling sharded processes, adjusting test concurrencies, unref-ing active connections on server teardown, and strictly capping job timeout durations.
Documentation
- I18n Engine: Synchronized and pushed deep Machine Translation updates across all 32 natively-supported languages (682 translation nodes aligned).
Coverage
- Testing: Consolidated the workspace test coverage framework hitting 92.1% statement line coverage, with new rigid unit-tests matching API key policies and tool scopes.
[3.5.2] — 2026-04-05
✨ New Features
- Qoder API Native Integration: Completely refactored the Qoder Executor to bypass the legacy COSY AES/RSA encryption algorithm, routing directly into the native DashScope OpenAi-compatible URL. Eliminates complex dependencies on Node
cryptomodules while improving stream fidelity. - Resilience Engine Overhaul: Integrated context overflow graceful fallbacks, proactive OAuth token detection, and empty-content emission prevention (#990).
- Context-Optimized Routing Strategy: Added new intelligent routing capability to natively maximize context windows in automated combo deployments (#990).
🐛 Bug Fixes
- Responses API Stream Corruption: Fixed deep-cloning corruption where Anthropic/OpenAI translation boundaries stripped
response.specific SSE prefixes from streaming boundaries (#992). - Claude Cache Passthrough Alignment: Aligned CC-Compatible cache markers consistently with upstream Client Pass-Through mode preserving prompt caching.
- Turbopack Memory Leak: Pinned Next.js to strict
16.0.10preventing memory leaks and build staleness from recent upstream Turbopack hashed module regressions (#987).
[3.5.1] — 2026-04-04
✨ New Features
- Models.dev Integration: Integrated models.dev as the authoritative runtime source for model pricing, capabilities, and specifications, overriding hardcoded prices. Includes a settings UI to manage sync intervals, translation strings for all 30 languages, and robust test coverage.
- Provider Native Capabilities: Added support for declaring and checking native API features (e.g.
systemInstructions_supported) preventing failures by sanitizing invalid roles. Currently configured for Gemini Base and Antigravity OAuth providers. - API Provider Advanced Settings: Added per-connection custom
User-Agentoverrides for API-key provider connections. The override is stored inproviderSpecificData.customUserAgentand now applies to validation probes and upstream execution requests.
🐛 Bug Fixes
- Qwen OAuth Reliability: Resolved a series of OAuth integration issues including a 400 Bad Request blocker on expired tokens, fallback generation for parsing OIDC
access_tokenproperties whenid_tokenis omitted, model catalog discovery errors, and strict filtering ofX-Dashscope-*headers to avoid 400 rejection from OpenAI-compatible endpoints.
[3.5.0] — 2026-04-03
✨ New Features
- Auto-Combo & Routing: Completed native CRUD lifecycle integration for the advanced Auto-Combo engine (#955).
- Core Operations: Fixed missing translations for new native Auto-Combos options (#955).
- Security Validation: Disabled SQLite auto-backup tasks natively during unit test CI execution to explicitly resolve Node 22 Event Loop hanging memory leaks (#956).
- Ecosystem Proxies: Completed explicit integration mapping model synchronization schedulers, OAuth cycles, and Token Check refreshes safely through OmniRoute's native system upstream proxies (#953).
- MCP Extensibility: Added and successfully registered the new
omniroute_web_searchMCP framework tool out of beta into production schemas (#951). - Tokens Buffer Logic: Added runtime configuration limits extending configurable input/output token buffers for precise Usage Tracking metrics (#959).
🐛 Bug Fixes
- CodeQL Remediation: Fully resolved and secured critical string indexing operations preventing Server-Side Request Forgery (SSRF) arrays indexing heuristics alongside polynomial algorithmic backtracking (ReDoS) inside deep proxy dispatcher modules.
- Crypto Hashes: Replaced weak unverified legacy OAuth 1.0 hashes with robust HMAC-SHA-256 standard validation primitives ensuring tight access controls.
- API Boundary Protection: Correctly verified and mapped structural route protections enforcing strict
isAuthenticated()middleware logic covering newer dynamic endpoints targeting settings manipulation and native skills loading. - CLI Ecosystem Compat: Resolved broken native runtime parser bindings crashing
whereenvironment detectors strictly over.cmd/.exeedge cases gracefully for external plugins (#969). - Cache Architecture: Refactored exact Analytics and System Settings dashboard parameters layout structure caching to maintain stable re-hydration persistence cycles resolving visual unaligned state flashes (#952).
- Claude Caching Standards: Normalized and accurately strictly preserved critical ephemeral block markers
ephemeralcaching TTL orders for downstream nodes enforcing standard compatible CC requests mapping cleanly without dropped metrics (#948). - Internal Aliases Auth: Simplified internal runtime mappings normalizing Codex credential payload lookups inside global translation parameters resolving 401 unauthenticated drops (#958).
🛠️ Maintenance
- UI Discoverability: Correctly adjusted layout categorizations explicitly separating free tier providers logic improving UX sorting flows inside the general API registry pages (#950).
- Deployment Topology: Unified Docker deployment artifacts ensuring the root
fly.tomlmatches expected cloud instance parameters out-of-the-box natively handling automated deployments scaling properly. - Development Tooling: Decoupled
LKGPruntime parameters into explicit DB layer abstraction caching utilities ensuring strict test isolation coverage for core caching layers safely.
[3.4.9] — 2026-04-03
Features & Refactoring
- Dashboard Auto-Combo Panel: Completely refactored the
/dashboard/auto-comboUI to seamlessly integrate with native Dashboard Cards and standardized visual padding/headers. Added dynamic visual progress bars mapping model selection weight mechanisms. - Settings Routing Sync: Fully exposed advanced routing
priorityandweightedschema targets internally inside global settings fallback lists.
Bug Fixes
- Memory & Skills Locale Nodes: Resolved empty rendering tags for Memory and Skills options directly inside global settings views by wiring all
settings.*mapping values internally intoen.json(also mapped implicitly for cross-translation tools).
Internal Integrations
- Integrated PR #946 — fix: preserve Claude Code compatibility in responses conversion
- Integrated PR #944 — fix(gemini): preserve thought signatures across antigravity tool calls
- Integrated PR #943 — fix: restore GitHub Copilot body
- Integrated PR #942 — Fix cc-compatible cache markers
- Integrated PR #941 — refactor(auth): improve NVIDIA alias lookup + add LKGP error logging
- Integrated PR #939 — Restore Claude OAuth localhost callback handling
- (Note: PR #934 was omitted from 3.4.9 cycle to prevent core conflict regressions)
[3.4.8] — 2026-04-03
Security
- Fully remediated all outstanding Github Advanced Security (CodeQL) findings and Dependabot alerts.
- Fixed insecure randomness vulnerabilities by migrating from
Math.randomtocrypto.randomUUID(). - Secured shell commands in automated scripts from string injection.
- Migrated vulnerable catastrophic backtracking RegEx parsing patterns in chat/translation pipelines.
- Enhanced output sanitization controls inside React UI components and Server Sent Events (SSE) tag injection.
[3.4.7] — 2026-04-03
Features
- Added
Cryptographynode to Monitoring and MCP health checks (#798) - Hardened model-catalog route permissions mapping (
/models) (#781)
Bug Fixes
- Fixed Claude OAuth token refreshes failing to preserve cache contexts (#937)
- Fixed CC-Compatible provider errors rendering cached models unreachable (#937)
- Fixed GitHub Executor errors related to invalid context arrays (#937)
- Fixed NPM-installed CLI tools healthcheck failures on Windows (#935)
- Fixed payload translation dropping valid content due to invalid API fields (#927)
- Fixed runtime crash in Node 25 regarding API key execution (#867)
- Fixed MCP standalone module-resolution (
ERR_MODULE_NOT_FOUND) viaesbuild(#936) - Fixed NVIDIA NIM routing credential resolution alias mismatch (#931)
Security
- Added safe strict input boundary protection against raw
shell: trueremote-code execution injections.
[3.4.6] - 2026-04-02
✨ New Features
- Providers: Registered new image, video, and audio generation providers from the community-requested list (#926).
- Dashboard UI: Added standalone sidebar navigation for the new Memory and Skills modules (#926).
- i18n: Added translation strings and layout mappings across 30 languages for the Memory and Skills namespaces.
🐛 Bug Fixes
- Resilience: Prevented the proxy Circuit Breaker from becoming stuck in an OPEN state indefinitely by handling direct transitions to CLOSED state inside fallback combo paths (#930).
- Protocol Translation: Patched the streaming transformer to sanitize response blocks based on the expected source protocol rather than the provider target protocol, fixing Anthropics models wrapped in OpenAI payloads crashing Claude Code (#929).
- API Specs & Gemini: Fixed
thought_signatureparsing inopenai-to-geminiandclaude-to-geminitranslators, preventing HTTP 400 errors across all Gemini 3 API tool-calls. - Providers: Cleaned up non-OpenAI-compatible endpoints preventing valid upstream connections (#926).
- Cache Trends: Fixed an invalid property mapping data mismatch causing Cache Trends UI charts to crash, and extracted redundant cache metric widgets (#926).
[3.4.5] - 2026-04-02
✨ New Features
- CLIProxyAPI Ecosystem Integration: Added the
cliproxyapiexecutor with built-in module-level caching and proxy routing. Introduced a comprehensive Version Manager service to automatically test health, download binaries from GitHub, spawn isolated background processes, and cleanly manage the lifecycle of external CLI tools directly through the UI. Includes DB tables for proxy configuration to enable automatic SSRF-gated cross-routing of external OpenAI requests via the local CLI tool layer (#914, #915, #916). - Qoder PAT Support: Integrated Personal Access Tokens (PAT) support directly via the local
qoderclitransport instead of legacy remote.cnbrowser configurations (#913). - Gemini 3.1 Pro Preview (GitHub): Added
gemini-3.1-pro-previewcanonical explicit model support natively into the GitHub Copilot provider while preserving older routing aliases (#924).
🐛 Bug Fixes
- GitHub Copilot Token Stability: Repaired the Copilot token refresh loop where stale tokens weren't deep-merged into DB, and removed
reasoning_textfields that were fatally breaking downstream Anthropic block conversions for multi-turn chats (#923). - Global Timeout Matrix: Centralized and parameterized request timeouts explicitly from
REQUEST_TIMEOUT_MSto prevent hidden (~300s) default fetch buffers prematurely cutting off long-lived SSE streaming responses from heavy reasoning models (#918). - Cloudflare Quick Tunnels State: Fixed a severe state inconsistency where restarted OmniRoute instances erroneously showed destroyed tunnels as active, and defaulted cloudflared tunneling to
HTTP/2to eliminate UDP receive buffer log spam (#925). - i18n Translation Overhaul (Czech & Hindi): Fixed Hindi code from DEPRECATED
in.jsonto canonicalhi.json, overhauled Czech text mappings, extracteduntranslatable-keys.jsonto fix CI/CD false-positive validations, and generated comprehensiveI18N.mddocs to guide translators (#912). - Tokens Provider Recovery: Fixed Qwen losing specific
resourceUrlendpoints after automatic health-check token refreshes because of missing DB deep merges (#917). - CC Compatible UX & Streaming: Unified the Add CC/OpenAI/Anthropic compatible actions around the Anthropic UI treatment, forced CC-compatible upstream requests to use SSE while still returning streaming or non-streaming responses based on the client request, removed CC model-list configuration/import support in favor of an explicit unsupported-model-listing error, and made CC-compatible Available Models mirror the OAuth Claude Code registry list (#921).
[3.4.4] - 2026-04-02
🐛 Bug Fixes
- Responses API Token Reporting: Emit
response.completedwith correctinput_tokens/output_tokensfields for Codex CLI clients, fixing token usage display (#909 — thanks @christopher-s). - SQLite WAL Checkpoint on Shutdown: Flush WAL changes into the primary database file during graceful shutdown/restart, preventing data loss on Docker container stops (#905 — thanks @rdself).
- Graceful Shutdown Signal: Changed
/api/restartand/api/shutdownroutes fromprocess.exit(0)toprocess.kill(SIGTERM), ensuring the shutdown handler runs before exit. - Docker Stop Grace Period: Added
stop_grace_period: 40sto Docker Compose files and--stop-timeout 40to Docker run examples.
🛠️ Maintenance
- Closed 5 resolved/not-a-bug issues (#872, #814, #816, #890, #877).
- Triaged 6 issues with needs-info requests (#892, #887, #886, #865, #895, #870).
- Responded to CLI detection tracking issue (#863) with contributor guidance.
[3.4.3] - 2026-04-02
✨ New Features
- Antigravity Memory & Skills: Completed remote memory and skills injection for the Antigravity provider at the proxy network level.
- Claude Code Compatibility: Built a natively hidden compatibility bridge for Claude Code, passing tools and formatting through cleanly.
- Web Search MCP: Added the
omniroute_web_searchtool with theexecute:searchscope. - Cache Components: Implemented dynamic cache components utilizing TDD.
- UI & Customization: Added custom favicon support, appearance tabs, wired whitelabeling to the sidebar, and added Windsurf guide steps across all 33 languages.
- Log Retention: Unified request log retention and artifacts natively.
- Model Enhancements: Added explicit
contextLengthfor all opencode-zen models. - i18n & translations: Integrated 33 language translations natively, including placeholder CI validations and Chinese documentation updates (#873, #869).
🐛 Bug Fixes
- Qwen OAuth Mapping: Reverted
id_tokenreliance toaccess_tokenand enabled dynamicresource_urlAPI endpoint injection for proper regional routing (#900). - Model Sync Engine: Stored the strict internal Provider ID in
getCustomModels()sync routines instead of the UI Channel Alias format, preventing SQLite catalog insertion failures (#903). - Claude Code & Codex: Standardized non-streaming blank responses to Anthropic-formatted
(empty response)to prevent CLI proxy crashes (#866). - CC Compatible Routing: Resolved duplicate
/v1endpoint collision during path concatenation for generic Claude Code gateways (#904). - Antigravity Dashboards: Blocked unlimited quota models from falsely registering as exhausted
100% Usagelimit states in the Provider Usage UI (#857). - Claude Image Passthrough: Fixed Claude models missing image block passthroughs (#898).
- Gemini CLI Routing: Resolved 403 authorization lockouts and content accumulation issues by refreshing the project ID via
loadCodeAssist(#868). - Antigravity Stability: Corrected model access lists, enforced 404 lockouts, fixed 429 cascades locking out standard connections, and capped
gemini-3.1-prooutput tokens (#885). - Provider Sync Cadence: Repaired the provider limits synchronization cadence via the internal scheduler (#888).
- Dashboard Optimization: Resolved
/dashboard/limitsUI freezing when processing 70+ accounts via chunk parallelization (#784). - SSRF Hardening: Enforced strict SSRF IP range filtering and blocked the
::1loopback interface. - MIME Types: Standardized
mime_typeto snake_case to match Gemini API specifications. - CI Stabilization: Fixed failing analytics/settings Playwright selectors and request assertions so GitHub Actions E2E runs pass reliably across localized UIs and switch-based controls.
- Deterministic Tests: Removed date-sensitive quota fixtures from Copilot usage tests and aligned idempotency/model catalog tests with the merged runtime behavior.
- MCP Type Hardening: Removed zero-budget explicit
anyregressions from the MCP server tool registration path. - Model Sync Engine: Bypassed destructive
replaceoverrides when the provider's auto-sync yields an empty model list, maintaining stability for dynamic catalogs (#899).
🛠️ Maintenance
- Pipeline Logging: Refined pipeline logging artifacts and enforce retention caps (#880).
- AGENTS.md Overhaul: Condensed from 297→153 lines. Added build/test/style guidelines, code workflows (Prettier, TypeScript, ESLint), and trimmed verbose tables (#882).
- Release Branch Integration: Consolidated the active feature branches into
release/v3.4.2on top of currentmainand validated the branch with lint, unit, coverage, build, and CI-mode E2E runs. - Testing: Added vitest configuration for component testing and Playwright specs for settings toggles.
- Doc Updates: Expanded root readmes, translated chinese documents natively, and cleaned up obsolete files.
[3.4.1] - 2026-03-31
Warning
BREAKING CHANGE: request logging, retention, and logging environment variables have been redesigned. On the first startup after upgrading, OmniRoute archives legacy request logs from
DATA_DIR/logs/, legacyDATA_DIR/call_logs/, andDATA_DIR/log.txtintoDATA_DIR/log_archives/*.zip, then removes the deprecated layout and switches to the new unified artifact format underDATA_DIR/call_logs/.
✨ New Features
- .ENV Migration Utility: Included
scripts/migrate-env.mjsto seamlessly migrate<v3.3configurations tov3.4.xstrict security validation constraints (FASE-01), repairing startup crashes caused by shortJWT_SECRETinstances. - Kiro AI Cache Optimization: Implemented deterministic
conversationIdgeneration (uuidv5) to enable AWS Builder ID Prompt Caching properly across invocations (#814). - Dashboard UI Restoration & Consolidation: Resolved sidebar logic omitting the Debug section, and cleared Nextjs routing warnings by moving standalone
/dashboard/mcpand/dashboard/a2apages explicitly into embedded Endpoint Proxy UI components. - Unified Request Log Artifacts: Request logging now stores one SQLite index row plus one JSON artifact per request under
DATA_DIR/call_logs/, with optional pipeline capture embedded in the same file. - Language: Improved the Chinese translation (#855)
- Opencode-Zen Models: Added 4 free models to opencode-zen registry (#854)
- Tests: Added unit and E2E tests for settings toggles and bug fixes (#850)
🐛 Bug Fixes
- 429 Quota Parsing: Parsed long quota reset times from error bodies to honor correct backoffs and prevent rate-limited account bans (#859)
- Prompt Caching: Preserved client
cache_controlheaders for all Claude-protocol providers (like Minimax, GLM, and Bailian), correctly recognizing caching support (#856) - Model Sync Logs: Reduced log spam by recording
sync-modelsonly when the channel actually modifies the list (#853) - Provider Quota & Token Parsing: Switched Antigravity limits to use
retrieveUserQuotanatively and correctly mapped Claude token refresh payloads to URL-encoded forms (#862) - Rate-Limiting Stability: Universalized the 429 Retry-After parsing architecture to cap provider-induced cooldowns at 24 hours max (#862)
- Dashboard Limit Rendering: Re-architected
/dashboard/limitsquota mapping to render immediately inside chunks, fixing a major UI freezing delay on accounts exceeding 70 active connections (#784) - QWEN OAuth Authorization: Mapped the OIDC
id_tokenas the primary API Bearer token for Dashscope requests, fixing immediate 401 Unauthorized errors after connecting accounts or refreshing tokens (#864) - ZAI API Stability: Hardened Server-Sent Events compiler to gracefully fallback to empty strings when DeepSeek providers stream mathematically null content during reasoning phases (#871)
- Claude Code/Codex Translations: Protected non-streaming payload conversions against empty responses from upstream Codex tools, avoiding catastrophic TypeErrors (#866)
- NVIDIA NIM Rendering: Conditionally stripped identical provider prefixes dynamically pushed by audio models, eliminating duplicate
nim/nimtag structures throwing 404 on the Media Playground (#872)
⚠️ Breaking Changes
- Request Log Layout: Removed the old multi-file
DATA_DIR/logs/request log sessions and theDATA_DIR/log.txtsummary file. New requests are written as single JSON artifacts inDATA_DIR/call_logs/YYYY-MM-DD/. - Logging Environment Variables: Replaced
LOG_*,ENABLE_REQUEST_LOGS,CALL_LOGS_MAX,CALL_LOG_PAYLOAD_MODE, andPROXY_LOG_MAX_ENTRIESwith the newAPP_LOG_*andCALL_LOG_RETENTION_DAYSconfiguration model. - Pipeline Toggle Setting: Replaced the legacy
detailed_logs_enabledsetting withcall_log_pipeline_enabled. New pipeline details are embedded inside the request artifact instead of being stored as separaterequest_detail_logsrecords.
🛠️ Maintenance
- Legacy Request Log Upgrade Backup: Upgrades now archive old
data/logs/, legacydata/call_logs/, anddata/log.txtlayouts intoDATA_DIR/log_archives/*.zipbefore removing the deprecated structure. - Streaming Usage Persistence: Streaming requests now write a single
usage_historyrow on completion instead of emitting a duplicate in-progress usage row with empty status metadata. - Logging Follow-up Cleanup: Pipeline logs no longer capture
SOURCE REQUEST, request artifact entries now honorCALL_LOG_MAX_ENTRIES, and application log archives now honorAPP_LOG_MAX_FILES.
[3.4.0] - 2026-03-31
🚀 Features
- Subscription Utilization Analytics: Added quota snapshot time-series tracking, Provider Utilization and Combo Health tabs with recharts visualizations, and corresponding API endpoints (#847)
- SQLite Backup Control: New
OMNIROUTE_DISABLE_AUTO_BACKUPenv flag to disable automatic SQLite backups (#846) - Model Registry Update: Injected
gpt-5.4-miniinto the Codex provider's array of models (#756) - Provider Limit Tracking: Track and display when provider rate limits were last refreshed per account (#843)
🐛 Bug Fixes
- Qwen Auth Routing: Re-routed Qwen OAuth completions from the DashScope API to the Web Inference API (
chat.qwen.ai), resolving authorization failures (#844, #807, #832) - Qwen Auto-Retry Loop: Added targeted 429 Quota Exceeded backoff handling inside
chatCoreprotecting burst requests - Codex OAuth Fallback: Modern browser popup blocking no longer traps the user; it automatically falls back to manual URL entry (#808)
- Claude Token Refresh: Anthropic's strict
application/jsonboundaries are now respected during token generation instead of encoded URLs (#836) - Codex Messages Schema: Stripped purist
messagesinjects from native passthrough requests to avoid structural rejections from the ChatGPT upstream (#806) - CLI Detection Size Limit: Safely bumped the Node binary scanning upper bound from 100MB to 350MB, allowing heavy standalone tools like Claude Code (229MB) and OpenCode (153MB) to be correctly detected by the VPS runtime (#809)
- CLI Runtime Environment: Restored ability for CLI configurations to respect user override paths (
CLI_{PROVIDER}_BIN) bypassing strict path-bound discovery rules - Nvidia Header Conflicts: Removed
prompt_cache_keyproperties from upstream headers when calling non-Anthropic providers (#848) - Codex Fast Tier Toggle: Restored Codex service tier toggle contrast in light mode (#842)
- Test Infrastructure: Updated
t28-model-catalog-updatestest that incorrectly expected the outdated DashScope endpoint for the Qwen native registry
[3.3.9] - 2026-03-31
🐛 Bug Fixes
- Custom Provider Rotation: Integrated
getRotatingApiKeyinternally inside DefaultExecutor, ensuringextraApiKeysrotation triggers correctly for custom and compatible upstream providers (#815)
[3.3.8] - 2026-03-30
🚀 Features
- Models API Filtering: Endpoint
/v1/modelsnow dynamically filters its list based on the permissions tied to theAuthorization: Bearer <token>when restricted access is on (#781) - Qoder Integration: Native integration for Qoder AI natively replacing the legacy iFlow platform mappings (#660)
- Prompt Cache Tracking: Added tracking capabilities and frontend visualization (Stats card) for semantic and prompt caching in the Dashboard UI
🐛 Bug Fixes
- Cache Dashboard Sizing: Improved the UI layout sizes and context headers for the advanced cache pages (#835)
- Debug Sidebar Visibility: Fixed an issue where the debug toggle wouldn't correctly show/hide sidebar debug details (#834)
- Gemini Model Prefixing: Modified the namespace fallback to properly route via
gemini-cli/instead ofgc/to respect upstream specs (#831) - OpenRouter Sync: Improved compatibility synchronization to automatically ingest the available models catalog correctly from OpenRouter (#830)
- Streaming Payloads Mapping: Reserialization of reasoning fields natively resolves conflict alias paths when output is streaming to edge devices
[3.3.7] - 2026-03-30
🐛 Bug Fixes
- OpenCode Config: Restructured generated
opencode.jsonto use the@ai-sdk/openai-compatiblerecord-based schema withoptionsandmodelsas object maps instead of flat arrays, fixing config validation failures (#816) - i18n Missing Keys: Added missing
cloudflaredUrlNoticetranslation key across all 30 language files to preventMISSING_MESSAGEconsole errors in the Endpoint page (#823)
[3.3.6] - 2026-03-30
🐛 Bug Fixes
- Token Accounting: Included prompt cache tokens safely in historical usage inputs calculations for correct quota deductions (PR #822)
- Combo Test Probes: Fixed combo testing logic false negatives by resolving parsing for reasoning-only responses and enabled massive parallelization via Promise.all (PR #828)
- Docker Quick Tunnels: Embedded required ca-certificates inside the base runtime container to resolve Cloudflared TLS startup failures, and surfaced stdout network errors replacing generic exit codes (PR #829)
[3.3.5] - 2026-03-30
✨ New Features
- Gemini Quota Tracking: Added real-time Gemini CLI quota tracking via the
retrieveUserQuotaAPI (PR #825) - Cache Dashboard: Enhanced the Cache Dashboard to display prompt cache metrics, 24h trends, and estimated cost savings (PR #824)
🐛 Bug Fixes
- User Experience: Removed invasive auto-opening OAuth modal loops on barren provider detailed pages (PR #820)
- Dependency Updates: Bumped and locked down dependencies for development and production trees including Next.js 16.2.1, Recharts, and TailwindCSS 4.2.2 (PR #826, #827)
[3.3.4] - 2026-03-30
✨ New Features
- A2A Workflows: Added deterministic FSM orchestrator for multi-step agent workflows.
- Graceful Degradation: Added a new multi-layer fallback framework to preserve core functionality during partial system outages.
- Config Audit: Added an audit trail with diff detection to track changes and enable configuration rollbacks.
- Provider Health: Added provider expiration tracking with proactive UI alerts for expiring API keys.
- Adaptive Routing: Added an adaptive volume and complexity detector to override routing strategies dynamically based on load.
- Provider Diversity: Implemented provider diversity scoring via Shannon entropy to improve load distribution.
- Auto-Disable Bounds: Added an Auto-Disable Banned Accounts setting toggle to the Resilience dashboard.
🐛 Bug Fixes
- Codex & Claude Compatibility: Fixed UI fallbacks, patched Codex non-streaming integration issues, and resolved CLI runtime detection on Windows.
- Release Automation: Expanded permissions required for the Electron App build in GitHub Actions.
- Cloudflare Runtime: Addressed correct runtime isolation exit codes for Cloudflared tunnel components.
🧪 Tests
- Test Suite Updates: Expanded test coverage for volume detectors, provider diversity, configuration audit, and FSM.
[3.3.3] - 2026-03-29
🐛 Bug Fixes
- CI/CD Reliability: Patched GitHub Actions to stable dependency versions (
actions/checkout@v4,actions/upload-artifact@v4) to mitigate unannounced builder environment deprecations. - Image Fallbacks: Replaced arbitrary fallback chains in
ProviderIcon.tsxwith explicit asset validation to prevent UI loading<Image>components for files that don't exist, eliminating404errors in dashboard console logs (#745). - Admin Updater: Dynamic source-installation detection for the dashboard Updater. Safely disables the
Update Nowbutton when OmniRoute is built locally rather than through npm, prompting forgit pull(#743). - Update ERESOLVE Error: Injected
package.jsonoverrides forreact/react-domand enabled--legacy-peer-depswithin the internal automatic updater scripts to resolve breaking dependency tree conflicts with@lobehub/ui.
[3.3.2] - 2026-03-29
✨ New Features
- Cloudflare Tunnels: Cloudflare Quick Tunnel integration with dashboard controls (PR #772).
- Diagnostics: Semantic cache bypass for combo live tests (PR #773).
🐛 Bug Fixes
- Streaming Stability: Apply
FETCH_TIMEOUT_MSto streaming requests' initialfetch()call to prevent 300s Node.js TCP timeout causing silent task failures (#769). - i18n: Add missing
windsurfandcopilotentries totoolDescriptionsacross all 33 locale files (#748). - GLM Coding Audit: Complete provider audit fixing ReDoS vulnerabilities, context window sizing (128k/16k), and model registry syncing (PR #778).
[3.3.1] - 2026-03-29
🐛 Bug Fixes
- OpenAI Codex: Fallback processing fix for
type: "text"elements carrying null or empty datasets that caused 400 rejection (#742). - Opencode: Update schema alignment to singular
providerto match official spec (#774). - Gemini CLI: Inject missing end-user quota headers preventing 403 authorization lockouts (#775).
- DB Recovery: Refactor multipart payload imports into raw binary buffered arrays to bypass reverse proxy max body limits (#770).
[3.3.0] - 2026-03-29
✨ Enhancements & Refactoring
- Release Stabilization — Finalized v3.2.9 release (combo diagnostics, quality gates, Gemini tool fix) and created missing git tag. Consolidated all staged changes into a single atomic release commit.
🐛 Bug Fixes
- Auto-Update Test — Fixed
buildDockerComposeUpdateScripttest assertion to match unexpanded shell variable references ($TARGET_TAG,${TARGET_TAG#v}) in the generated deploy script, aligning with the refactored template from v3.2.8. - Circuit Breaker Test — Hardened
combo-circuit-breaker.test.mjsby injectingmaxRetries: 0to prevent retry inflation from skewing failure count assertions during breaker state transitions.
[3.2.9] - 2026-03-29
✨ Enhancements & Refactoring
- Combo Diagnostics — Introduced a live test bypass flag (
forceLiveComboTest) allowing administrators to execute real upstream health checks that bypass all local circuit-breaker and cooldown state mechanisms, enabling precise diagnostics during rolling outages (PR #759) - Quality Gates — Added automated response quality validation for combos and officially integrated
claude-4.6model support into the core routing schemas (PR #762)
🐛 Bug Fixes
- Tool Definition Validation — Repaired Gemini API integration by normalizing enum types inside tool definitions, preventing upstream HTTP 400 parameter errors (PR #760)
[3.2.8] - 2026-03-29
✨ Enhancements & Refactoring
- Docker Auto-Update UI — Integrated a detached background update process for Docker Compose deployments. The Dashboard UI now seamlessly tracks update lifecycle events combining JSON REST responses with SSE streaming progress overlays for robust cross-environment reliability.
- Cache Analytics — Repaired zero-metrics visualization mapping by migrating Semantic Cache telemetry logs directly into the centralized tracking SQLite module.
🐛 Bug Fixes
- Authentication Logic — Fixed a bug where saving dashboard settings or adding models failed with a 401 Unauthorized error when
requireLoginwas disabled. API endpoints now correctly evaluate the global authentication toggle. Resolved global redirection by reactivatingsrc/middleware.ts. - CLI Tool Detection (Windows) — Prevented fatal initialization exceptions during CLI environment detection by catching
cross-spawnENOENT errors correctly. Adds explicit detection paths for\AppData\Local\droid\droid.exe. - Codex Native Passthrough — Normalized model translation parameters preventing context poisoning in proxy pass-through mode, enforcing generic
store: falseconstraints explicitly for all Codex-originated requests. - SSE Token Reporting — Normalized provider tool-call chunk
finish_reasondetection, fixing 0% Usage analytics for stream-only responses missing strict<DONE>indicators. - DeepSeek Tags — Implemented an explicit
<think>extraction mapping insideresponsesHandler.ts, ensuring DeepSeek reasoning streams map equivalently to native Anthropic<thinking>structures.
[3.2.7] - 2026-03-29
Fixed
- Seamless UI Updates: The "Update Now" feature on the Dashboard now provides live, transparent feedback using Server-Sent Events (SSE). It performs package installation, native module rebuilds (better-sqlite3), and PM2 restarts reliably while showing real-time loaders instead of silently hanging.
[3.2.6] — 2026-03-29
✨ Enhancements & Refactoring
- API Key Reveal (#740) — Added a scoped API key copy flow in the Api Manager, protected by the
ALLOW_API_KEY_REVEALenvironment variable. - Sidebar Visibility Controls (#739) — Admins can now hide any sidebar navigation link via the Appearance settings to reduce visual clutter.
- Strict Combo Testing (#735) — Hardened the combo health check endpoint to require live text responses from models instead of just soft reachability signals.
- Streamed Detailed Logs (#734) — Switched detailed request logging for SSE streams to reconstruct the final payload, saving immense amounts of SQLite database size and significantly cleaning up the UI.
🐛 Bug Fixes
- OpenCode Go MiniMax Auth (#733) — Corrected the authentication header logic for
minimaxmodels on OpenCode Go to usex-api-keyinstead of standard bearer tokens across the/messagesprotocol.
[3.2.5] — 2026-03-29
✨ Enhancements & Refactoring
- Void Linux Deployment Support (#732) — Integrated
xbps-srcpackaging template and instructions to natively compile and install OmniRoute withbetter-sqlite3bindings via cross-compilation target.
[3.2.4] — 2026-03-29
✨ Enhancements & Refactoring
- Qoder AI Migration (#660) — Completely migrated the legacy
iFlowcore provider ontoQoder AImaintaining stable API routing capabilities.
🐛 Bug Fixes
- Gemini Tools HTTP 400 Payload Invalid Argument (#731) — Prevented
thoughtSignaturearray injections inside standard GeminifunctionCallsequences blocking agentic routing flows.
[3.2.3] — 2026-03-29
✨ Enhancements & Refactoring
- Provider Limits Quota UI (#728) — Normalized quota limit logic and data labeling inside the Limits interface.
🐛 Bug Fixes
- Core Routing Schemas & Leaks — Expanded
comboStrategySchemato natively supportfill-firstandp2cstrategies to unblock complex combo editing natively. - Thinking Tags Extraction (CLI) — Restructured CLI token responses sanitizer RegEx capturing model reasoning structures inside streams avoiding broken
<thinking>extractions breaking response text output format. - Strict Format Enforcements — Hardened pipeline sanitization execution making it universally apply to translation mode targets.
[3.2.2] — 2026-03-29
✨ New Features
- Four-Stage Request Log Pipeline (#705) — Refactored log persistence to save comprehensive payloads at four distinct pipeline stages: Client Request, Translated Provider Request, Provider Response, and Translated Client Response. Introduced
streamPayloadCollectorfor robust SSE stream truncation and payload serialization.
🐛 Bug Fixes
- Mobile UI Fixes (#659) — Prevented table components on the dashboard from breaking the layout on narrow viewports by adding proper horizontal scrolling and overflow containment to
DashboardLayout. - Claude Prompt Cache Fixes (#708) — Ensured
cache_controlblocks in Claude-to-Claude fallback loops are faithfully preserved and passed safely back to Anthropic models. - Gemini Tool Definitions (#725) — Fixed schema translation errors when declaring simple
objectparameter types for Gemini function calling.
[3.2.1] — 2026-03-29
✨ New Features
- Global Fallback Provider (#689) — When all combo models are exhausted (502/503), OmniRoute now attempts a configurable global fallback model before returning the error. Set
globalFallbackModelin settings to enable.
🐛 Bug Fixes
- Fix #721 — Fixed context pinning bypass during tool-call responses. Non-streaming tagging used wrong JSON path (
json.messages→json.choices[0].message). Streaming injection now triggers onfinish_reasonchunks for tool-call-only streams.injectModelTag()now appends synthetic pin messages for non-string content. - Fix #709 — Confirmed already fixed (v3.1.9) —
system-info.mjscreates directories recursively. Closed. - Fix #707 — Confirmed already fixed (v3.1.9) — empty tool name sanitization in
chatCore.ts. Closed.
🧪 Tests
- Added 6 unit tests for context pinning with tool-call responses (null content, array content, roundtrip, re-injection)
[3.2.0] — 2026-03-28
✨ New Features
- Cache Management UI — Added a dedicated semantic caching dashboard at `/dashboard/cache` with targeted API invalidation and 31-language i18n support (PR #701 by @oyi77)
- GLM Quota Tracking — Added real-time usage and session quota tracking for the GLM Coding (Z.AI) provider (PR #698 by @christopher-s)
- Detailed Log Payloads — Wired full four-stage pipeline payload capturing (original, translated, provider-response, streamed-deltas) directly into the UI (PR #705 by @rdself)
🐛 Bug Fixes
- Fix #708 — Prevented token bleeding for Claude Code users routing through OmniRoute by correctly preserving native `cache_control` headers during Claude-to-Claude passthrough (PR #708 by @tombii)
- Fix #719 — Setup internal auth boundaries for `ModelSyncScheduler` to prevent unauthenticated daemon failures on startup (PR #719 by @rdself)
- Fix #718 — Rebuilt badge rendering in Provider Limits UI preventing bad quota boundaries overlap (PR #718 by @rdself)
- Fix #704 — Fixed Combo Fallbacks breaking on HTTP 400 content-policy errors preventing model-rotation dead-routing (PR #704 by @rdself)
🔒 Security & Dependencies
- Bumped `path-to-regexp` to `8.4.0` resolving dependabot vulnerabilities (PR #715)
[3.1.10] — 2026-03-28
🐛 Bug Fixes
- Fix #706 — Fixed icon fallback rendering caused by Tailwind V4
font-sansoverride by applying!importantto.material-symbols-outlined. - Fix #703 — Fixed GitHub Copilot broken streams by enabling
responsestoopenaiformat translation for any custom models leveragingapiFormat: "responses". - Fix #702 — Replaced flat-rate usage tracking with accurate DB pricing calculations for both streaming and non-streaming responses.
- Fix #716 — Cleaned up Claude tool-call translation state, correctly parsing streaming arguments and preventing OpenAI
tool_callschunks from repeating theidfield.
[3.1.9] — 2026-03-28
✨ New Features
- Schema Coercion — Auto-coerce string-encoded numeric JSON Schema constraints (e.g.
"minimum": "1") to proper types, preventing 400 errors from Cursor, Cline, and other clients sending malformed tool schemas. - Tool Description Sanitization — Ensure tool descriptions are always strings; converts
null,undefined, or numeric descriptions to empty strings before sending to providers. - Clear All Models Button — Added i18n translations for the "Clear All Models" provider action across all 30 languages.
- Codex Auth Export — Added Codex
auth.jsonexport and apply-local buttons for seamless CLI integration. - Windsurf BYOK Notes — Added official limitation warnings to the Windsurf CLI tool card documenting BYOK constraints.
🐛 Bug Fixes
- Fix #709 —
system-info.mjsno longer crashes when the output directory doesn't exist (addedmkdirSyncwith recursive flag). - Fix #710 — A2A
TaskManagersingleton now usesglobalThisto prevent state leakage across Next.js API route recompilations in dev mode. E2E test suite updated to handle 401 gracefully. - Fix #711 — Added provider-specific
max_tokenscap enforcement for upstream requests. - Fix #605 / #592 — Strip
proxy_prefix from tool names in non-streaming Claude responses; fixed LongCat validation URL. - Call Logs Max Cap — Upgraded
getMaxCallLogs()with caching layer, env var support (CALL_LOGS_MAX), and DB settings integration.
🧪 Tests
- Test suite expanded from 964 → 1027 tests (63 new tests)
- Added
schema-coercion.test.mjs— 9 tests for numeric field coercion and tool description sanitization - Added
t40-opencode-cli-tools-integration.test.mjs— OpenCode/Windsurf CLI integration tests - Enhanced feature-tests branch with comprehensive coverage tooling
📁 New Files
| File | Purpose |
|---|---|
open-sse/translator/helpers/schemaCoercion.ts |
Schema coercion and tool description sanitization utilities |
tests/unit/schema-coercion.test.mjs |
Unit tests for schema coercion |
tests/unit/t40-opencode-cli-tools-integration.test.mjs |
CLI tool integration tests |
COVERAGE_PLAN.md |
Test coverage planning document |
🐛 Bug Fixes
- Claude Prompt Caching Passthrough — Fixed cache_control markers being stripped in Claude passthrough mode (Claude → OmniRoute → Claude), which caused Claude Code users to deplete their Anthropic API quota 5-10x faster than direct connections. OmniRoute now preserves client's cache_control markers when sourceFormat and targetFormat are both Claude, ensuring prompt caching works correctly and dramatically reducing token consumption.
[3.1.8] - 2026-03-27
🐛 Bug Fixes & Features
- Platform Core: Implemented global state handling for Hidden Models & Combos preventing them from cluttering the catalog or leaking into connected MCP agents (#681).
- Stability: Patched streaming crashes related to the native Antigravity provider integration failing due to unhandled undefined state arrays (#684).
- Localization Sync: Deployed a fully overhauled
i18nsynchronizer detecting missing nested JSON properties and retro-fitting 30 locales sequentially (#685).## [3.1.7] - 2026-03-27
🐛 Bug Fixes
- Streaming Stability: Fixed
hasValuableContentreturningundefinedfor empty chunks in SSE streams (#676). - Tool Calling: Fixed an issue in
sseParser.tswhere non-streaming Claude responses with multiple tool calls dropped theidof subsequent tool calls due to incorrect index-based deduplication (#671).
[3.1.6] — 2026-03-27
🐛 Bug Fixes
- Claude Native Tool Name Restoration — Tool names like
TodoWriteare no longer prefixed withproxy_in Claude passthrough responses (both streaming and non-streaming). Includes unit test coverage (PR #663 by @coobabm) - Clear All Models Alias Cleanup — "Clear All Models" button now also removes associated model aliases, preventing ghost models in the UI (PR #664 by @rdself)
[3.1.5] — 2026-03-27
🐛 Bug Fixes
- Backoff Auto-Decay — Rate-limited accounts now auto-recover when their cooldown window expires, fixing a deadlock where high
backoffLevelpermanently deprioritized accounts (PR #657 by @brendandebeasi)
🌍 i18n
- Chinese translation overhaul — Comprehensive rewrite of
zh-CN.jsonwith improved accuracy (PR #658 by @only4copilot)
[3.1.4] — 2026-03-27
🐛 Bug Fixes
- Streaming Override Fix — Explicit
stream: truein request body now takes priority overAccept: application/jsonheader. Clients sending both will correctly receive SSE streaming responses (#656)
🌍 i18n
- Czech string improvements — Refined terminology across
cs.json(PR #655 by @zen0bit)
[3.1.3] — 2026-03-26
🌍 i18n & Community
- ~70 missing translation keys added to
en.jsonand 12 languages (PR #652 by @zen0bit) - Czech documentation updated — CLI-TOOLS, API_REFERENCE, VM_DEPLOYMENT guides (PR #652)
- Translation validation scripts —
check_translations.pyandvalidate_translation.pyfor CI/QA (PR #651 by @zen0bit)
[3.1.2] — 2026-03-26
🐛 Bug Fixes
- Critical: Tool Calling Regression — Fixed
proxy_Basherrors by disabling theproxy_tool name prefix in the Claude passthrough path. Tools likeBash,Read,Writewere being renamed toproxy_Bash,proxy_Read, etc., causing Claude to reject them (#618) - Kiro Account Ban Documentation — Documented as upstream AWS anti-fraud false positive, not an OmniRoute issue (#649)
🧪 Tests
- 936 tests, 0 failures
[3.1.1] — 2026-03-26
✨ New Features
- Vision Capability Metadata: Added
capabilities.vision,input_modalities, andoutput_modalitiesto/v1/modelsentries for vision-capable models (PR #646) - Gemini 3.1 Models: Added
gemini-3.1-pro-previewandgemini-3.1-flash-lite-previewto the Antigravity provider (#645)
🐛 Bug Fixes
- Ollama Cloud 401 Error: Fixed incorrect API base URL — changed from
api.ollama.comto officialollama.com/v1/chat/completions(#643) - Expired Token Retry: Added bounded retry with exponential backoff (5→10→20 min) for expired OAuth connections instead of permanently skipping them (PR #647)
🧪 Tests
- 936 tests, 0 failures
[3.1.0] — 2026-03-26
✨ New Features
- GitHub Issue Templates: Added standardized bug report, feature request, and config/proxy issue templates (#641)
- Clear All Models: Added a "Clear All Models" button to the provider detail page with i18n support in 29 languages (#634)
🐛 Bug Fixes
- Locale Conflict (
in.json): Renamed the Hindi locale file fromin.json(Indonesian ISO code) tohi.jsonto fix translation conflicts in Weblate (#642) - Codex Empty Tool Names: Moved tool name sanitization before the native Codex passthrough, fixing 400 errors from upstream providers when tools had empty names (#637)
- Streaming Newline Artifacts: Added
collapseExcessiveNewlinesto the response sanitizer, collapsing runs of 3+ consecutive newlines from thinking models into a standard double newline (#638) - Claude Reasoning Effort: Converted OpenAI
reasoning_effortparam to Claude's nativethinkingbudget block across all request paths, including automaticmax_tokensadjustment (#627) - Qwen Token Refresh: Implemented proactive pre-expiry OAuth token refreshes (5-minute buffer) to prevent requests from failing when using short-lived tokens (#631)
🧪 Tests
- 936 tests, 0 failures (+10 tests since 3.0.9)
[3.0.9] — 2026-03-26
🐛 Bug Fixes
- NaN tokens in Claude Code / client responses (#617):
sanitizeUsage()now cross-mapsinput_tokens→prompt_tokensandoutput_tokens→completion_tokensbefore the whitelist filter, fixing responses showing NaN/0 token counts when providers return Claude-style usage field names
🔒 Security
- Updated
yamlpackage to fix stack overflow vulnerability (GHSA-48c2-rrv3-qjmp)
📋 Issue Triage
- Closed #613 (Codestral — resolved with Custom Provider workaround)
- Commented on #615 (OpenCode dual-endpoint — workaround provided, tracked as feature request)
- Commented on #618 (tool call visibility — requesting v3.0.9 test)
- Commented on #627 (effort level — already supported)
[3.0.8] — 2026-03-25
🐛 Bug Fixes
- Translation Failures for OpenAI-format Providers in Claude CLI (#632):
- Handle
reasoning_details[]array format from StepFun/OpenRouter — converts toreasoning_content - Handle
reasoningfield alias from some providers → normalized toreasoning_content - Cross-map usage field names:
input_tokens↔prompt_tokens,output_tokens↔completion_tokensinfilterUsageForFormat - Fix
extractUsageto accept bothinput_tokens/output_tokensandprompt_tokens/completion_tokensas valid usage fields - Applied to both streaming (
sanitizeStreamingChunk,openai-to-claude.tstranslator) and non-streaming (sanitizeMessage) paths
- Handle
[3.0.7] — 2026-03-25
🐛 Bug Fixes
- Antigravity Token Refresh: Fixed
client_secret is missingerror for npm-installed users — theclientSecretDefaultwas empty in providerRegistry, causing Google to reject token refresh requests (#588) - OpenCode Zen Models: Added
modelsUrlto the OpenCode Zen registry entry so "Import from /models" works correctly (#612) - Streaming Artifacts: Fixed excessive newlines left in responses after thinking-tag signature stripping (#626)
- Proxy Fallback: Added automatic retry without proxy when SOCKS5 relay fails
- Proxy Test: Test endpoint now resolves real credentials from DB via proxyId
✨ New Features
- Playground Account/Key Selector: Persistent, always-visible dropdown to select specific provider accounts/keys for testing — fetches all connections at startup and filters by selected provider
- CLI Tools Dynamic Models: Model selection now dynamically fetches from
/v1/modelsAPI — providers like Kiro now show their full model catalog - Antigravity Model List: Updated with Claude Sonnet 4.5, Claude Sonnet 4, GPT 5, GPT 5 Mini; enabled
passthroughModelsfor dynamic model access (#628)
🔧 Maintenance
- Merged PR #625 — Provider Limits light mode background fix
[3.0.6] — 2026-03-25
🐛 Bug Fixes
- Limits/Proxy: Fixed Codex limit fetching for accounts behind SOCKS5 proxies — token refresh now runs inside proxy context
- CI: Fixed integration test
v1/modelsassertion failure in CI environments without provider connections - Settings: Proxy test button now shows success/failure results immediately (previously hidden behind health data)
✨ New Features
- Playground: Added Account selector dropdown — test specific connections individually when a provider has multiple accounts
🔧 Maintenance
- Merged PR #623 — LongCat API base URL path correction
[3.0.5] — 2026-03-25
✨ New Features
- Limits UI: Added tag grouping feature to the connections dashboard to improve visual organization for accounts with custom tags.
[3.0.4] — 2026-03-25
🐛 Bug Fixes
- Streaming: Fixed
TextDecoderstate corruption inside combosanitizeTransformStream which caused SSE garbled output matching multibyte characters (PR #614) - Providers UI: Safely render HTML tags inside provider connection error tooltips using
dangerouslySetInnerHTML - Proxy Settings: Added missing
usernameandpasswordpayload body properties allowing authenticated proxies to be successfully verified from the Dashboard. - Provider API: Bound soft exception returns to
getCodexUsagepreventing API HTTP 500 failures when token fetch fails
[3.0.3] — 2026-03-25
✨ New Features
- Auto-Sync Models: Added a UI toggle and
sync-modelsendpoint to automatically synchronise model lists per provider using a scheduled interval scheduler (PR #597)
🐛 Bug Fixes
- Timeouts: Elevated default proxies
FETCH_TIMEOUT_MSandSTREAM_IDLE_TIMEOUT_MSto 10 minutes to properly support deep reasoning models (like o1) without aborting requests (Fixes #609) - CLI Tool Detection: Improved cross-platform detection handling NVM paths, Windows
PATHEXT(preventing.cmdwrappers issue), and custom NPM prefixes (PR #598) - Streaming Logs: Implemented
tool_callsdelta accumulation in streaming response logs so function calls are tracked and persisted accurately in DB (PR #603) - Model Catalog: Removed auth exemption, properly hiding
comfyuiandsdwebuimodels when no provider is explicitly configured (PR #599)
🌐 Translations
- cs: Improved Czech translation strings across the app (PR #601)
[3.0.2] — 2026-03-25
🚀 Enhancements & Features
feat(ui): Connection Tag Grouping
- Added a Tag/Group field to
EditConnectionModal(stored inproviderSpecificData.tag) without requiring DB schema migrations. - Connections in the provider view now dynamically group by tag with visual dividers.
- Untagged connections appear first without a header, followed by tagged groups in alphabetical order.
- The tag grouping automatically applies to the Codex/Copilot/Antigravity Limits section since toggles exist inside connection rows.
🐛 Bug Fixes
fix(ui): Proxy Management UI Stabilization
- Missing badges on connection cards: Fixed by using
resolveProxyForConnection()rather than static mapping. - Test Connection disabled in saved mode: Enabled the Test button by resolving proxy config from the saved list.
- Config Modal freezing: Added
onClose()calls after save/clear to prevent the UI from freezing. - Double usage counting:
ProxyRegistryManagernow loads usage eagerly on mount with deduplication byscope+scopeId. Usage counts were replaced with a Test button displaying IP/latency inline.
fix(translator): function_call prefix stripping
- Repaired an incomplete fix from PR #607 where only
tool_useblocks stripped Claude'sproxy_tool prefix. Now, clients using the OpenAI Responses API format will also correctly receive tool tools without theproxy_prefix.
[3.0.1] — 2026-03-25
🔧 Hotfix Patch — Critical Bug Fixes
Three critical regressions reported by users after the v3.0.0 launch have been resolved.
fix(translator): strip proxy_ prefix in non-streaming Claude responses (#605)
The proxy_ prefix added by Claude OAuth was only stripped from streaming responses. In non-streaming mode, translateNonStreamingResponse had no access to the toolNameMap, causing clients to receive mangled tool names like proxy_read_file instead of read_file.
Fix: Added optional toolNameMap parameter to translateNonStreamingResponse and applied prefix stripping in the Claude tool_use block handler. chatCore.ts now passes the map through.
fix(validation): add LongCat specialty validator to skip /models probe (#592)
LongCat AI does not expose GET /v1/models. The generic validateOpenAICompatibleProvider validator fell through to a chat-completions fallback only if validationModelId was set, which LongCat doesn't configure. This caused provider validation to fail with a misleading error on add/save.
Fix: Added longcat to the specialty validators map, probing /chat/completions directly and treating any non-auth response as a pass.
fix(translator): normalize object tool schemas for Anthropic (#595)
MCP tools (e.g. pencil, computer_use) forward tool definitions with {type:"object"} but without a properties field. Anthropic's API rejects these with: object schema missing properties.
Fix: In openai-to-claude.ts, inject properties: {} as a safe default when type is "object" and properties is absent.
🔀 Community PRs Merged (2)
| PR | Author | Summary |
|---|---|---|
| #589 | @flobo3 | docs(i18n): fix Russian translation for Playground and Testbed |
| #591 | @rdself | fix(ui): improve Provider Limits light mode contrast and plan tier display |
✅ Issues Resolved
#592 #595 #605
🧪 Tests
- 926 tests, 0 failures (unchanged from v3.0.0)
[3.0.0] — 2026-03-24
🎉 OmniRoute v3.0.0 — The Free AI Gateway, Now with 67+ Providers
The biggest release ever. From 36 providers in v2.9.5 to 67+ providers in v3.0.0 — with MCP Server, A2A Protocol, auto-combo engine, Provider Icons, Registered Keys API, 926 tests, and contributions from 12 community members across 10 merged PRs.
Consolidated from v3.0.0-rc.1 through rc.17 (17 release candidates over 3 days of intense development).
🆕 New Providers (+31 since v2.9.5)
| Provider | Alias | Tier | Notes |
|---|---|---|---|
| OpenCode Zen | opencode-zen |
Free | 3 models via opencode.ai/zen/v1 (PR #530 by @kang-heewon) |
| OpenCode Go | opencode-go |
Paid | 4 models via opencode.ai/zen/go/v1 (PR #530 by @kang-heewon) |
| LongCat AI | lc |
Free | 50M tokens/day (Flash-Lite) + 500K/day (Chat/Thinking) during public beta |
| Pollinations AI | pol |
Free | No API key needed — GPT-5, Claude, Gemini, DeepSeek V3, Llama 4 (1 req/15s) |
| Cloudflare Workers AI | cf |
Free | 10K Neurons/day — ~150 LLM responses or 500s Whisper audio, edge inference |
| Scaleway AI | scw |
Free | 1M free tokens for new accounts — EU/GDPR compliant (Paris) |
| AI/ML API | aiml |
Free | $0.025/day free credits — 200+ models via single endpoint |
| Puter AI | pu |
Free | 500+ models (GPT-5, Claude Opus 4, Gemini 3 Pro, Grok 4, DeepSeek V3) |
| Alibaba Cloud (DashScope) | ali |
Paid | International + China endpoints via alicode/alicode-intl |
| Alibaba Coding Plan | bcp |
Paid | Alibaba Model Studio with Anthropic-compatible API |
| Kimi Coding (API Key) | kmca |
Paid | Dedicated API-key-based Kimi access (separate from OAuth) |
| MiniMax Coding | minimax |
Paid | International endpoint |
| MiniMax (China) | minimax-cn |
Paid | China-specific endpoint |
| Z.AI (GLM-5) | zai |
Paid | Zhipu AI next-gen GLM models |
| Vertex AI | vertex |
Paid | Google Cloud — Service Account JSON or OAuth access_token |
| Ollama Cloud | ollamacloud |
Paid | Ollama's hosted API service |
| Synthetic | synthetic |
Paid | Passthrough models gateway |
| Kilo Gateway | kg |
Paid | Passthrough models gateway |
| Perplexity Search | pplx-search |
Paid | Dedicated search-grounded endpoint |
| Serper Search | serper-search |
Paid | Web search API integration |
| Brave Search | brave-search |
Paid | Brave Search API integration |
| Exa Search | exa-search |
Paid | Neural search API integration |
| Tavily Search | tavily-search |
Paid | AI search API integration |
| NanoBanana | nb |
Paid | Image generation API |
| ElevenLabs | el |
Paid | Text-to-speech voice synthesis |
| Cartesia | cartesia |
Paid | Ultra-fast TTS voice synthesis |
| PlayHT | playht |
Paid | Voice cloning and TTS |
| Inworld | inworld |
Paid | AI character voice chat |
| SD WebUI | sdwebui |
Self-hosted | Stable Diffusion local image generation |
| ComfyUI | comfyui |
Self-hosted | ComfyUI local workflow node-based generation |
| GLM Coding | glm |
Paid | BigModel/Zhipu coding-specific endpoint |
Total: 67+ providers (4 Free, 8 OAuth, 55 API Key) + unlimited OpenAI/Anthropic-Compatible custom providers.
✨ Major Features
🔑 Registered Keys Provisioning API (#464)
Auto-generate and issue OmniRoute API keys programmatically with per-provider and per-account quota enforcement.
| Endpoint | Method | Description |
|---|---|---|
/api/v1/registered-keys |
POST |
Issue a new key — raw key returned once only |
/api/v1/registered-keys |
GET |
List registered keys (masked) |
/api/v1/registered-keys/{id} |
GET/DELETE |
Get metadata / Revoke |
/api/v1/quotas/check |
GET |
Pre-validate quota before issuing |
/api/v1/providers/{id}/limits |
GET/PUT |
Configure per-provider issuance limits |
/api/v1/accounts/{id}/limits |
GET/PUT |
Configure per-account issuance limits |
/api/v1/issues/report |
POST |
Report quota events to GitHub Issues |
Security: Keys stored as SHA-256 hashes. Raw key shown once on creation, never retrievable again.
🎨 Provider Icons via @lobehub/icons (#529)
130+ provider logos using @lobehub/icons React components (SVG). Fallback chain: Lobehub SVG → existing PNG → generic icon. Applied across Dashboard, Providers, and Agents pages with standardized ProviderIcon component.
🔄 Model Auto-Sync Scheduler (#488)
Auto-refreshes model lists for connected providers every 24 hours. Runs on server startup. Configurable via MODEL_SYNC_INTERVAL_HOURS.
🔀 Per-Model Combo Routing (#563)
Map model name patterns (glob) to specific combos for automatic routing:
claude-sonnet*→ code-combo,gpt-4o*→ openai-combo,gemini-*→ google-combo- New
model_combo_mappingstable with glob-to-regex matching - Dashboard UI section: "Model Routing Rules" with inline add/edit/toggle/delete
🧭 API Endpoints Dashboard
Interactive catalog, webhooks management, OpenAPI viewer — all in one tabbed page at /dashboard/endpoint.
🔍 Web Search Providers
5 new search provider integrations: Perplexity Search, Serper, Brave Search, Exa, Tavily — enabling grounded AI responses with real-time web data.
📊 Search Analytics
New tab in /dashboard/analytics — provider breakdown, cache hit rate, cost tracking. API: GET /api/v1/search/analytics.
🛡️ Per-API-Key Rate Limits (#452)
max_requests_per_day and max_requests_per_minute columns with in-memory sliding-window enforcement returning HTTP 429.
🎵 Media Playground
Full media generation playground at /dashboard/media: Image Generation, Video, Music, Audio Transcription (2GB upload limit), and Text-to-Speech.
🔒 Security & CI/CD
- CodeQL remediation — Fixed 10+ alerts: 6 polynomial-redos, 1 insecure-randomness (
Math.random()→crypto.randomUUID()), 1 shell-command-injection - Route validation — Zod schemas +
validateBody()on 176/176 API routes — CI enforced - CVE fix — dompurify XSS vulnerability (GHSA-v2wj-7wpq-c8vv) resolved via npm overrides
- Flatted — Bumped 3.3.3 → 3.4.2 (CWE-1321 prototype pollution)
- Docker — Upgraded
docker/setup-buildx-actionv3 → v4
🐛 Bug Fixes (40+)
OAuth & Auth
- #537 — Gemini CLI OAuth: clear actionable error when
GEMINI_OAUTH_CLIENT_SECRETmissing in Docker - #549 — CLI settings routes now resolve real API key from
keyId(not masked strings) - #574 — Login no longer freezes after skipping wizard password setup
- #506 — Cross-platform
machineIdrewritten (Windows REG.exe → macOS ioreg → Linux → hostname fallback)
Providers & Routing
- #536 — LongCat AI: fixed
baseUrlandauthHeader - #535 — Pinned model override:
body.modelcorrectly set topinnedModel - #570 — Unprefixed Claude models now resolve to Anthropic provider
- #585 —
<omniModel>internal tags no longer leak to clients in SSE streaming - #493 — Custom provider model naming no longer mangled by prefix stripping
- #490 — Streaming + context cache protection via
TransformStreaminjection - #511 —
<omniModel>tag injected into first content chunk (not after[DONE])
CLI & Tools
- #527 — Claude Code + Codex loop:
tool_resultblocks now converted to text - #524 — OpenCode config saved correctly (XDG_CONFIG_HOME, TOML format)
- #522 — API Manager: removed misleading "Copy masked key" button
- #546 —
--versionreturningunknownon Windows (PR by @k0valik) - #544 — Secure CLI tool detection via known installation paths (PR by @k0valik)
- #510 — Windows MSYS2/Git-Bash paths normalized automatically
- #492 — CLI detects
mise/nvm-managed Node whenapp/server.jsmissing
Streaming & SSE
- PR #587 — Revert
resolveDataDirimport in responsesTransformer for Cloudflare Workers compat (@k0valik) - PR #495 — Bottleneck 429 infinite wait: drop waiting jobs on rate limit (@xandr0s)
- #483 — Stop trailing
data: nullafter[DONE]signal - #473 — Zombie SSE streams: timeout reduced 300s → 120s for faster fallback
Media & Transcription
- Transcription — Deepgram
video/mp4→audio/mp4MIME mapping, auto language detection, punctuation - TTS —
[object Object]error display fixed for ElevenLabs-style nested errors - Upload limits — Media transcription increased to 2GB (nginx
client_max_body_size 2g+maxDuration=300)
🔧 Infrastructure & Improvements
Sub2api Gap Analysis (T01–T15 + T23–T42)
- T01 —
requested_modelcolumn in call logs (migration 009) - T02 — Strip empty text blocks from nested
tool_result.content - T03 — Parse
x-codex-5h-*/x-codex-7d-*quota headers - T04 —
X-Session-Idheader for external sticky routing - T05 — Rate-limit DB persistence with dedicated API
- T06 — Account deactivated → permanent block (1-year cooldown)
- T07 — X-Forwarded-For IP validation (
extractClientIp()) - T08 — Per-API-key session limits with sliding-window enforcement
- T09 — Codex vs Spark rate-limit scopes (separate pools)
- T10 — Credits exhausted → distinct 1h cooldown fallback
- T11 —
maxreasoning effort → 131072 budget tokens - T12 — MiniMax M2.7 pricing entries
- T13 — Stale quota display fix (reset window awareness)
- T14 — Proxy fast-fail TCP check (≤2s, cached 30s)
- T15 — Array content normalization for Anthropic
- T23 — Intelligent quota reset fallback (header extraction)
- T24 —
503cooldown +406mapping - T25 — Provider validation fallback
- T29 — Vertex AI Service Account JWT auth
- T33 — Thinking level to budget conversion
- T36 —
403vs429error classification - T38 — Centralized model specifications (
modelSpecs.ts) - T39 — Endpoint fallback for
fetchAvailableModels - T41 — Background task auto-redirect to flash models
- T42 — Image generation aspect ratio mapping
Other Improvements
- Per-model upstream custom headers — via configuration UI (PR #575 by @zhangqiang8vip)
- Model context length — configurable in model metadata (PR #578 by @hijak)
- Model prefix stripping — option to remove provider prefix from model names (PR #582 by @jay77721)
- Gemini CLI deprecation — marked deprecated with Google OAuth restriction warning
- YAML parser — replaced custom parser with
js-yamlfor correct OpenAPI spec parsing - ZWS v5 — HMR leak fix (485 DB connections → 1, memory 2.4GB → 195MB)
- Log export — New JSON export button on dashboard with time range dropdown
- Update notification banner — dashboard homepage shows when new versions are available
🌐 i18n & Documentation
- 30 languages at 100% parity — 2,788 missing keys synced
- Czech — Full translation: 22 docs, 2,606 UI strings (PR by @zen0bit)
- Chinese (zh-CN) — Complete retranslation (PR by @only4copilot)
- VM Deployment Guide — Translated to English as source document
- API Reference — Added
/v1/embeddingsand/v1/audio/speechendpoints - Provider count — Updated from 36+/40+/44+ to 67+ across README and all 30 i18n READMEs
🔀 Community PRs Merged (10)
| PR | Author | Summary |
|---|---|---|
| #587 | @k0valik | fix(sse): revert resolveDataDir import for Cloudflare Workers compat |
| #582 | @jay77721 | feat(proxy): model name prefix stripping option |
| #581 | @jay77721 | fix(npm): link electron-release to npm-publish workflow |
| #578 | @hijak | feat: configurable context length in model metadata |
| #575 | @zhangqiang8vip | feat: per-model upstream headers, compat PATCH, chat alignment |
| #562 | @coobabm | fix: MCP session management, Claude passthrough, detectFormat |
| #561 | @zen0bit | fix(i18n): Czech translation corrections |
| #555 | @k0valik | fix(sse): centralized resolveDataDir() for path resolution |
| #546 | @k0valik | fix(cli): --version returning unknown on Windows |
| #544 | @k0valik | fix(cli): secure CLI tool detection via installation paths |
| #542 | @rdself | fix(ui): light mode contrast CSS theme variables |
| #530 | @kang-heewon | feat: OpenCode Zen + Go providers with OpencodeExecutor |
| #512 | @zhangqiang8vip | feat: per-protocol model compatibility (compatByProtocol) |
| #497 | @zhangqiang8vip | fix: dev-mode HMR resource leaks (ZWS v5) |
| #495 | @xandr0s | fix: Bottleneck 429 infinite wait (drop waiting jobs) |
| #494 | @zhangqiang8vip | feat: MiniMax developer→system role fix |
| #480 | @prakersh | fix: stream flush usage extraction |
| #479 | @prakersh | feat: Codex 5.3/5.4 and Anthropic pricing entries |
| #475 | @only4copilot | feat(i18n): improved Chinese translation |
Thank you to all contributors! 🙏
📋 Issues Resolved (50+)
#452 #458 #462 #464 #466 #473 #474 #481 #483 #487 #488 #489 #490 #491 #492 #493 #506 #508 #509 #510 #511 #513 #520 #521 #522 #524 #525 #527 #529 #531 #532 #535 #536 #537 #541 #546 #549 #563 #570 #574 #585
🧪 Tests
- 926 tests, 0 failures (up from 821 in v2.9.5)
- +105 new tests covering: model-combo mappings, registered keys, OpencodeExecutor, Bailian provider, route validation, error classification, aspect ratio mapping, and more
📦 Database Migrations
| Migration | Description |
|---|---|
| 008 | registered_keys, provider_key_limits, account_key_limits tables |
| 009 | requested_model column in call_logs |
| 010 | model_combo_mappings table for per-model combo routing |
⬆️ Upgrading from v2.9.5
# npm
npm install -g omniroute@3.0.0
# Docker
docker pull diegosouzapw/omniroute:3.0.0
# Migrations run automatically on first startup
Breaking changes: None. All existing configurations, combos, and API keys are preserved. Database migrations 008-010 run automatically on startup.
[3.0.0-rc.17] — 2026-03-24
🔒 Security & CI/CD
- CodeQL remediation — Fixed 10+ alerts:
- 6 polynomial-redos in
provider.ts/chatCore.ts(replaced(?:^|/)alternation patterns with segment-based matching) - 1 insecure-randomness in
acp/manager.ts(Math.random()→crypto.randomUUID()) - 1 shell-command-injection in
prepublish.mjs(JSON.stringify()path escaping)
- 6 polynomial-redos in
- Route validation — Added Zod schemas +
validateBody()to 5 routes missing validation:model-combo-mappings(POST, PUT),webhooks(POST, PUT),openapi/try(POST)- CI
check:route-validation:t06now passes: 176/176 routes validated
🐛 Bug Fixes
- #585 —
<omniModel>internal tags no longer leak to clients in SSE responses. Added outbound sanitizationTransformStreamincombo.ts
⚙️ Infrastructure
- Docker — Upgraded
docker/setup-buildx-actionfrom v3 → v4 (Node.js 20 deprecation fix) - CI cleanup — Deleted 150+ failed/cancelled workflow runs
🧪 Tests
- Test suite: 926 tests, 0 failures (+3 new)
[3.0.0-rc.16] — 2026-03-24
✨ New Features
- Increased media transcription limits
- Added Model Context Length to registry metadata
- Added per-model upstream custom headers via configuration UI
- Fixed multiple bugs, Zod valiadation for patches, and resolved various community issues.
[3.0.0-rc.15] — 2026-03-24
✨ New Features
- #563 — Per-model Combo Routing: map model name patterns (glob) to specific combos for automatic routing
- New
model_combo_mappingstable (migration 010) with pattern, combo_id, priority, enabled resolveComboForModel()DB function with glob-to-regex matching (case-insensitive,*and?wildcards)getComboForModel()inmodel.ts: augmentsgetCombo()with model-pattern fallbackchat.ts: routing decision now checks model-combo mappings before single-model handling- API:
GET/POST /api/model-combo-mappings,GET/PUT/DELETE /api/model-combo-mappings/:id - Dashboard: "Model Routing Rules" section added to Combos page with inline add/edit/toggle/delete
- Examples:
claude-sonnet*→ code-combo,gpt-4o*→ openai-combo,gemini-*→ google-combo
- New
🌐 i18n
- Full i18n Sync: 2,788 missing keys added across 30 language files — all languages now at 100% parity with
en.json - Agents page i18n: OpenCode Integration section fully internationalized (title, description, scanning, download labels)
- 6 new keys added to
agentsnamespace for OpenCode section
🎨 UI/UX
- Provider Icons: 16 missing provider icons added (3 copied, 2 downloaded, 11 SVG created)
- SVG fallback:
ProviderIconcomponent updated with 4-tier strategy: Lobehub → PNG → SVG → Generic icon - Agents fingerprinting: Synced with CLI tools — added droid, openclaw, copilot, opencode to fingerprint list (14 total)
🔒 Security
- CVE fix: Resolved dompurify XSS vulnerability (GHSA-v2wj-7wpq-c8vv) via npm overrides forcing
dompurify@^3.3.2 npm auditnow reports 0 vulnerabilities
🧪 Tests
- Test suite: 923 tests, 0 failures (+15 new model-combo mapping tests)
[3.0.0-rc.14] — 2026-03-23
🔀 Community PRs Merged
| PR | Author | Summary |
|---|---|---|
| #562 | @coobabm | fix(ux): MCP session management, Claude passthrough normalization, OAuth modal, detectFormat |
| #561 | @zen0bit | fix(i18n): Czech translation corrections — HTTP method names and documentation updates |
🧪 Tests
- Test suite: 908 tests, 0 failures
[3.0.0-rc.13] — 2026-03-23
🔧 Bug Fixes
- config: resolve real API key from
keyIdin CLI settings routes (codex-settings,droid-settings,kilo-settings) to prevent writing masked strings (#549)
[3.0.0-rc.12] — 2026-03-23
🔀 Community PRs Merged
| PR | Author | Summary |
|---|---|---|
| #546 | @k0valik | fix(cli): --version returning unknown on Windows — use JSON.parse(readFileSync) instead of ESM import |
| #555 | @k0valik | fix(sse): centralized resolveDataDir() for path resolution in credentials, autoCombo, responses logger, and request logger |
| #544 | @k0valik | fix(cli): secure CLI tool detection via known installation paths (8 tools) with symlink validation, file-type checks, size bounds, minimal env in healthcheck |
| #542 | @rdself | fix(ui): improve light mode contrast — add missing CSS theme variables (bg-primary, bg-subtle, text-primary) and fix dark-only colors in log detail |
🔧 Bug Fixes
- TDZ fix in
cliRuntime.ts—validateEnvPathwas used before initialization at module startup bygetExpectedParentPaths(). Reordered declarations to fixReferenceError. - Build fixes — Added
pinoandpino-prettytoserverExternalPackagesto prevent Turbopack from breaking Pino's internal worker loading.
🧪 Tests
- Test suite: 905 tests, 0 failures
[3.0.0-rc.10] — 2026-03-23
🔧 Bug Fixes
- #509 / #508 — Electron build regression: downgraded Next.js from
16.1.xto16.0.10to eliminate Turbopack module-hashing instability that caused blank screens in the Electron desktop bundle. - Unit test fixes — Corrected two stale test assertions (
nanobanana-image-handleraspect ratio/resolution,thinking-budgetGeminithinkingConfigfield mapping) that had drifted after recent implementation changes. - #541 — Responded to user feedback about installation complexity; no code changes required.
[3.0.0-rc.9] — 2026-03-23
✨ New Features
- T29 — Vertex AI SA JSON Executor: implemented using the
joselibrary to handle JWT/Service Account auth, along with configurable regions in the UI and automatic partner model URL building. - T42 — Image generation aspect ratio mapping: created
sizeMapperlogic for generic OpenAI formats (size), added nativeimagen3handling, and updated NanoBanana endpoints to utilize mapped aspect ratios automatically. - T38 — Centralized model specifications:
modelSpecs.tscreated for limits and parameters per model.
🔧 Improvements
- T40 — OpenCode CLI tools integration: native
opencode-zenandopencode-gointegration completed in earlier PR.
[3.0.0-rc.8] — 2026-03-23
🔧 Bug Fixes & Improvements (Fallback, Quota & Budget)
- T24 —
503cooldown await fix +406mapping: mapped406 Not Acceptableto503 Service Unavailablewith proper cooldown intervals. - T25 — Provider validation fallback: graceful fallback to standard validation models when a specific
validationModelIdis not present. - T36 —
403vs429provider handling refinement: extracted intoerrorClassifier.tsto properly segregate hard permissions failures (403) from rate limits (429). - T39 — Endpoint Fallback for
fetchAvailableModels: implemented a tri-tier mechanism (/models->/v1/models-> local generic catalog) +list_models_catalogMCP tool updates to reflectsourceandwarning. - T33 — Thinking level to budget conversion: translates qualitative thinking levels into precise budget allocations.
- T41 — Background task auto redirect: routes heavy background evaluation tasks to flash/efficient models automatically.
- T23 — Intelligent quota reset fallback: accurately extracts
x-ratelimit-reset/retry-afterheader values or maps static cooldowns.
[3.0.0-rc.7] — 2026-03-23 (What's New vs v2.9.5 — will be released as v3.0.0)
Upgrade from v2.9.5: 16 issues resolved · 2 community PRs merged · 2 new providers · 7 new API endpoints · 3 new features · DB migration 008+009 · 832 tests passing · 15 sub2api gap improvements (T01–T15 complete).
🆕 New Providers
| Provider | Alias | Tier | Notes |
|---|---|---|---|
| OpenCode Zen | opencode-zen |
Free | 3 models via opencode.ai/zen/v1 (PR #530 by @kang-heewon) |
| OpenCode Go | opencode-go |
Paid | 4 models via opencode.ai/zen/go/v1 (PR #530 by @kang-heewon) |
Both providers use the new OpencodeExecutor with multi-format routing (/chat/completions, /messages, /responses, /models/{model}:generateContent).
✨ New Features
🔑 Registered Keys Provisioning API (#464)
Auto-generate and issue OmniRoute API keys programmatically with per-provider and per-account quota enforcement.
| Endpoint | Method | Description |
|---|---|---|
/api/v1/registered-keys |
POST |
Issue a new key — raw key returned once only |
/api/v1/registered-keys |
GET |
List registered keys (masked) |
/api/v1/registered-keys/{id} |
GET |
Get key metadata |
/api/v1/registered-keys/{id} |
DELETE |
Revoke a key |
/api/v1/registered-keys/{id}/revoke |
POST |
Revoke (for clients without DELETE support) |
/api/v1/quotas/check |
GET |
Pre-validate quota before issuing |
/api/v1/providers/{id}/limits |
GET/PUT |
Configure per-provider issuance limits |
/api/v1/accounts/{id}/limits |
GET/PUT |
Configure per-account issuance limits |
/api/v1/issues/report |
POST |
Report quota events to GitHub Issues |
DB — Migration 008: Three new tables: registered_keys, provider_key_limits, account_key_limits.
Security: Keys stored as SHA-256 hashes. Raw key shown once on creation, never retrievable again.
Quota types: maxActiveKeys, dailyIssueLimit, hourlyIssueLimit per provider and per account.
Idempotency: idempotency_key field prevents duplicate issuance. Returns 409 IDEMPOTENCY_CONFLICT if key was already used.
Budget per key: dailyBudget / hourlyBudget — limits how many requests a key can route per window.
GitHub reporting: Optional. Set GITHUB_ISSUES_REPO + GITHUB_ISSUES_TOKEN to auto-create GitHub issues on quota exceeded or issuance failures.
🎨 Provider Icons — @lobehub/icons (#529)
All provider icons in the dashboard now use @lobehub/icons React components (130+ providers with SVG).
Fallback chain: Lobehub SVG → existing /providers/{id}.png → generic icon. Uses a proper React ErrorBoundary pattern.
🔄 Model Auto-Sync Scheduler (#488)
OmniRoute now automatically refreshes model lists for connected providers every 24 hours.
- Runs on server startup via the existing
/api/sync/initializehook - Configurable via
MODEL_SYNC_INTERVAL_HOURSenvironment variable - Covers 16 major providers
- Records last sync time in the settings database
🔧 Bug Fixes
OAuth & Auth
- #537 — Gemini CLI OAuth: Clear actionable error when
GEMINI_OAUTH_CLIENT_SECRETis missing in Docker/self-hosted deployments. Previously showed crypticclient_secret is missingfrom Google. Now provides specificdocker-compose.ymland~/.omniroute/.envinstructions.
Providers & Routing
- #536 — LongCat AI: Fixed
baseUrl(api.longcat.chat/openai) andauthHeader(Authorization: Bearer). - #535 — Pinned model override:
body.modelis now correctly set topinnedModelwhen context-cache protection is active. - #532 — OpenCode Go key validation: Now uses the
zen/v1test endpoint (testKeyBaseUrl) — same key works for both tiers.
CLI & Tools
- #527 — Claude Code + Codex loop:
tool_resultblocks are now converted to text instead of dropped, stopping infinite tool-result loops. - #524 — OpenCode config save: Added
saveOpenCodeConfig()handler (XDG_CONFIG_HOME aware, writes TOML). - #521 — Login stuck: Login no longer freezes after skipping password setup — redirects correctly to onboarding.
- #522 — API Manager: Removed misleading "Copy masked key" button (replaced with a lock icon tooltip).
- #532 — OpenCode Go config: Guide settings handler now handles
opencodetoolId.
Developer Experience
- #489 — Antigravity: Missing
googleProjectIdreturns a structured 422 error with reconnect guidance instead of a cryptic crash. - #510 — Windows paths: MSYS2/Git-Bash paths (
/c/Program Files/...) are now normalized toC:\Program Files\...automatically. - #492 — CLI startup:
omnirouteCLI now detectsmise/nvm-managed Node whenapp/server.jsis missing and shows targeted fix instructions.
📖 Documentation Updates
- #513 — Docker password reset:
INITIAL_PASSWORDenv var workaround documented - #520 — pnpm:
pnpm approve-builds better-sqlite3step documented
✅ Issues Resolved in v3.0.0
#464 #488 #489 #492 #510 #513 #520 #521 #522 #524 #527 #529 #532 #535 #536 #537
🔀 Community PRs Merged
| PR | Author | Summary |
|---|---|---|
| #530 | @kang-heewon | OpenCode Zen + Go providers with OpencodeExecutor and improved tests |
[3.0.0-rc.7] - 2026-03-23
🔧 Improvements (sub2api Gap Analysis — T05, T08, T09, T13, T14)
- T05 — Rate-limit DB persistence:
setConnectionRateLimitUntil(),isConnectionRateLimited(),getRateLimitedConnections()inproviders.ts. The existingrate_limited_untilcolumn is now exposed as a dedicated API — OAuth token refresh must NOT touch this field to prevent rate-limit loops. - T08 — Per-API-key session limit:
max_sessions INTEGER DEFAULT 0added toapi_keysvia auto-migration.sessionManager.tsgainsregisterKeySession(),unregisterKeySession(),checkSessionLimit(), andgetActiveSessionCountForKey(). Callers inchatCore.jscan enforce the limit and decrement onreq.close. - T09 — Codex vs Spark rate-limit scopes:
getCodexModelScope()andgetCodexRateLimitKey()incodex.ts. Standard models (gpt-5.x-codex,codex-mini) get scope"codex"; spark models (codex-spark*) get scope"spark". Rate-limit keys should be${accountId}:${scope}so exhausting one pool doesn't block the other. - T13 — Stale quota display fix:
getEffectiveQuotaUsage(used, resetAt)returns0when the reset window has passed;formatResetCountdown(resetAt)returns a human-readable countdown string (e.g."2h 35m"). Both exported fromproviders.ts+localDb.tsfor dashboard consumption. - T14 — Proxy fast-fail: new
src/lib/proxyHealth.tswithisProxyReachable(proxyUrl, timeoutMs=2000)(TCP check, ≤2s instead of 30s timeout),getCachedProxyHealth(),invalidateProxyHealth(), andgetAllProxyHealthStatuses(). Results cached 30s by default; configurable viaPROXY_FAST_FAIL_TIMEOUT_MS/PROXY_HEALTH_CACHE_TTL_MS.
🧪 Tests
- Test suite: 832 tests, 0 failures
[3.0.0-rc.6] - 2026-03-23
🔧 Bug Fixes & Improvements (sub2api Gap Analysis — T01–T15)
- T01 —
requested_modelcolumn incall_logs(migration 009): track which model the client originally requested vs the actual routed model. Enables fallback rate analytics. - T02 — Strip empty text blocks from nested
tool_result.content: prevents Anthropic 400 errors (text content blocks must be non-empty) when Claude Code chains tool results. - T03 — Parse
x-codex-5h-*/x-codex-7d-*headers:parseCodexQuotaHeaders()+getCodexResetTime()extract Codex quota windows for precise cooldown scheduling instead of generic 5-min fallback. - T04 —
X-Session-Idheader for external sticky routing:extractExternalSessionId()insessionManager.tsreadsx-session-id/x-omniroute-sessionheaders withext:prefix to avoid collision with internal SHA-256 session IDs. Nginx-compatible (hyphenated header). - T06 — Account deactivated → permanent block:
isAccountDeactivated()inaccountFallback.tsdetects 401 deactivation signals and applies a 1-year cooldown to prevent retrying permanently dead accounts. - T07 — X-Forwarded-For IP validation: new
src/lib/ipUtils.tswithextractClientIp()andgetClientIpFromRequest()— skipsunknown/non-IP entries inX-Forwarded-Forchains (Nginx/proxy-forwarded requests). - T10 — Credits exhausted → distinct fallback:
isCreditsExhausted()inaccountFallback.tsreturns 1h cooldown withcreditsExhaustedflag, distinct from generic 429 rate limiting. - T11 —
maxreasoning effort → 131072 budget tokens:EFFORT_BUDGETSandTHINKING_LEVEL_MAPupdated; reverse mapping now returns"max"for full-budget responses. Unit test updated. - T12 — MiniMax M2.7 pricing entries added:
minimax-m2.7,MiniMax-M2.7,minimax-m2.7-highspeedadded to pricing table (sub2api PR #1120). M2.5/GLM-4.7/GLM-5/Kimi pricing already existed. - T15 — Array content normalization:
normalizeContentToString()helper inopenai-to-claude.tscorrectly collapses array-formatted system/tool messages to string before sending to Anthropic.
🧪 Tests
- Test suite: 832 tests, 0 failures (unchanged from rc.5)
[3.0.0-rc.5] - 2026-03-22
✨ New Features
- #464 — Registered Keys Provisioning API: auto-issue API keys with per-provider & per-account quota enforcement
POST /api/v1/registered-keys— issue keys with idempotency supportGET /api/v1/registered-keys— list (masked) registered keysGET /api/v1/registered-keys/{id}— get key metadataDELETE /api/v1/registered-keys/{id}/POST ../{id}/revoke— revoke keysGET /api/v1/quotas/check— pre-validate before issuingPUT /api/v1/providers/{id}/limits— set provider issuance limitsPUT /api/v1/accounts/{id}/limits— set account issuance limitsPOST /api/v1/issues/report— optional GitHub issue reporting- DB migration 008:
registered_keys,provider_key_limits,account_key_limitstables
[3.0.0-rc.4] - 2026-03-22
✨ New Features
- #530 (PR) — OpenCode Zen and OpenCode Go providers added (by @kang-heewon)
- New
OpencodeExecutorwith multi-format routing (/chat/completions,/messages,/responses) - 7 models across both tiers
- New
[3.0.0-rc.3] - 2026-03-22
✨ New Features
- #529 — Provider icons now use @lobehub/icons with graceful PNG fallback and a
ProviderIconcomponent (130+ providers supported) - #488 — Auto-update model lists every 24h via
modelSyncScheduler(configurable viaMODEL_SYNC_INTERVAL_HOURS)
🔧 Bug Fixes
- #537 — Gemini CLI OAuth: now shows clear actionable error when
GEMINI_OAUTH_CLIENT_SECRETis missing in Docker/self-hosted deployments
[3.0.0-rc.2] - 2026-03-22
🔧 Bug Fixes
- #536 — LongCat AI key validation: fixed baseUrl (
api.longcat.chat/openai) and authHeader (Authorization: Bearer) - #535 — Pinned model override:
body.modelis now set topinnedModelwhen context-cache protection detects a pinned model - #524 — OpenCode config now saved correctly: added
saveOpenCodeConfig()handler (XDG_CONFIG_HOME aware, writes TOML)
[3.0.0-rc.1] - 2026-03-22
🔧 Bug Fixes
- #521 — Login no longer gets stuck after skipping password setup (redirects to onboarding)
- #522 — API Manager: Removed misleading "Copy masked key" button (replaced with lock icon tooltip)
- #527 — Claude Code + Codex superpowers loop:
tool_resultblocks now converted to text instead of dropped - #532 — OpenCode GO API key validation now uses the correct
zen/v1endpoint (testKeyBaseUrl) - #489 — Antigravity: missing
googleProjectIdreturns structured 422 error with reconnect guidance - #510 — Windows: MSYS2/Git-Bash paths (
/c/Program Files/...) are now normalized toC:\Program Files\... - #492 —
omnirouteCLI now detectsmise/nvmwhenapp/server.jsis missing and shows targeted fix
📖 Documentation
- #513 — Docker password reset:
INITIAL_PASSWORDenv var workaround documented - #520 — pnpm:
pnpm approve-builds better-sqlite3documented
✅ Closed Issues
#489, #492, #510, #513, #520, #521, #522, #525, #527, #532
[2.9.5] — 2026-03-22
Sprint: New OpenCode providers, embedding credentials fix, CLI masked key bug, CACHE_TAG_PATTERN fix.
🐛 Bug Fixes
- CLI tools save masked API key to config files —
claude-settings,cline-settings, andopenclaw-settingsPOST routes now accept akeyIdparam and resolve the real API key from DB before writing to disk.ClaudeToolCardupdated to sendkeyIdinstead of the masked display string. Fixes #523, #526. - Custom embedding providers:
No credentialserror —/v1/embeddingsnow trackscredentialsProviderIdseparately from the routing prefix, so credentials are fetched from the matching provider node ID rather than the public prefix string. Fixes a regression wheregoogle/gemini-embedding-001and similar custom-provider models would always fail with a credentials error. Fixes #532-related. (PR #528 by @jacob2826) - Context cache protection regex misses
prefix —CACHE_TAG_PATTERNincomboAgentMiddleware.tsupdated to match both literal(backslash-n) and actual newline U+000A thatcombo.tsstreaming injects around the<omniModel>tag after fix #515. Fixes #531.
✨ New Providers
- OpenCode Zen — Free tier gateway at
opencode.ai/zen/v1with 3 models:minimax-m2.5-free,big-pickle,gpt-5-nano - OpenCode Go — Subscription service at
opencode.ai/zen/go/v1with 4 models:glm-5,kimi-k2.5,minimax-m2.7(Claude format),minimax-m2.5(Claude format) - Both providers use the new
OpencodeExecutorwhich routes dynamically to/chat/completions,/messages,/responses, or/models/{model}:generateContentbased on the requested model. (PR #530 by @kang-heewon)
[2.9.4] — 2026-03-21
Sprint: Bug fixes — preserve Codex prompt cache key, fix tagContent JSON escaping, sync expired token status to DB.
🐛 Bug Fixes
-
fix(translator): Preserve
prompt_cache_keyin Responses API → Chat Completions translation (#517) — The field is a cache-affinity signal used by Codex; stripping it was preventing prompt cache hits. Fixed inopenai-responses.tsandresponsesApiHelper.ts. -
fix(combo): Escape
intagContentso injected JSON string is valid (#515) — Template literal newlines (U+000A) are not allowed unescaped inside JSON string values. Replaced with\nliteral sequences inopen-sse/services/combo.ts. -
fix(usage): Sync expired token status back to DB on live auth failure (#491) — When the Limits & Quotas live check returns 401/403, the connection
testStatusis now updated to"expired"in the database so the Providers page reflects the same degraded state. Fixed insrc/app/api/usage/[connectionId]/route.ts.
[2.9.3] — 2026-03-21
Sprint: Add 5 new free AI providers — LongCat, Pollinations, Cloudflare AI, Scaleway, AI/ML API.
✨ New Providers
- feat(providers/longcat): Add LongCat AI (
lc/) — 50M tokens/day free (Flash-Lite) + 500K/day (Chat/Thinking) during public beta. OpenAI-compatible, standard Bearer auth. - feat(providers/pollinations): Add Pollinations AI (
pol/) — no API key required. Proxies GPT-5, Claude, Gemini, DeepSeek V3, Llama 4 (1 req/15s free). Custom executor handles optional auth. - feat(providers/cloudflare-ai): Add Cloudflare Workers AI (
cf/) — 10K Neurons/day free (~150 LLM responses or 500s Whisper audio). 50+ models on global edge. Custom executor builds dynamic URL withaccountIdfrom credentials. - feat(providers/scaleway): Add Scaleway Generative APIs (
scw/) — 1M free tokens for new accounts. EU/GDPR compliant (Paris). Qwen3 235B, Llama 3.1 70B, Mistral Small 3.2. - feat(providers/aimlapi): Add AI/ML API (
aiml/) — $0.025/day free credit, 200+ models (GPT-4o, Claude, Gemini, Llama) via single aggregator endpoint.
🔄 Provider Updates
- feat(providers/together): Add
hasFree: true+ 3 permanently free model IDs:Llama-3.3-70B-Instruct-Turbo-Free,Llama-Vision-Free,DeepSeek-R1-Distill-Llama-70B-Free - feat(providers/gemini): Add
hasFree: true+freeNote(1,500 req/day, no credit card needed, aistudio.google.com) - chore(providers/gemini): Rename display name to
Gemini (Google AI Studio)for clarity
⚙️ Infrastructure
- feat(executors/pollinations): New
PollinationsExecutor— omitsAuthorizationheader when no API key provided - feat(executors/cloudflare-ai): New
CloudflareAIExecutor— dynamic URL construction requiresaccountIdin provider credentials - feat(executors): Register
pollinations,pol,cloudflare-ai,cfexecutor mappings
📝 Documentation
- docs(readme): Expanded free combo stack to 11 providers ($0 forever)
- docs(readme): Added 4 new free provider sections (LongCat, Pollinations, Cloudflare AI, Scaleway) with model tables
- docs(readme): Updated pricing table with 4 new free tier rows
- docs(i18n/pt-BR): Updated pricing table + added LongCat/Pollinations/Cloudflare AI/Scaleway sections in Portuguese
- docs(new-features/ai): 10 task spec files + master implementation plan in
docs/new-features/ai/
🧪 Tests
- Test suite: 821 tests, 0 failures (unchanged)
[2.9.2] — 2026-03-21
Sprint: Fix media transcription (Deepgram/HuggingFace Content-Type, language detection) and TTS error display.
🐛 Bug Fixes
- fix(transcription): Deepgram and HuggingFace audio transcription now correctly map
video/mp4→audio/mp4and other media MIME types via newresolveAudioContentType()helper. Previously, uploading.mp4files consistently returned "No speech detected" because Deepgram was receivingContent-Type: video/mp4. - fix(transcription): Added
detect_language=trueto Deepgram requests — auto-detects audio language (Portuguese, Spanish, etc.) instead of defaulting to English. Fixes non-English transcriptions returning empty or garbage results. - fix(transcription): Added
punctuate=trueto Deepgram requests for higher-quality transcription output with correct punctuation. - fix(tts):
[object Object]error display in Text-to-Speech responses fixed in bothaudioSpeech.tsandaudioTranscription.ts. TheupstreamErrorResponse()function now correctly extracts nested string messages from providers like ElevenLabs that return{ error: { message: "...", status_code: 401 } }instead of a flat error string.
🧪 Tests
- Test suite: 821 tests, 0 failures (unchanged)
Triaged Issues
- #508 — Tool call format regression: requested proxy logs and provider chain info (
needs-info) - #510 — Windows CLI healthcheck path: requested shell/Node version info (
needs-info) - #485 — Kiro MCP tool calls: closed as external Kiro issue (not OmniRoute)
- #442 — Baseten /models endpoint: closed (documented manual workaround)
- #464 — Key provisioning API: acknowledged as roadmap item
[2.9.1] — 2026-03-21
Sprint: Fix SSE omniModel data loss, merge per-protocol model compatibility.
Bug Fixes
- #511 — Critical:
<omniModel>tag was sent afterfinish_reason:stopin SSE streams, causing data loss. Tag is now injected into the first non-empty content chunk, guaranteeing delivery before SDKs close the connection.
Merged PRs
- PR #512 (@zhangqiang8vip): Per-protocol model compatibility —
normalizeToolCallIdandpreserveOpenAIDeveloperRolecan now be configured per client protocol (OpenAI, Claude, Responses API). NewcompatByProtocolfield in model config with Zod validation.
Triaged Issues
- #510 — Windows CLI healthcheck_failed: requested PATH/version info
- #509 — Turbopack Electron regression: upstream Next.js bug, documented workarounds
- #508 — macOS black screen: suggested
--disable-gpuworkaround
[2.9.0] — 2026-03-20
Sprint: Cross-platform machineId fix, per-API-key rate limits, streaming context cache, Alibaba DashScope, search analytics, ZWS v5, and 8 issues closed.
✨ New Features
- feat(search): Search Analytics tab in
/dashboard/analytics— provider breakdown, cache hit rate, cost tracking. New API:GET /api/v1/search/analytics(#feat/search-provider-routing) - feat(provider): Alibaba Cloud DashScope added with custom endpoint path validation — configurable
chatPathandmodelsPathper node (#feat/custom-endpoint-paths) - feat(api): Per-API-key request-count limits —
max_requests_per_dayandmax_requests_per_minutecolumns with in-memory sliding-window enforcement returning HTTP 429 (#452) - feat(dev): ZWS v5 — HMR leak fix (485 DB connections → 1), memory 2.4GB → 195MB,
globalThissingletons, Edge Runtime warning fix (@zhangqiang8vip)
🐛 Bug Fixes
- fix(#506): Cross-platform
machineId—getMachineIdRaw()rewritten with try/catch waterfall (Windows REG.exe → macOS ioreg → Linux file read → hostname →os.hostname()). Eliminatesprocess.platformbranching that Next.js bundler dead-code-eliminated, fixing'head' is not recognizedon Windows. Also fixes #466. - fix(#493): Custom provider model naming — removed incorrect prefix stripping in
DefaultExecutor.transformRequest()that mangled org-scoped model IDs likezai-org/GLM-5-FP8. - fix(#490): Streaming + context cache protection —
TransformStreamintercepts SSE to inject<omniModel>tag before[DONE]marker, enabling context cache protection for streaming responses. - fix(#458): Combo schema validation —
system_message,tool_filter_regex,context_cache_protectionfields now pass Zod validation on save. - fix(#487): KIRO MITM card cleanup — removed ZWS_README, generified
AntigravityToolCardto use dynamic tool metadata.
🧪 Tests
- Added Anthropic-format tools filter unit tests (PR #397) — 8 regression tests for
tool.namewithout.functionwrapper - Test suite: 821 tests, 0 failures (up from 813)
📋 Issues Closed (8)
- #506 — Windows machineId
headnot recognized (fixed) - #493 — Custom provider model naming (fixed)
- #490 — Streaming context cache (fixed)
- #452 — Per-API-key request limits (implemented)
- #466 — Windows login failure (same root cause as #506)
- #504 — MITM inactive (expected behavior)
- #462 — Gemini CLI PSA (resolved)
- #434 — Electron app crash (duplicate of #402)
[2.8.9] — 2026-03-20
Sprint: Merge community PRs, fix KIRO MITM card, dependency updates.
Merged PRs
- PR #498 (@Sajid11194): Fix Windows machine ID crash (
undefined\REG.exe). Replacesnode-machine-idwith native OS registry queries. Closes #486. - PR #497 (@zhangqiang8vip): Fix dev-mode HMR resource leaks — 485 leaked DB connections → 1, memory 2.4GB → 195MB.
globalThissingletons, Edge Runtime warning fix, Windows test stability. (+1168/-338 across 22 files) - PRs #499-503 (Dependabot): GitHub Actions updates —
docker/build-push-action@7,actions/checkout@6,peter-evans/dockerhub-description@5,docker/setup-qemu-action@4,docker/login-action@4.
Bug Fixes
- #505 — KIRO MITM card now displays tool-specific instructions (
api.anthropic.com) instead of Antigravity-specific text. - #504 — Responded with UX clarification (MITM "Inactive" is expected behavior when proxy is not running).
[2.8.8] — 2026-03-20
Sprint: Fix OAuth batch test crash, add "Test All" button to individual provider pages.
Bug Fixes
- OAuth batch test crash (ERR_CONNECTION_REFUSED): Replaced sequential for-loop with 5-connection concurrency limit + 30s per-connection timeout via
Promise.race()+Promise.allSettled(). Prevents server crash when testing large OAuth provider groups (~30+ connections).
Features
- "Test All" button on provider pages: Individual provider pages (e.g.,
/providers/codex) now show a "Test All" button in the Connections header when there are 2+ connections. UsesPOST /api/providers/test-batchwith{mode: "provider", providerId}. Results displayed in a modal with pass/fail summary and per-connection diagnosis.
[2.8.7] — 2026-03-20
Sprint: Merge PR #495 (Bottleneck 429 drop), fix #496 (custom embedding providers), triage features.
Bug Fixes
- Bottleneck 429 infinite wait (PR #495 by @xandr0s): On 429,
limiter.stop({ dropWaitingJobs: true })immediately fails all queued requests so upstream callers can trigger fallback. Limiter is deleted from Map so next request creates a fresh instance. - Custom embedding models unresolvable (#496):
POST /v1/embeddingsnow resolves custom embedding models from ALL provider_nodes (not just localhost). Enables models likegoogle/gemini-embedding-001added via dashboard.
Issues Responded
- #452 — Per-API-key request-count limits (acknowledged, on roadmap)
- #464 — Auto-issue API keys with provider/account limits (needs more detail)
- #488 — Auto-update model lists (acknowledged, on roadmap)
- #496 — Custom embedding provider resolution (fixed)
[2.8.6] — 2026-03-20
Sprint: Merge PR #494 (MiniMax role fix), fix KIRO MITM dashboard, triage 8 issues.
Features
- MiniMax developer→system role fix (PR #494 by @zhangqiang8vip): Per-model
preserveDeveloperRoletoggle. Adds "Compatibility" UI in providers page. Fixes 422 "role param error" for MiniMax and similar gateways. - roleNormalizer:
normalizeDeveloperRole()now acceptspreserveDeveloperRoleparameter with tri-state behavior (undefined=keep, true=keep, false=convert). - DB: New
getModelPreserveOpenAIDeveloperRole()andmergeModelCompatOverride()inmodels.ts.
Bug Fixes
- KIRO MITM dashboard (#481/#487):
CLIToolsPageClientnow routes anyconfigType: "mitm"tool toAntigravityToolCard(MITM Start/Stop controls). Previously only Antigravity was hardcoded. - AntigravityToolCard generic: Uses
tool.image,tool.description,tool.idinstead of hardcoded Antigravity values. Guards against missingdefaultModels.
Cleanup
- Removed
ZWS_README_V2.md(development-only docs from PR #494).
Issues Triaged (8)
- #487 — Closed (KIRO MITM fixed in this release)
- #486 — needs-info (Windows REG.exe PATH issue)
- #489 — needs-info (Antigravity projectId missing, OAuth reconnect needed)
- #492 — needs-info (missing app/server.js on mise-managed Node)
- #490 — Acknowledged (streaming + context cache blocking, fix planned)
- #491 — Acknowledged (Codex auth state inconsistency)
- #493 — Acknowledged (Modal provider model name prefix, workaround provided)
- #488 — Feature request backlog (auto-update model lists)
[2.8.5] — 2026-03-19
Sprint: Fix zombie SSE streams, context cache first-turn, KIRO MITM, and triage 5 external issues.
Bug Fixes
- Zombie SSE Streams (#473): Reduce
STREAM_IDLE_TIMEOUT_MSfrom 300s → 120s for faster combo fallback when providers hang mid-stream. Configurable via env var. - Context Cache Tag (#474): Fix
injectModelTag()to handle first-turn requests (no assistant messages) — context cache protection now works from the very first response. - KIRO MITM (#481): Change KIRO
configTypefromguide→mitmso the dashboard renders MITM Start/Stop controls. - E2E Test (CI): Fix
providers-bailian-coding-plan.spec.ts— dismiss pre-existing modal overlay before clicking Add API Key button.
Closed Issues
- #473 — Zombie SSE streams bypass combo fallback
- #474 — Context cache
<omniModel>tag missing on first turn - #481 — MITM for KIRO not activatable from dashboard
- #468 — Gemini CLI remote server (superseded by #462 deprecation)
- #438 — Claude unable to write files (external CLI issue)
- #439 — AppImage doesn't work (documented libfuse2 workaround)
- #402 — ARM64 DMG "damaged" (documented xattr -cr workaround)
- #460 — CLI not runnable on Windows (documented PATH fix)
[2.8.4] — 2026-03-19
Sprint: Gemini CLI deprecation, VM guide i18n fix, dependabot security fix, provider schema expansion.
Features
- Gemini CLI Deprecation (#462): Mark
gemini-cliprovider as deprecated with warning — Google restricts third-party OAuth usage from March 2026 - Provider Schema (#462): Expand Zod validation with
deprecated,deprecationReason,hasFree,freeNote,authHint,apiHintoptional fields
Bug Fixes
- VM Guide i18n (#471): Add
VM_DEPLOYMENT_GUIDE.mdto i18n translation pipeline, regenerate all 30 locale translations from English source (were stuck in Portuguese)
Security
- deps: Bump
flatted3.3.3 → 3.4.2 — fixes CWE-1321 prototype pollution (#484, @dependabot)
Closed Issues
- #472 — Model Aliases regression (fixed in v2.8.2)
- #471 — VM guide translations broken
- #483 — Trailing
data: nullafter[DONE](fixed in v2.8.3)
Merged PRs
- #484 — deps: bump flatted from 3.3.3 to 3.4.2 (@dependabot)
[2.8.3] — 2026-03-19
Sprint: Czech i18n, SSE protocol fix, VM guide translation.
Features
- Czech Language (#482): Full Czech (cs) i18n — 22 docs, 2606 UI strings, language switcher updates (@zen0bit)
- VM Deployment Guide: Translated from Portuguese to English as the source document (@zen0bit)
Bug Fixes
- SSE Protocol (#483): Stop sending trailing
data: nullafter[DONE]signal — fixesAI_TypeValidationErrorin strict AI SDK clients (Zod-based validators)
Merged PRs
- #482 — Add Czech language + Fix VM_DEPLOYMENT_GUIDE.md English source (@zen0bit)
[2.8.2] — 2026-03-19
Sprint: 2 merged PRs, model aliases routing fix, log export, and issue triage.
Features
- Log Export: New Export button on
/dashboard/logswith time range dropdown (1h, 6h, 12h, 24h). Downloads JSON of request/proxy/call logs via/api/logs/exportAPI (#user-request)
Bug Fixes
- Model Aliases Routing (#472): Settings → Model Aliases now correctly affect provider routing, not just format detection. Previously
resolveModelAlias()output was only used forgetModelTargetFormat()but the original model ID was sent to the provider - Stream Flush Usage (#480): Usage data from the last SSE event in the buffer is now correctly extracted during stream flush (merged from @prakersh)
Merged PRs
- #480 — Extract usage from remaining buffer in flush handler (@prakersh)
- #479 — Add missing Codex 5.3/5.4 and Anthropic model ID pricing entries (@prakersh)
[2.8.1] — 2026-03-19
Sprint: Five community PRs — streaming call log fixes, Kiro compatibility, cache token analytics, Chinese translation, and configurable tool call IDs.
✨ Features
- feat(logs): Call log response content now correctly accumulated from raw provider chunks (OpenAI/Claude/Gemini) before translation, fixing empty response payloads in streaming mode (#470, @zhangqiang8vip)
- feat(providers): Per-model configurable 9-char tool call ID normalization (Mistral-style) — only models with the option enabled get truncated IDs (#470)
- feat(api): Key PATCH API expanded to support
allowedConnections,name,autoResolve,isActive, andaccessSchedulefields (#470) - feat(dashboard): Response-first layout in request log detail UI (#470)
- feat(i18n): Improved Chinese (zh-CN) translation — complete retranslation (#475, @only4copilot)
🐛 Bug Fixes
- fix(kiro): Strip injected
modelfield from request body — Kiro API rejects unknown top-level fields (#478, @prakersh) - fix(usage): Include cache read + cache creation tokens in usage history input totals for accurate analytics (#477, @prakersh)
- fix(callLogs): Support Claude format usage fields (
input_tokens/output_tokens) alongside OpenAI format, include all cache token variants (#476, @prakersh)
[2.8.0] — 2026-03-19
Sprint: Bailian Coding Plan provider with editable base URLs, plus community contributions for Alibaba Cloud and Kimi Coding.
✨ Features
- feat(providers): Added Bailian Coding Plan (
bailian-coding-plan) — Alibaba Model Studio with Anthropic-compatible API. Static catalog of 8 models including Qwen3.5 Plus, Qwen3 Coder, MiniMax M2.5, GLM 5, and Kimi K2.5. Includes custom auth validation (400=valid, 401/403=invalid) (#467, @Mind-Dragon) - feat(admin): Editable default URL in Provider Admin create/edit flows — users can configure custom base URLs per connection. Persisted in
providerSpecificData.baseUrlwith Zod schema validation rejecting non-http(s) schemes (#467)
🧪 Tests
- Added 30+ unit tests and 2 e2e scenarios for Bailian Coding Plan provider covering auth validation, schema hardening, route-level behavior, and cross-layer integration
[2.7.10] — 2026-03-19
Sprint: Two new community-contributed providers (Alibaba Cloud Coding, Kimi Coding API-key) and Docker pino fix.
✨ Features
- feat(providers): Added Alibaba Cloud Coding Plan support with two OpenAI-compatible endpoints —
alicode(China) andalicode-intl(International), each with 8 models (#465, @dtk1985) - feat(providers): Added dedicated
kimi-coding-apikeyprovider path — API-key-based Kimi Coding access is no longer forced through OAuth-onlykimi-codingroute. Includes registry, constants, models API, config, and validation test (#463, @Mind-Dragon)
🐛 Bug Fixes
- fix(docker): Added missing
split2dependency to Docker image —pino-abstract-transportrequires it at runtime but it was not being copied into the standalone container, causingCannot find module 'split2'crashes (#459)
[2.7.9] — 2026-03-18
Sprint: Codex responses subpath passthrough natively supported, Windows MITM crash fixed, and Combos agent schemas adjusted.
✨ Features
- feat(codex): Native responses subpath passthrough for Codex — natively routes
POST /v1/responses/compactto Codex upstream, maintaining Claude Code compatibility without stripping the/compactsuffix (#457)
🐛 Bug Fixes
- fix(combos): Zod schemas (
updateComboSchemaandcreateComboSchema) now includesystem_message,tool_filter_regex, andcontext_cache_protection. Fixes bug where agent-specific settings created via the dashboard were silently discarded by the backend validation layer (#458) - fix(mitm): Kiro MITM profile crash on Windows fixed —
node-machine-idfailed due to missingREG.exeenv, and the fallback threw a fatalcrypto is not definederror. Fallback now safely and correctly imports crypto (#456)
[2.7.8] — 2026-03-18
Sprint: Budget save bug + combo agent features UI + omniModel tag security fix.
🐛 Bug Fixes
- fix(budget): "Save Limits" no longer returns 422 —
warningThresholdis now correctly sent as fraction (0–1) instead of percentage (0–100) (#451) - fix(combos):
<omniModel>internal cache tag is now stripped before forwarding requests to providers, preventing cache session breaks (#454)
✨ Features
- feat(combos): Agent Features section added to combo create/edit modal — expose
system_messageoverride,tool_filter_regex, andcontext_cache_protectiondirectly from the dashboard (#454)
[2.7.7] — 2026-03-18
Sprint: Docker pino crash, Codex CLI responses worker fix, package-lock sync.
🐛 Bug Fixes
- fix(docker):
pino-abstract-transportandpino-prettynow explicitly copied in Docker runner stage — Next.js standalone trace misses these peer deps, causingCannot find module pino-abstract-transportcrash on startup (#449) - fix(responses): Remove
initTranslators()from/v1/responsesroute — was crashing Next.js worker withthe worker has exiteduncaughtException on Codex CLI requests (#450)
🔧 Maintenance
- chore(deps):
package-lock.jsonnow committed on every version bump to ensure Dockernpm ciuses exact dependency versions
[2.7.5] — 2026-03-18
Sprint: UX improvements and Windows CLI healthcheck fix.
🐛 Bug Fixes
- fix(ux): Show default password hint on login page — new users now see
"Default password: 123456"below the password input (#437) - fix(cli): Claude CLI and other npm-installed tools now correctly detected as runnable on Windows — spawn uses
shell:trueto resolve.cmdwrappers via PATHEXT (#447)
[2.7.4] — 2026-03-18
Sprint: Search Tools dashboard, i18n fixes, Copilot limits, Serper validation fix.
🚀 Features
- feat(search): Add Search Playground (10th endpoint), Search Tools page with Compare Providers/Rerank Pipeline/Search History, local rerank routing, auth guards on search API (#443 by @Regis-RCR)
- New route:
/dashboard/search-tools - Sidebar entry under Debug section
GET /api/search/providersandGET /api/search/statswith auth guards- Local provider_nodes routing for
/v1/rerank - 30+ i18n keys in search namespace
- New route:
🐛 Bug Fixes
- fix(search): Fix Brave news normalizer (was returning 0 results), enforce max_results truncation post-normalization, fix Endpoints page fetch URL (#443 by @Regis-RCR)
- fix(analytics): Localize analytics day/date labels — replace hardcoded Portuguese strings with
Intl.DateTimeFormat(locale)(#444 by @hijak) - fix(copilot): Correct GitHub Copilot account type display, filter misleading unlimited quota rows from limits dashboard (#445 by @hijak)
- fix(providers): Stop rejecting valid Serper API keys — treat non-4xx responses as valid authentication (#446 by @hijak)
[2.7.3] — 2026-03-18
Sprint: Codex direct API quota fallback fix.
🐛 Bug Fixes
- fix(codex): Block weekly-exhausted accounts in direct API fallback (#440)
resolveQuotaWindow()prefix matching:"weekly"now matches"weekly (7d)"cache keysapplyCodexWindowPolicy()enforcesuseWeekly/use5htoggles correctly- 4 new regression tests (766 total)
[2.7.2] — 2026-03-18
Sprint: Light mode UI contrast fixes.
🐛 Bug Fixes
- fix(logs): Fix light mode contrast in request logs filter buttons and combo badge (#378)
- Error/Success/Combo filter buttons now readable in light mode
- Combo row badge uses stronger violet in light mode
[2.7.1] — 2026-03-17
Sprint: Unified web search routing (POST /v1/search) with 5 providers + Next.js 16.1.7 security fixes (6 CVEs).
✨ New Features
- feat(search): Unified web search routing —
POST /v1/searchwith 5 providers (Serper, Brave, Perplexity, Exa, Tavily)- Auto-failover across providers, 6,500+ free searches/month
- In-memory cache with request coalescing (configurable TTL)
- Dashboard: Search Analytics tab in
/dashboard/analyticswith provider breakdown, cache hit rate, cost tracking - New API:
GET /api/v1/search/analyticsfor search request statistics - DB migration:
request_typecolumn oncall_logsfor non-chat request tracking - Zod validation (
v1SearchSchema), auth-gated, cost recorded viarecordCost()
🔒 Security
- deps: Next.js 16.1.6 → 16.1.7 — fixes 6 CVEs:
- Critical: CVE-2026-29057 (HTTP request smuggling via http-proxy)
- High: CVE-2026-27977, CVE-2026-27978 (WebSocket + Server Actions)
- Medium: CVE-2026-27979, CVE-2026-27980, CVE-2026-jcc7
📁 New Files
| File | Purpose |
|---|---|
open-sse/handlers/search.ts |
Search handler with 5-provider routing |
open-sse/config/searchRegistry.ts |
Provider registry (auth, cost, quota, TTL) |
open-sse/services/searchCache.ts |
In-memory cache with request coalescing |
src/app/api/v1/search/route.ts |
Next.js route (POST + GET) |
src/app/api/v1/search/analytics/route.ts |
Search stats API |
src/app/(dashboard)/dashboard/analytics/SearchAnalyticsTab.tsx |
Analytics dashboard tab |
src/lib/db/migrations/007_search_request_type.sql |
DB migration |
tests/unit/search-registry.test.mjs |
277 lines of unit tests |
[2.7.0] — 2026-03-17
Sprint: ClawRouter-inspired features — toolCalling flag, multilingual intent detection, benchmark-driven fallback, request deduplication, pluggable RouterStrategy, Grok-4 Fast + GLM-5 + MiniMax M2.5 + Kimi K2.5 pricing.
✨ New Models & Pricing
- feat(pricing): xAI Grok-4 Fast —
$0.20/$0.50 per 1M tokens, 1143ms p50 latency, tool calling supported - feat(pricing): xAI Grok-4 (standard) —
$0.20/$1.50 per 1M tokens, reasoning flagship - feat(pricing): GLM-5 via Z.AI —
$0.5/1M, 128K output context - feat(pricing): MiniMax M2.5 —
$0.30/1M input, reasoning + agentic tasks - feat(pricing): DeepSeek V3.2 — updated pricing
$0.27/$1.10 per 1M - feat(pricing): Kimi K2.5 via Moonshot API — direct Moonshot API access
- feat(providers): Z.AI provider added (
zaialias) — GLM-5 family with 128K output
🧠 Routing Intelligence
- feat(registry):
toolCallingflag per model in provider registry — combos can now prefer/require tool-calling capable models - feat(scoring): Multilingual intent detection for AutoCombo scoring — PT/ZH/ES/AR script/language patterns influence model selection per request context
- feat(fallback): Benchmark-driven fallback chains — real latency data (p50 from
comboMetrics) used to re-order fallback priority dynamically - feat(dedup): Request deduplication via content-hash — 5-second idempotency window prevents duplicate provider calls from retrying clients
- feat(router): Pluggable
RouterStrategyinterface inautoCombo/routerStrategy.ts— custom routing logic can be injected without modifying core
🔧 MCP Server Improvements
- feat(mcp): 2 new advanced tool schemas:
omniroute_get_provider_metrics(p50/p95/p99 per provider) andomniroute_explain_route(routing decision explanation) - feat(mcp): MCP tool auth scopes updated —
metrics:readscope added for provider metrics tools - feat(mcp):
omniroute_best_combo_for_tasknow acceptslanguageHintparameter for multilingual routing
📊 Observability
- feat(metrics):
comboMetrics.tsextended with real-time latency percentile tracking per provider/account - feat(health): Health API (
/api/monitoring/health) now returns per-providerp50LatencyanderrorRatefields - feat(usage): Usage history migration for per-model latency tracking
🗄️ DB Migrations
- feat(migrations): New column
latency_p50incombo_metricstable — zero-breaking, safe for existing users
🐛 Bug Fixes / Closures
- close(#411): better-sqlite3 hashed module resolution on Windows — fixed in v2.6.10 (
f02c5b5) - close(#409): GitHub Copilot chat completions fail with Claude models when files attached — fixed in v2.6.9 (
838f1d6) - close(#405): Duplicate of #411 — resolved
[2.6.10] — 2026-03-17
Windows fix: better-sqlite3 prebuilt download without node-gyp/Python/MSVC (#426).
🐛 Bug Fixes
- fix(install/#426): On Windows,
npm install -g omnirouteused to fail withbetter_sqlite3.node is not a valid Win32 applicationbecause the bundled native binary was compiled for Linux. Adds Strategy 1.5 toscripts/postinstall.mjs: uses@mapbox/node-pre-gyp install --fallback-to-build=false(bundled withinbetter-sqlite3) to download the correct prebuilt binary for the current OS/arch without requiring any build tools (no node-gyp, no Python, no MSVC). Falls back tonpm rebuildonly if the download fails. Adds platform-specific error messages with clear manual fix instructions.
[2.6.9] — 2026-03-17
CI fixes (t11 any-budget), bug fix #409 (file attachments via Copilot+Claude), release workflow correction.
🐛 Bug Fixes
- fix(ci): Remove word "any" from comments in
openai-responses.tsandchatCore.tsthat were failing the t11anybudget check (false positive from regex counting comments) - fix(chatCore): Normalize unsupported content part types before forwarding to providers (#409 — Cursor sends
{type:"file"}when.mdfiles are attached; Copilot and other OpenAI-compat providers reject with "type has to be either 'image_url' or 'text'"; fix convertsfile/documentblocks totextand drops unknown types)
🔧 Workflow
- chore(generate-release): Add ATOMIC COMMIT RULE — version bump (
npm version patch) MUST happen before committing feature files to ensure tag always points to a commit containing all version changes together
[2.6.8] — 2026-03-17
Sprint: Combo as Agent (system prompt + tool filter), Context Caching Protection, Auto-Update, Detailed Logs, MITM Kiro IDE.
🗄️ DB Migrations (zero-breaking — safe for existing users)
- 005_combo_agent_fields.sql:
ALTER TABLE combos ADD COLUMN system_message TEXT DEFAULT NULL,tool_filter_regex TEXT DEFAULT NULL,context_cache_protection INTEGER DEFAULT 0 - 006_detailed_request_logs.sql: New
request_detail_logstable with 500-entry ring-buffer trigger, opt-in via settings toggle
✨ Features
- feat(combo): System Message Override per Combo (#399 —
system_messagefield replaces or injects system prompt before forwarding to provider) - feat(combo): Tool Filter Regex per Combo (#399 —
tool_filter_regexkeeps only tools matching pattern; supports OpenAI + Anthropic formats) - feat(combo): Context Caching Protection (#401 —
context_cache_protectiontags responses with<omniModel>provider/model</omniModel>and pins model for session continuity) - feat(settings): Auto-Update via Settings (#320 —
GET /api/system/version+POST /api/system/update— checks npm registry and updates in background with pm2 restart) - feat(logs): Detailed Request Logs (#378 — captures full pipeline bodies at 4 stages: client request, translated request, provider response, client response — opt-in toggle, 64KB trim, 500-entry ring-buffer)
- feat(mitm): MITM Kiro IDE profile (#336 —
src/mitm/targets/kiro.tstargets api.anthropic.com, reuses existing MITM infrastructure)
[2.6.7] — 2026-03-17
Sprint: SSE improvements, local provider_nodes extensions, proxy registry, Claude passthrough fixes.
✨ Features
- feat(health): Background health check for local
provider_nodeswith exponential backoff (30s→300s) andPromise.allSettledto avoid blocking (#423, @Regis-RCR) - feat(embeddings): Route
/v1/embeddingsto localprovider_nodes—buildDynamicEmbeddingProvider()with hostname validation (#422, @Regis-RCR) - feat(audio): Route TTS/STT to local
provider_nodes—buildDynamicAudioProvider()with SSRF protection (#416, @Regis-RCR) - feat(proxy): Proxy registry, management APIs, and quota-limit generalization (#429, @Regis-RCR)
🐛 Bug Fixes
- fix(sse): Strip Claude-specific fields (
metadata,anthropic_version) when target is OpenAI-compat (#421, @prakersh) - fix(sse): Extract Claude SSE usage (
input_tokens,output_tokens, cache tokens) in passthrough stream mode (#420, @prakersh) - fix(sse): Generate fallback
call_idfor tool calls with missing/empty IDs (#419, @prakersh) - fix(sse): Claude-to-Claude passthrough — forward body completely untouched, no re-translation (#418, @prakersh)
- fix(sse): Filter orphaned
tool_resultitems after Claude Code context compaction to avoid 400 errors (#417, @prakersh) - fix(sse): Skip empty-name tool calls in Responses API translator to prevent
placeholder_toolinfinite loops (#415, @prakersh) - fix(sse): Strip empty text content blocks before translation (#427, @prakersh)
- fix(api): Add
refreshable: trueto Claude OAuth test config (#428, @prakersh)
📦 Dependencies
- Bump
vitest,@vitest/*and related devDependencies (#414, @dependabot)
[2.6.6] — 2026-03-17
Hotfix: Turbopack/Docker compatibility — remove
node:protocol from allsrc/imports.
🐛 Bug Fixes
- fix(build): Removed
node:protocol prefix fromimportstatements in 17 files undersrc/. Thenode:fs,node:path,node:url,node:osetc. imports causedEcmascript file had an erroron Turbopack builds (Next.js 15 Docker) and on upgrades from older npm global installs. Affected files:migrationRunner.ts,core.ts,backup.ts,prompts.ts,dataPaths.ts, and 12 others insrc/app/api/andsrc/lib/. - chore(workflow): Updated
generate-release.mdto make Docker Hub sync and dual-VPS deploy mandatory steps in every release.
[2.6.5] — 2026-03-17
Sprint: reasoning model param filtering, local provider 404 fix, Kilo Gateway provider, dependency bumps.
✨ New Features
- feat(api): Added Kilo Gateway (
api.kilo.ai) as a new API Key provider (aliaskg) — 335+ models, 6 free models, 3 auto-routing models (kilo-auto/frontier,kilo-auto/balanced,kilo-auto/free). Passthrough models supported via/api/gateway/modelsendpoint. (PR #408 by @Regis-RCR)
🐛 Bug Fixes
- fix(sse): Strip unsupported parameters for reasoning models (o1, o1-mini, o1-pro, o3, o3-mini). Models in the
o1/o3family rejecttemperature,top_p,frequency_penalty,presence_penalty,logprobs,top_logprobs, andnwith HTTP 400. Parameters are now stripped at thechatCorelayer before forwarding. Uses a declarativeunsupportedParamsfield per model and a precomputed O(1) Map for lookup. (PR #412 by @Regis-RCR) - fix(sse): Local provider 404 now results in a model-only lockout (5 seconds) instead of a connection-level lockout (2 minutes). When a local inference backend (Ollama, LM Studio, oMLX) returns 404 for an unknown model, the connection remains active and other models continue working immediately. Also fixes a pre-existing bug where
modelwas not passed tomarkAccountUnavailable(). Local providers detected via hostname (localhost,127.0.0.1,::1, extensible viaLOCAL_HOSTNAMESenv var). (PR #410 by @Regis-RCR)
📦 Dependencies
better-sqlite312.6.2 → 12.8.0undici7.24.2 → 7.24.4https-proxy-agent7 → 8agent-base7 → 8
[2.6.4] — 2026-03-17
🐛 Bug Fixes
- fix(providers): Removed non-existent model names across 5 providers:
- gemini / gemini-cli: removed
gemini-3.1-pro/flashandgemini-3-*-preview(don't exist in Google API v1beta); replaced withgemini-2.5-pro,gemini-2.5-flash,gemini-2.0-flash,gemini-1.5-pro/flash - antigravity: removed
gemini-3.1-pro-high/lowandgemini-3-flash(invalid internal aliases); replaced with real 2.x models - github (Copilot): removed
gemini-3-flash-previewandgemini-3-pro-preview; replaced withgemini-2.5-flash - nvidia: corrected
nvidia/llama-3.3-70b-instruct→meta/llama-3.3-70b-instruct(NVIDIA NIM usesmeta/namespace for Meta models); addednvidia/llama-3.1-70b-instructandnvidia/llama-3.1-405b-instruct
- gemini / gemini-cli: removed
- fix(db/combo): Updated
free-stackcombo on remote DB: removedqw/qwen3-coder-plus(expired refresh token), correctednvidia/llama-3.3-70b-instruct→nvidia/meta/llama-3.3-70b-instruct, correctedgemini/gemini-3.1-flash→gemini/gemini-2.5-flash, addedif/deepseek-v3.2
[2.6.3] — 2026-03-16
Sprint: zod/pino hash-strip baked into build pipeline, Synthetic provider added, VPS PM2 path corrected.
🐛 Bug Fixes
- fix(build): Turbopack hash-strip now runs at compile time for ALL packages — not just
better-sqlite3. Step 5.6 inprepublish.mjswalks every.jsinapp/.next/server/and strips the 16-char hex suffix from any hashedrequire(). Fixeszod-dcb22c...,pino-..., etc. MODULE_NOT_FOUND on global npm installs. Closes #398 - fix(deploy): PM2 on both VPS was pointing to stale git-clone directories. Reconfigured to
app/server.jsin the npm global package. Updated/deploy-vpsworkflow to usenpm pack + scp(npm registry rejects 299MB packages).
✨ Features
- feat(provider): Synthetic (synthetic.new) — privacy-focused OpenAI-compatible inference.
passthroughModels: truefor dynamic HuggingFace model catalog. Initial models: Kimi K2.5, MiniMax M2.5, GLM 4.7, DeepSeek V3.2. (PR #404 by @Regis-RCR)
📋 Issues Closed
- close #398: npm hash regression — fixed by compile-time hash-strip in prepublish
- triage #324: Bug screenshot without steps — requested reproduction details
[2.6.2] — 2026-03-16
Sprint: module hashing fully fixed, 2 PRs merged (Anthropic tools filter + custom endpoint paths), Alibaba Cloud DashScope provider added, 3 stale issues closed.
🐛 Bug Fixes
- fix(build): Extended webpack
externalshash-strip to cover ALLserverExternalPackages, not justbetter-sqlite3. Next.js 16 Turbopack hasheszod,pino, and every other server-external package into names likezod-dcb22c6336e0bc69that don't exist innode_modulesat runtime. A HASH_PATTERN regex catch-all now strips the 16-char suffix and falls back to the base package name. Also addedNEXT_PRIVATE_BUILD_WORKER=0inprepublish.mjsto reinforce webpack mode, plus a post-build scan that reports any remaining hashed refs. (#396, #398, PR #403) - fix(chat): Anthropic-format tool names (
tool.namewithout.functionwrapper) were silently dropped by the empty-name filter introduced in #346. LiteLLM proxies requests withanthropic/prefix in Anthropic Messages API format, causing all tools to be filtered and Anthropic to return400: tool_choice.any may only be specified while providing tools. Fixed by falling back totool.namewhentool.function.nameis absent. Added 8 regression unit tests. (PR #397)
✨ Features
- feat(api): Custom endpoint paths for OpenAI-compatible provider nodes — configure
chatPathandmodelsPathper node (e.g./v4/chat/completions) in the provider connection UI. Includes a DB migration (003_provider_node_custom_paths.sql) and URL path sanitization (no..traversal, must start with/). (PR #400) - feat(provider): Alibaba Cloud DashScope added as OpenAI-compatible provider. International endpoint:
dashscope-intl.aliyuncs.com/compatible-mode/v1. 12 models:qwen-max,qwen-plus,qwen-turbo,qwen3-coder-plus/flash,qwq-plus,qwq-32b,qwen3-32b,qwen3-235b-a22b. Auth: Bearer API key.
📋 Issues Closed
- close #323: Cline connection error
[object Object]— fixed in v2.3.7; instructed user to upgrade from v2.2.9 - close #337: Kiro credit tracking — implemented in v2.5.5 (#381); pointed user to Dashboard → Usage
- triage #402: ARM64 macOS DMG damaged — requested macOS version, exact error, and advised
xattr -d com.apple.quarantineworkaround
[2.6.1] — 2026-03-15
Critical startup fix: v2.6.0 global npm installs crashed with a 500 error due to a Turbopack/webpack module-name hashing bug in the Next.js 16 instrumentation hook.
🐛 Bug Fixes
- fix(build): Force
better-sqlite3to always be required by its exact package name in the webpack server bundle. Next.js 16 compiled the instrumentation hook into a separate chunk and emittedrequire('better-sqlite3-<hash>')— a hashed module name that doesn't exist innode_modules— even though the package was listed inserverExternalPackages. Added an explicitexternalsfunction to the server webpack config so the bundler always emitsrequire('better-sqlite3'), resolving the startup500 Internal Server Erroron clean global installs. (#394, PR #395)
🔧 CI
- ci: Added
workflow_dispatchtonpm-publish.ymlwith version sync safeguard for manual triggers (#392) - ci: Added
workflow_dispatchtodocker-publish.yml, updated GitHub Actions to latest versions (#392)
[2.6.0] - 2026-03-15
Issue resolution sprint: 4 bugs fixed, logs UX improved, Kiro credit tracking added.
🐛 Bug Fixes
- fix(media): ComfyUI and SD WebUI no longer appear in the Media page provider list when unconfigured — fetches
/api/providerson mount and hides local providers with no connections (#390) - fix(auth): Round-robin no longer re-selects rate-limited accounts immediately after cooldown —
backoffLevelis now used as primary sort key in the LRU rotation (#340) - fix(oauth): Qoder (and other providers that redirect to their own UI) no longer leave the OAuth modal stuck at "Waiting for Authorization" — popup-closed detector auto-transitions to manual URL input mode (#344)
- fix(logs): Request log table is now readable in light mode — status badges, token counts, and combo tags use adaptive
dark:color classes (#378)
✨ Features
- feat(kiro): Kiro credit tracking added to usage fetcher — queries
getUserCreditsfrom AWS CodeWhisperer endpoint (#337)
🛠 Chores
- chore(tests): Aligned
test:plan3,test:fixes,test:securityto use sametsx/esmloader asnpm test— eliminates module resolution false negatives in targeted runs (PR #386)
[2.5.9] - 2026-03-15
Codex native passthrough fix + route body validation hardening.
🐛 Bug Fixes
- fix(codex): Preserve native Responses API passthrough for Codex clients — avoids unnecessary translation mutations (PR #387)
- fix(api): Validate request bodies on pricing/sync and task-routing routes — prevents crashes from malformed inputs (PR #388)
- fix(auth): JWT secrets persist across restarts via
src/lib/db/secrets.ts— eliminates 401 errors after pm2 restart (PR #388)
[2.5.8] - 2026-03-15
Build fix: restore VPS connectivity broken by v2.5.7 incomplete publish.
🐛 Bug Fixes
- fix(build):
scripts/prepublish.mjsstill used deprecated--webpackflag causing Next.js standalone build to fail silently — npm publish completed withoutapp/server.js, breaking VPS deployment
[2.5.7] - 2026-03-15
Media playground error handling fixes.
🐛 Bug Fixes
- fix(media): Transcription "API Key Required" false positive when audio contains no speech (music, silence) — now shows "No speech detected" instead
- fix(media):
upstreamErrorResponseinaudioTranscription.tsandaudioSpeech.tsnow returns proper JSON ({error:{message}}), enabling correct 401/403 credential error detection in the MediaPageClient - fix(media):
parseApiErrornow handles Deepgram'serr_msgfield and detects"api key"in error messages for accurate credential error classification
[2.5.6] - 2026-03-15
Critical security/auth fixes: Antigravity OAuth broken + JWT sessions lost after restart.
🐛 Bug Fixes
- fix(oauth) #384: Antigravity Google OAuth now correctly sends
client_secretto the token endpoint. The fallback forANTIGRAVITY_OAUTH_CLIENT_SECRETwas an empty string, which is falsy — soclient_secretwas never included in the request, causing"client_secret is missing"errors for all users without a custom env var. Closes #383. - fix(auth) #385:
JWT_SECRETis now persisted to SQLite (namespace='secrets') on first generation and reloaded on subsequent starts. Previously, a new random secret was generated each process startup, invalidating all existing cookies/sessions after any restart or upgrade. Affects bothJWT_SECRETandAPI_KEY_SECRET. Closes #382.
[2.5.5] - 2026-03-15
Model list dedup fix, Electron standalone build hardening, and Kiro credit tracking.
🐛 Bug Fixes
- fix(models) #380:
GET /api/modelsnow includes provider aliases when building the active-provider filter — models forclaude(aliascc) andgithub(aliasgh) were always shown regardless of whether a connection was configured, becausePROVIDER_MODELSkeys are aliases but DB connections are stored under provider IDs. Fixed by expanding each active provider ID to also include its alias viaPROVIDER_ID_TO_ALIAS. Closes #353. - fix(electron) #379: New
scripts/prepare-electron-standalone.mjsstages a dedicated/.next/electron-standalonebundle before Electron packaging. Aborts with a clear error ifnode_modulesis a symlink (electron-builder would ship a runtime dependency on the build machine). Cross-platform path sanitization viapath.basename. By @kfiramar.
✨ New Features
- feat(kiro) #381: Kiro credit balance tracking — usage endpoint now returns credit data for Kiro accounts by calling
codewhisperer.us-east-1.amazonaws.com/getUserCredits(same endpoint Kiro IDE uses internally). Returns remaining credits, total allowance, renewal date, and subscription tier. Closes #337.
[2.5.4] - 2026-03-15
Logger startup fix, login bootstrap security fix, and dev HMR reliability improvement. CI infrastructure hardened.
🐛 Bug Fixes (PRs #374, #375, #376 by @kfiramar)
- fix(logger) #376: Restore pino transport logger path —
formatters.levelcombined withtransport.targetsis rejected by pino. Transport-backed configs now strip the level formatter viagetTransportCompatibleConfig(). Also corrects numeric level mapping in/api/logs/console:30→info, 40→warn, 50→error(was shifted by one). - fix(login) #375: Login page now bootstraps from the public
/api/settings/require-loginendpoint instead of the protected/api/settings. In password-protected setups, the pre-auth page was receiving a 401 and falling back to safe defaults unnecessarily. The public route now returns all bootstrap metadata (requireLogin,hasPassword,setupComplete) with a conservative 200 fallback on error. - fix(dev) #374: Add
localhostand127.0.0.1toallowedDevOriginsinnext.config.mjs— HMR websocket was blocked when accessing the app via loopback address, producing repeated cross-origin warnings.
🔧 CI & Infrastructure
- ESLint OOM fix:
eslint.config.mjsnow ignoresvscode-extension/**,electron/**,docs/**,app/.next/**, andclipr/**— ESLint was crashing with a JS heap OOM by scanning VS Code binary blobs and compiled chunks. - Unit test fix: Removed stale
ALTER TABLE provider_connections ADD COLUMN "group"from 2 test files — column is now part of the base schema (added in #373), causingSQLITE_ERROR: duplicate column nameon every CI run. - Pre-commit hook: Added
npm run test:unitto.husky/pre-commit— unit tests now block broken commits before they reach CI.
[2.5.3] - 2026-03-14
Critical bugfixes: DB schema migration, startup env loading, provider error state clearing, and i18n tooltip fix. Code quality improvements on top of each PR.
🐛 Bug Fixes (PRs #369, #371, #372, #373 by @kfiramar)
- fix(db) #373: Add
provider_connections.groupcolumn to base schema + backfill migration for existing databases — column was used in all queries but missing from schema definition - fix(i18n) #371: Replace non-existent
t("deleteConnection")key with existingproviders.deletekey — fixesMISSING_MESSAGE: providers.deleteConnectionruntime error on provider detail page - fix(auth) #372: Clear stale error metadata (
errorCode,lastErrorType,lastErrorSource) from provider accounts after genuine recovery — previously, recovered accounts kept appearing as failed - fix(startup) #369: Unify env loading across
npm run start,run-standalone.mjs, and Electron to respectDATA_DIR/.env → ~/.omniroute/.env → ./.envpriority — prevents generating a newSTORAGE_ENCRYPTION_KEYover an existing encrypted database
🔧 Code Quality
- Documented
result.successvsresponse?.okpatterns inauth.ts(both intentional, now explained) - Normalized
overridePath?.trim()inelectron/main.jsto matchbootstrap-env.mjs - Added
preferredEnvmerge order comment in Electron startup
Codex account quota policy with auto-rotation, fast tier toggle, gpt-5.4 model, and analytics label fix.
✨ New Features (PRs #366, #367, #368)
- Codex Quota Policy (PR #366): Per-account 5h/weekly quota window toggles in Provider dashboard. Accounts are automatically skipped when enabled windows reach 90% threshold and re-admitted after
resetAt. IncludesquotaCache.tswith side-effect free status getter. - Codex Fast Tier Toggle (PR #367): Dashboard → Settings → Codex Service Tier. Default-off toggle injects
service_tier: "flex"only for Codex requests, reducing cost ~80%. Full stack: UI tab + API endpoint + executor + translator + startup restore. - gpt-5.4 Model (PR #368): Adds
cx/gpt-5.4andcodex/gpt-5.4to the Codex model registry. Regression test included.
🐛 Bug Fixes
- fix #356: Analytics charts (Top Provider, By Account, Provider Breakdown) now display human-readable provider names/labels instead of raw internal IDs for OpenAI-compatible providers.
Major release: strict-random routing strategy, API key access controls, connection groups, external pricing sync, and critical bug fixes for thinking models, combo testing, and tool name validation.
✨ New Features (PRs #363 & #365)
- Strict-Random Routing Strategy: Fisher-Yates shuffle deck with anti-repeat guarantee and mutex serialization for concurrent requests. Independent decks per combo and per provider.
- API Key Access Controls:
allowedConnections(restrict which connections a key can use),is_active(enable/disable key with 403),accessSchedule(time-based access control),autoResolvetoggle, rename keys via PATCH. - Connection Groups: Group provider connections by environment. Accordion view in Limits page with localStorage persistence and smart auto-switch.
- External Pricing Sync (LiteLLM): 3-tier pricing resolution (user overrides → synced → defaults). Opt-in via
PRICING_SYNC_ENABLED=true. MCP toolomniroute_sync_pricing. 23 new tests. - i18n: 30 languages updated with strict-random strategy, API key management strings. pt-BR fully translated.
🐛 Bug Fixes
- fix #355: Stream idle timeout increased from 60s to 300s — prevents aborting extended-thinking models (claude-opus-4-6, o3, etc.) during long reasoning phases. Configurable via
STREAM_IDLE_TIMEOUT_MS. - fix #350: Combo test now bypasses
REQUIRE_API_KEY=trueusing internal header, and uses OpenAI-compatible format universally. Timeout extended from 15s to 20s. - fix #346: Tools with empty
function.name(forwarded by Claude Code) are now filtered before upstream providers receive them, preventing "Invalid input[N].name: empty string" errors.
🗑️ Closed Issues
- #341: Debug section removed — replacement is
/dashboard/logsand/dashboard/health.
API Key Round-Robin support for multi-key provider setups, and confirmation of wildcard routing and quota window rolling already in place.
✨ New Features
- API Key Round-Robin (T07): Provider connections can now hold multiple API keys (Edit Connection → Extra API Keys). Requests rotate round-robin between primary + extra keys via
providerSpecificData.extraApiKeys[]. Keys are held in-memory indexed per connection — no DB schema changes required.
📝 Already Implemented (confirmed in audit)
- Wildcard Model Routing (T13):
wildcardRouter.tswith glob-style wildcard matching (gpt*,claude-?-sonnet, etc.) is already integrated intomodel.tswith specificity ranking. - Quota Window Rolling (T08):
accountFallback.ts:isModelLocked()already auto-advances the window — ifDate.now() > entry.until, lock is deleted immediately (no stale blocking).
UI polish, routing strategy additions, and graceful error handling for usage limits.
✨ New Features
- Fill-First & P2C Routing Strategies: Added
fill-first(drain quota before moving on) andp2c(Power-of-Two-Choices low-latency selection) to combo strategy picker, with full guidance panels and color-coded badges. - Free Stack Preset Models: Creating a combo with the Free Stack template now auto-fills 7 best-in-class free provider models (Gemini CLI, Kiro, Qoder×2, Qwen, NVIDIA NIM, Groq). Users just activate the providers and get a $0/month combo out-of-the-box.
- Wider Combo Modal: Create/Edit combo modal now uses
max-w-4xlfor comfortable editing of large combos.
🐛 Bug Fixes
- Limits page HTTP 500 for Codex & GitHub:
getCodexUsage()andgetGitHubUsage()now return a user-friendly message when the provider returns 401/403 (expired token), instead of throwing and causing a 500 error on the Limits page. - MaintenanceBanner false-positive: Banner no longer shows "Server is unreachable" spuriously on page load. Fixed by calling
checkHealth()immediately on mount and removing staleshow-state closure. - Provider icon tooltips: Edit (pencil) and delete icon buttons in the provider connection row now have native HTML tooltips — all 6 action icons are now self-documented.
Multiple improvements from community issue analysis, new provider support, bug fixes for token tracking, model routing, and streaming reliability.
✨ New Features
- Task-Aware Smart Routing (T05): Automatic model selection based on request content type — coding → deepseek-chat, analysis → gemini-2.5-pro, vision → gpt-4o, summarization → gemini-2.5-flash. Configurable via Settings. New
GET/PUT/POST /api/settings/task-routingAPI. - HuggingFace Provider: Added HuggingFace Router as an OpenAI-compatible provider with Llama 3.1 70B/8B, Qwen 2.5 72B, Mistral 7B, Phi-3.5 Mini.
- Vertex AI Provider: Added Vertex AI (Google Cloud) provider with Gemini 2.5 Pro/Flash, Gemma 2 27B, Claude via Vertex.
- Playground File Uploads: Audio upload for transcription, image upload for vision models (auto-detect by model name), inline image rendering for image generation results.
- Model Select Visual Feedback: Already-added models in combo picker now show ✓ green badge — prevents duplicate confusion.
- Qwen Compatibility (PR #352): Updated User-Agent and CLI fingerprint settings for Qwen provider compatibility.
- Round-Robin State Management (PR #349): Enhanced round-robin logic to handle excluded accounts and maintain rotation state correctly.
- Clipboard UX (PR #360): Hardened clipboard operations with fallback for non-secure contexts; Claude tool normalization improvements.
🐛 Bug Fixes
- Fix #302 — OpenAI SDK stream=False drops tool_calls: T01 Accept header negotiation no longer forces streaming when
body.streamis explicitlyfalse. Was causing tool_calls to be silently dropped when using the OpenAI Python SDK in non-streaming mode. - Fix #73 — Claude Haiku routed to OpenAI without provider prefix:
claude-*models sent without a provider prefix now correctly route to theantigravity(Anthropic) provider. Addedgemini-*/gemma-*→geminiheuristic as well. - Fix #74 — Token counts always 0 for Antigravity/Claude streaming: The
message_startSSE event which carriesinput_tokenswas not being parsed byextractUsage(), causing all input token counts to drop. Input/output token tracking now works correctly for streaming responses. - Fix #180 — Model import duplicates with no feedback:
ModelSelectModalnow shows ✓ green highlight for models already in the combo, making it obvious they're already added. - Media page generation errors: Image results now render as
<img>tags instead of raw JSON. Transcription results shown as readable text. Credential errors show an amber banner instead of silent failure. - Token refresh button on provider page: Manual token refresh UI added for OAuth providers.
🔧 Improvements
- Provider Registry: HuggingFace and Vertex AI added to
providerRegistry.tsandproviders.ts(frontend). - Read Cache: New
src/lib/db/readCache.tsfor efficient DB read caching. - Quota Cache: Improved quota cache with TTL-based eviction.
📦 Dependencies
dompurify→ 3.3.3 (PR #347)undici→ 7.24.2 (PR #348, #361)docker/setup-qemu-action→ v4 (PR #342)docker/setup-buildx-action→ v4 (PR #343)
📁 New Files
| File | Purpose |
|---|---|
open-sse/services/taskAwareRouter.ts |
Task-aware routing logic (7 task types) |
src/app/api/settings/task-routing/route.ts |
Task routing config API |
src/app/api/providers/[id]/refresh/route.ts |
Manual OAuth token refresh |
src/lib/db/readCache.ts |
Efficient DB read cache |
src/shared/utils/clipboard.ts |
Hardened clipboard with fallback |
[2.4.1] - 2026-03-13
🐛 Fix
- Combos modal: Free Stack visible and prominent — Free Stack template was hidden (4th in 3-column grid). Fixed: moved to position 1, switched to 2x2 grid so all 4 templates are visible, green border + FREE badge highlight.
[2.4.0] - 2026-03-13
Major release — Free Stack ecosystem, transcription playground overhaul, 44+ providers, comprehensive free tier documentation, and UI improvements across the board.
✨ Features
- Combos: Free Stack template — New 4th template "Free Stack ($0)" using round-robin across Kiro + Qoder + Qwen + Gemini CLI. Suggests the pre-built zero-cost combo on first use.
- Media/Transcription: Deepgram as default — Deepgram (Nova 3, $200 free) is now the default transcription provider. AssemblyAI ($50 free) and Groq Whisper (free forever) shown with free credit badges.
- README: "Start Free" section — New early-README 5-step table showing how to set up zero-cost AI in minutes.
- README: Free Transcription Combo — New section with Deepgram/AssemblyAI/Groq combo suggestion and per-provider free credit details.
- providers.ts: hasFree flag — NVIDIA NIM, Cerebras, and Groq marked with hasFree badge and freeNote for the providers UI.
- i18n: templateFreeStack keys — Free Stack combo template translated and synced to all 30 languages.
[2.3.16] - 2026-03-13
📖 Documentation
- README: 44+ Providers — Updated all 3 occurrences of "36+ providers" to "44+" reflecting the actual codebase count (44 providers in providers.ts)
- README: New Section "🆓 Free Models — What You Actually Get" — Added 7-provider table with per-model rate limits for: Kiro (Claude unlimited via AWS Builder ID), Qoder (5 models unlimited), Qwen (4 models unlimited), Gemini CLI (180K/mo), NVIDIA NIM (~40 RPM dev-forever), Cerebras (1M tok/day / 60K TPM), Groq (30 RPM / 14.4K RPD). Includes the /usr/bin/bash Ultimate Free Stack combo recommendation.
- README: Pricing Table Updated — Added Cerebras to API KEY tier, fixed NVIDIA from "1000 credits" to "dev-forever free", updated Qoder/Qwen model counts and names
- README: Qoder 8→5 models (named: kimi-k2-thinking, qwen3-coder-plus, deepseek-r1, minimax-m2, kimi-k2)
- README: Qwen 3→4 models (named: qwen3-coder-plus, qwen3-coder-flash, qwen3-coder-next, vision-model)
[2.3.15] - 2026-03-13
✨ Features
- Auto-Combo Dashboard (Tier Priority): Added
🏷️ Tieras the 7th scoring factor label in the/dashboard/auto-combofactor breakdown display — all 7 Auto-Combo scoring factors are now visible. - i18n — autoCombo section: Added 20 new translation keys for the Auto-Combo dashboard (
title,status,modePack,providerScores,factorTierPriority, etc.) to all 30 language files.
[2.3.14] - 2026-03-13
🐛 Bug Fixes
- Qoder OAuth (#339): Restored the valid default
clientSecret— was previously an empty string, causing "Bad client credentials" on every connect attempt. The public credential is now the default fallback (overridable viaQODER_OAUTH_CLIENT_SECRETenv var). - MITM server not found (#335):
prepublish.mjsnow compilessrc/mitm/*.tsto JavaScript usingtscbefore copying to the npm bundle. Previously only raw.tsfiles were copied — meaningserver.jsnever existed in npm/Volta global installs. - GeminiCLI missing projectId (#338): Instead of throwing a hard 500 error when
projectIdis missing from stored credentials (e.g. after Docker restart), OmniRoute now logs a warning and attempts the request — returning a meaningful provider-side error instead of an OmniRoute crash. - Electron version mismatch (#323): Synced
electron/package.jsonversion to2.3.13(was2.0.13) so the desktop binary version matches the npm package.
✨ New Models (#334)
- Kiro:
claude-sonnet-4,claude-opus-4.6,deepseek-v3.2,minimax-m2.1,qwen3-coder-next,auto - Codex:
gpt5.4
🔧 Improvements
- Tier Scoring (API + Validation): Added
tierPriority(weight0.05) to theScoringWeightsZod schema and thecombos/autoAPI route — the 7th scoring factor is now fully accepted by the REST API and validated on input.stabilityweight adjusted from0.10to0.05to keep total sum =1.0.
✨ New Features
- Tiered Quota Scoring (Auto-Combo): Added
tierPriorityas a 7th scoring factor — accounts with Ultra/Pro tiers are now preferred over Free tiers when other factors are equal. New optional fieldsaccountTierandquotaResetIntervalSecsonProviderCandidate. All 4 mode packs updated (ship-fast,cost-saver,quality-first,offline-friendly). - Intra-Family Model Fallback (T5): When a model is unavailable (404/400/403), OmniRoute now automatically falls back to sibling models from the same family before returning an error (
modelFamilyFallback.ts). - Configurable API Bridge Timeout:
API_BRIDGE_PROXY_TIMEOUT_MSenv var lets operators tune the proxy timeout (default 30s). Fixes 504 errors on slow upstream responses. (#332) - Star History: Replaced star-history.com widget with starchart.cc (
?variant=adaptive) in all 30 READMEs — adapts to light/dark theme, real-time updates.
🐛 Bug Fixes
- Auth — First-time password:
INITIAL_PASSWORDenv var is now accepted when setting the first dashboard password. UsestimingSafeEqualfor constant-time comparison, preventing timing attacks. (#333) - README Truncation: Fixed a missing
</details>closing tag in the Troubleshooting section that caused GitHub to stop rendering everything below it (Tech Stack, Docs, Roadmap, Contributors). - pnpm install: Removed redundant
@swc/helpersoverride frompackage.jsonthat conflicted with the direct dependency, causingEOVERRIDEerrors on pnpm. Addedpnpm.onlyBuiltDependenciesconfig. - CLI Path Injection (T12): Added
isSafePath()validator incliRuntime.tsto block path traversal and shell metacharacters inCLI_*_BINenv vars. - CI: Regenerated
package-lock.jsonafter override removal to fixnpm cifailures on GitHub Actions.
🔧 Improvements
- Response Format (T1):
response_format(json_schema/json_object) now injected as a system prompt for Claude, enabling structured output compatibility. - 429 Retry (T2): Intra-URL retry for 429 responses (2× attempts with 2s delay) before falling back to next URL.
- Gemini CLI Headers (T3): Added
User-AgentandX-Goog-Api-Clientfingerprint headers for Gemini CLI compatibility. - Pricing Catalog (T9): Added
deepseek-3.1,deepseek-3.2, andqwen3-coder-nextpricing entries.
📁 New Files
| File | Purpose |
|---|---|
open-sse/services/modelFamilyFallback.ts |
Model family definitions and intra-family fallback logic |
Fixed
- KiloCode: kilocode healthcheck timeout already fixed in v2.3.11
- OpenCode: Add opencode to cliRuntime registry with 15s healthcheck timeout
- OpenClaw / Cursor: Increase healthcheck timeout to 15s for slow-start variants
- VPS: Install droid and openclaw npm packages; activate CLI_EXTRA_PATHS for kiro-cli
- cliRuntime: Add opencode tool registration and increase timeout for continue
[2.3.11] - 2026-03-12
Fixed
- KiloCode healthcheck: Increase
healthcheckTimeoutMsfrom 4000ms to 15000ms — kilocode renders an ASCII logo banner on startup causing falsehealthcheck_failedon slow/cold-start environments
[2.3.10] - 2026-03-12
Fixed
- Lint: Fix
check:any-budget:t11failure — replaceas anywithas Record<string, unknown>in OAuthModal.tsx (3 occurrences)
Docs
- CLI-TOOLS.md: Complete guide for all 11 CLI tools (claude, codex, gemini, opencode, cline, kilocode, continue, kiro-cli, cursor, droid, openclaw)
- i18n: CLI-TOOLS.md synced to 30 languages with translated title + intro
[2.3.8] - 2026-03-12
[2.3.9] - 2026-03-12
Added
- /v1/completions: New legacy OpenAI completions endpoint — accepts both
promptstring andmessagesarray, normalizes to chat format automatically - EndpointPage: Now shows all 3 OpenAI-compatible endpoint types: Chat Completions, Responses API, and Legacy Completions
- i18n: Added
completionsLegacy/completionsLegacyDescto 30 language files
Fixed
- OAuthModal: Fix
[object Object]displayed on all OAuth connection errors — properly extract.messagefrom error response objects in all 3throw new Error(data.error)calls (exchange, device-code, authorize) - Affects Cline, Codex, GitHub, Qwen, Kiro, and all other OAuth providers
[2.3.7] - 2026-03-12
Fixed
- Cline OAuth: Add
decodeURIComponentbefore base64 decode so URL-encoded auth codes from the callback URL are parsed correctly, fixing "invalid or expired authorization code" errors on remote (LAN IP) setups - Cline OAuth:
mapTokensnow populatesname = firstName + lastName || emailso Cline accounts show real user names instead of "Account #ID" - OAuth account names: All OAuth exchange flows (exchange, poll, poll-callback) now normalize
name = emailwhen name is missing, so every OAuth account shows its email as the display label in the Providers dashboard - OAuth account names: Removed sequential "Account N" fallback in
db/providers.ts— accounts with no email/name now use a stable ID-based label viagetAccountDisplayName()instead of a sequential number that changes when accounts are deleted
[2.3.6] - 2026-03-12
Fixed
- Provider test batch: Fixed Zod schema to accept
providerId: null(frontend sends null for non-provider modes); was incorrectly returning "Invalid request" for all batch tests - Provider test modal: Fixed
[object Object]display by normalizing API error objects to strings before rendering insetTestResultsandProviderTestResultsView - i18n: Added missing keys
cliTools.toolDescriptions.opencode,cliTools.toolDescriptions.kiro,cliTools.guides.opencode,cliTools.guides.kirotoen.json - i18n: Synchronized 1111 missing keys across all 29 non-English language files using English values as fallbacks
[2.3.5] - 2026-03-11
Fixed
- @swc/helpers: Added permanent
postinstallfix to copy@swc/helpersinto the standalone app'snode_modules— prevents MODULE_NOT_FOUND crash on global npm installs
[2.3.4] - 2026-03-10
Added
- Multiple provider integrations and dashboard improvements