Commit graph

692 commits

Author SHA1 Message Date
diegosouzapw
dc103318ab fix: remove duplicate LOAD_CODE_ASSIST exports from merge artifact
Some checks are pending
CI / Build language matrix (push) Waiting to run
CI / i18n Validation (push) Blocked by required conditions
CI / Lint (push) Waiting to run
CI / PR Test Policy (push) Waiting to run
CI / Build (push) Waiting to run
CI / Package Artifact (push) Blocked by required conditions
CI / Electron Package Smoke (push) Blocked by required conditions
CI / Unit Tests (1/2) (push) Blocked by required conditions
CI / Unit Tests (2/2) (push) Blocked by required conditions
CI / Node 24 Compatibility (1/2) (push) Blocked by required conditions
CI / Node 24 Compatibility (2/2) (push) Blocked by required conditions
CI / Coverage (push) Blocked by required conditions
CI / SonarQube (push) Blocked by required conditions
CI / PR Coverage Comment (push) Blocked by required conditions
CI / E2E Tests (1/6) (push) Blocked by required conditions
CI / E2E Tests (2/6) (push) Blocked by required conditions
CI / E2E Tests (3/6) (push) Blocked by required conditions
CI / E2E Tests (4/6) (push) Blocked by required conditions
CI / E2E Tests (5/6) (push) Blocked by required conditions
CI / E2E Tests (6/6) (push) Blocked by required conditions
CI / Integration Tests (1/2) (push) Blocked by required conditions
CI / Integration Tests (2/2) (push) Blocked by required conditions
CI / Security Tests (push) Blocked by required conditions
CI / CI Dashboard (push) Blocked by required conditions
Publish to Docker Hub / Build and Push Docker (multi-arch) (push) Waiting to run
2026-04-28 03:14:45 -03:00
diegosouzapw
70d7fc71b3 fix: remove duplicate GEMINI_CLI_VERSION/SDK_VERSION exports from merge 2026-04-28 02:54:42 -03:00
diegosouzapw
96fa89052b chore: resolve merge conflicts from main into release/v3.7.2 2026-04-28 02:53:34 -03:00
backryun
a16e47c593
fix(providers): refresh web client user agents (#1699)
Integrated release/v3.7.2 changes — refreshed web client user agents, env docs, Gemini OAuth fix
2026-04-28 02:41:19 -03:00
Hernan Javier Ardila Sanchez
58fb988c52
fix(combo): complete context truncation hotfix from PR #1480 (#1685)
Integrated into release/v3.7.2 — completes context truncation hotfix (PR #1480 follow-up)
2026-04-28 02:28:26 -03:00
diegosouzapw
95f7c69e36 fix(memory): use user role for GLM/ZAI/Qianfan providers (#1701)
GLM/ZhipuAI rejects system role messages with 422 'Input should be
user or assistant'. When memory injection adds a system-role message,
GLM combo targets fail because the system message survives into the
upstream request.

Fix:
- injection.ts: add glm, glmt, glm-cn, zai, qianfan to
  PROVIDERS_WITHOUT_SYSTEM_MESSAGE so memory is injected as user role
- roleNormalizer.ts: add exact 'glm' model match to
  MODELS_WITHOUT_SYSTEM_ROLE for Pollinations and bare model ids

Test: 22 new unit tests covering all GLM variants + regression checks
for openai/anthropic providers.

Closes #1701
2026-04-28 02:16:16 -03:00
diegosouzapw
00ae48f58d fix(combo): trigger fallback on Anthropic thinking block signature errors (#1696)
Add 'Invalid signature in thinking block' to COMBO_BAD_REQUEST_FALLBACK_PATTERNS
so combo routing falls through to the next target instead of returning 400 directly.

This error occurs when extended thinking signatures expire between turns,
which is a model-specific issue that won't be fixed by retrying the same provider.

Closes #1696
2026-04-28 00:22:08 -03:00
diegosouzapw
bda1f290ac fix(combo): complete context truncation hotfix — cache getCombos(), resolve nested combos, deduplicate context overflow patterns (#1685)
- Add getCombosCached() with 10s TTL in chatCore.ts to avoid per-request DB lookups
- Pass allCombosData to resolveComboTargets() instead of null for nested combo resolution
- Consolidate COMBO_BAD_REQUEST_FALLBACK_PATTERNS with CONTEXT_OVERFLOW_REGEX from errorClassifier.ts
- Remove 10 duplicated context overflow patterns from combo.ts
- Export clearCombosCache() for cache invalidation
- Fix rebase artifact (>) from contributor's branch

Co-authored-by: Javier Ardila <hjasgr@gmail.com>
Closes #1470
2026-04-28 00:08:30 -03:00
Raxxoor
9c9ba8f2bd
fix(stream): fail zombie streams before accepting response (#1693)
Integrated into release/v3.7.2
2026-04-27 23:30:01 -03:00
payne
67bce7721b
fix(sse): sanitize OpenAI tool schemas for strict upstream validators (kimi-k2.6 via opencode-go) (#1692)
Integrated into release/v3.7.2
2026-04-27 23:27:20 -03:00
diegosouzapw
84fbfa36c6 fix(rate-limit): replace unsupported Bottleneck maxWait with job-level expiration (#1694)
Bottleneck v2.19.5 does not support a `maxWait` limiter/constructor option — it
was silently ignored, causing queued jobs to wait indefinitely when no 429 response
triggered the drop mechanism.

Replace with Bottleneck's supported `expiration` job-schedule option which rejects
any job that waits+executes longer than maxWaitMs. Also log expiration rejections
so they are observable in production.
2026-04-27 23:09:49 -03:00
diegosouzapw
a46e920148 fix: restore CORS_HEADERS import after main merge 2026-04-27 22:55:15 -03:00
diegosouzapw
cf959c768d Merge branch 'main' into release/v3.7.2 2026-04-27 22:53:02 -03:00
diegosouzapw
9e198184a7 fix(security): resolve 14 CodeQL code scanning alerts
- Replace polynomial regex /\/+$/ with loop-based stripTrailingSlashes()
  across 8 enterprise provider configs (azure-openai, azureAi, bedrock,
  datarobot, oci, sap, watsonx, audioSpeech) — fixes js/polynomial-redos

- Add prototype-pollution denylist guard in usageHistory.ts to reject
  __proto__/constructor/prototype as model keys — fixes
  js/prototype-polluting-assignment (#167, #168)

- Suppress 3 false-positive js/insufficient-password-hash alerts in
  chatgpt-web.ts and builtins.ts where SHA-256 is used for cache-key
  derivation, not password storage (#176, #177, #178)

- Add stripTrailingSlashes unit tests with ReDoS regression check
2026-04-27 20:00:10 -03:00
diegosouzapw
0274af8c9c fix(executors): truncate tools array to 128 items max in GitHub Copilot and OpenCode executors to mitigate 400 Bad Request errors (#1687) 2026-04-27 19:39:46 -03:00
diegosouzapw
905b7555c2 fix(codex): prevent unexpected protocol leakage and fabricated instructions on bare chat completion requests without tools (#1686) 2026-04-27 19:39:37 -03:00
diegosouzapw
b1974dac12 fix(responses): sanitize empty string placeholders from tool-call optional arguments in stream delta accumulation (#1674) 2026-04-27 19:39:28 -03:00
diegosouzapw
4ecddaacd9 ci: stabilize release branch checks 2026-04-27 18:55:29 -03:00
diegosouzapw
8a8e6ca349 fix(authz): Restore REQUIRE_API_KEY support in clientApi policy 2026-04-27 13:25:32 -03:00
clousky2020
cc07e5f7f6
fix: add body-read timeout to prevent stuck pending requests (#1680)
fix: add body-read timeout to prevent stuck pending requests — integrated into release/v3.7.2
2026-04-27 11:51:04 -03:00
Jack
31a0628cf1
fix(search): support optional bearer auth for SearXNG (#1683)
fix(search): support optional bearer auth for SearXNG — integrated into release/v3.7.2
2026-04-27 11:50:43 -03:00
diegosouzapw
18a25e4e4c fix: combo retry loop stops immediately on client disconnect (499) (#1681)
- Treat status 499 as terminal non-retryable error in both priority and
  round-robin combo loops — no fallback to other models when client is gone
- Propagate AbortSignal from request into handleComboChat so the combo
  loop can detect client disconnects before starting new model attempts
- Make retry/fallback delays abort-aware via signal.addEventListener
- Add 5 unit tests covering 499 early-exit, signal.aborted pre-check,
  multi-model abort, 502 contrast behavior, and abort-during-wait
2026-04-27 11:39:26 -03:00
Payne
1c6d54ef57
feat(muse-spark-web): continue the same meta.ai conversation across turns (#1673)
Integrated into release/v3.7.2 — implements conversation continuity for muse-spark-web executor with SHA-256 prefix hashing, TTL cache, and eviction-on-error
2026-04-27 10:36:03 -03:00
backryun
021cfd791f
fix(dev): enable Turbopack and repair Codex CORS headers (#1669)
Integrated into release/v3.7.2
2026-04-27 08:04:27 -03:00
Payne
ba6a8602fb
fix(muse-spark-web): update to Meta's Ecto-era persisted query (fixes 502 "Unknown type RewriteOptionsInput") (#1668)
Integrated into release/v3.7.2
2026-04-27 08:02:12 -03:00
diegosouzapw
d4a92830be chore(release): bump to v3.7.2 — changelog, docs, version sync 2026-04-27 07:57:15 -03:00
Jack
6e0b801b6a
fix(perplexity-web): update API version and user-agent (#1666)
Integrated into release/v3.7.2
2026-04-27 07:51:06 -03:00
diegosouzapw
6747e22757 fix(codex,db): resolve 6 issues — Codex 502, store default, migration guards
Fixes:
- fix(codex): rename getWreqWebsocket() → getCodexWebSocketTransport()
  Fixes the ReferenceError causing 502 on all Codex requests (#1652, #1653)

- fix(codex): default store to false instead of true
  Codex OAuth backend rejects store=true with 'Store must be set to false' (#1635)

- fix(db): add post-migration startup guards for combos.sort_order (#1657)
  and batches/files tables (#1648) — handles heuristic seeding edge case

- fix(db): renumber duplicate migration 032_create_reasoning_cache → 033

Closes #1635, #1648, #1652, #1653, #1657
Also closed as user-config: #1649 (Claude 429), #1659 (thought_signature)
2026-04-27 07:39:12 -03:00
abix5
6dd883e5f4 feat(authz): introduce centralized proxy-based authz pipeline and lifecycle policy (#1632)
Integrated into release/v3.7.2
2026-04-27 07:16:24 -03:00
Payne
4671a1eb98 fix(chatgpt-web): bound tls-client native deadlocks so requests never hang forever (#1664)
Integrated into release/v3.7.2
2026-04-27 07:16:24 -03:00
Randi
845e2b3d01
feat: configure call log pipeline artifacts (#1650)
Integrated into release/v3.7.2
2026-04-27 07:12:34 -03:00
Randi
bc91fb9e54
fix: avoid OpenAI stream options for Anthropic-compatible providers (#1654)
Integrated into release/v3.7.2
2026-04-27 07:12:25 -03:00
backryun
9d334c82b9
fix(grokweb):Update Request and Response Specifications (#1655)
Integrated into release/v3.7.2
2026-04-27 07:12:17 -03:00
Randi
98e70a706e
[urgent] fix gpt-5.5 websocket transport and model labels (#1656)
Integrated into release/v3.7.2
2026-04-27 07:12:08 -03:00
kfiramar
eec5fa3feb
Enable native Codex websocket responses on beta-gated models (#1658)
Integrated into release/v3.7.2
2026-04-27 07:11:59 -03:00
Gi99lin
712f1ea3e9
fix(codex): default gpt-5.5 to HTTP transport instead of WebSocket (#1660)
Integrated into release/v3.7.2
2026-04-27 07:11:51 -03:00
Jack
388b84de3c
fix(blackbox-web): set isPremium flag to true (#1661)
Integrated into release/v3.7.2
2026-04-27 07:11:43 -03:00
diegosouzapw
c9fc36ca14 feat(network): add guarded remote image fetch utility
Centralize remote image downloads behind a shared helper that
validates outbound URLs, enforces redirect and size limits, and
applies request timeouts before bytes are read.

Wire the helper into image generation and vision bridge flows so
remote image inputs and result URLs follow the same fetch policy and
block redirects to private hosts. Update key management routes to use
structured logging and document the WebSocket bridge secret in the
example environment file.
2026-04-27 02:25:46 -03:00
diegosouzapw
3008ba9a13 fix(transport): cap streaming logs and parse fragmented responses (#1647) 2026-04-27 01:45:36 -03:00
Raxxoor
97912c7d9c
fix(transport): harden GitHub and Kiro streaming (#1645)
Integrated into release/v3.7.1 — fixes GitHub executor concurrency bug, hardens Kiro streaming, adds defensive tool input parsing
2026-04-27 01:09:23 -03:00
diegosouzapw
19edb8efa4 fix(claude): stabilize billing header fingerprint for prompt-cache affinity (#1638)
The billing header fingerprint was computed from the first user message text
via computeFingerprint(), which changes every conversation turn. This mutated
the system[] prefix on each request, invalidating Anthropic's prompt-cache
prefix and forcing ~100% cache_create (vs 96% cache_read with stable prefix).

Now uses a per-day SHA-256 hash of the date + ccVersion, keeping the billing
header format while preserving prompt-cache prefix stability across turns.

Includes 6 unit tests.
2026-04-27 00:37:25 -03:00
diegosouzapw
52d5b86e88 fix(codex): use per-conversation session_id as prompt_cache_key (#1643)
The prompt_cache_key was derived from the account-wide workspaceId, meaning
all conversations from the same OAuth account shared one cache partition.
The official Codex CLI uses conversation_id (a unique UUID per session).

Priority: body.session_id > body.conversation_id > workspaceId.
Session IDs are captured BEFORE deletion from the body.

Includes 10 unit tests.
2026-04-27 00:37:15 -03:00
diegosouzapw
78b845b68c feat(account-fallback): add daily quota lockout for quota_exhausted errors (#1644)
When a provider returns 429 with quota_exhausted reason, set cooldown until
tomorrow 00:00 instead of exponential backoff. Includes isDailyQuotaExhausted()
detection in chat handler and unit tests.

Co-authored-by: clousky2020 <clousky2020@users.noreply.github.com>
2026-04-27 00:02:44 -03:00
Dendy Adi Nirwana
314cad79ba
fix(codex): avoid startup crash when wreq-js is unavailable (#1640)
Integrated into release/v3.7.1 — lazy-loads wreq-js WebSocket transport so server boots even when native module is unavailable. Fixes #1612.
2026-04-26 23:59:30 -03:00
Slavic Kozyuk
4eef082cf3
fix(usage): correct MiniMax token plan quota display (#1642)
Integrated into release/v3.7.1 — fixes inverted MiniMax quota display for token_plan/remains endpoint and rounds floating-point percentages.
2026-04-26 23:57:55 -03:00
Raxxoor
acf6ad4ba0
fix: route newer GitHub GPT models through Responses (#1641)
Integrated into release/v3.7.1 — routes gpt-5.4-nano, gpt-5.4-mini, gpt-5.4, and gpt-5.5 through the Responses API on GitHub Copilot.
2026-04-26 23:57:37 -03:00
diegosouzapw
f00e9a1272 fix(postinstall): extend native module repair to cover wreq-js for pnpm global installs (#1634)
- Add fixWreqJsBinary() to postinstall.mjs with 3-strategy repair:
  1. Copy platform binary from root node_modules
  2. Copy entire rust/ directory (all platform binaries)
  3. npm rebuild wreq-js fallback
- Fixes macOS arm64 global pnpm installs shipping only Linux binaries
- Adds reasoning replay cache feature for DeepSeek V4 (#1628)
- Updates CHANGELOG.md with both fixes
2026-04-26 20:15:47 -03:00
diegosouzapw
cabdfb04e7 fix(cache): replay cached reasoning for tool-calling think models
Prevent upstream 400 failures when clients omit prior
reasoning_content in multi-turn tool-calling conversations.

Capture reasoning_content from streaming and non-streaming assistant
responses, persist it in a memory-plus-SQLite cache keyed by
tool_call_id, and re-inject it on later requests when available.

Add the reasoning cache migration, service layer, authenticated API
endpoints, dashboard tab, and unit coverage to support inspection,
cleanup, and crash recovery.
2026-04-26 18:53:54 -03:00
backryun
4692621e4f
fix(antigravity): hide deprecated Gemini-Claude models (#1631)
fix(antigravity): hide deprecated Gemini-Claude models, redirect legacy aliases to Claude 4.6 — integrated into release/v3.7.1
2026-04-26 15:59:07 -03:00
Aleksandr
0c90c9768b
Add Codex GPT-5.5 support (#1617)
Integrated into release/v3.7.1
2026-04-26 14:56:25 -03:00