Commit graph

20 commits

Author SHA1 Message Date
Adrian Cole
3a304c6af3
ci: enable agentic provider live tests (claude-code, codex, gemini-cli) (#7088)
Some checks failed
Canary / build-cli (push) Blocked by required conditions
Canary / Upload Install Script (push) Blocked by required conditions
Canary / bundle-desktop (push) Blocked by required conditions
Canary / bundle-desktop-linux (push) Blocked by required conditions
Canary / bundle-desktop-windows (push) Blocked by required conditions
Canary / Release (push) Blocked by required conditions
Canary / Prepare Version (push) Waiting to run
CI / Test and Lint Electron Desktop App (push) Blocked by required conditions
CI / changes (push) Waiting to run
CI / Check Rust Code Format (push) Blocked by required conditions
CI / Build and Test Rust Project (push) Blocked by required conditions
CI / Lint Rust Code (push) Blocked by required conditions
CI / Check OpenAPI Schema is Up-to-Date (push) Blocked by required conditions
Live Provider Tests / check-fork (push) Waiting to run
Live Provider Tests / changes (push) Blocked by required conditions
Live Provider Tests / Build Binary (push) Blocked by required conditions
Live Provider Tests / Smoke Tests (push) Blocked by required conditions
Live Provider Tests / Smoke Tests (Code Execution) (push) Blocked by required conditions
Live Provider Tests / Compaction Tests (push) Blocked by required conditions
Publish Docker Image / docker (push) Waiting to run
Scorecard supply-chain security / Scorecard analysis (push) Waiting to run
Deploy Documentation / deploy (push) Has been cancelled
Publish Ask AI Bot Docker Image / docker (push) Has been cancelled
Signed-off-by: Adrian Cole <adrian@tetrate.io>
2026-02-10 03:01:28 +00:00
Jack Amadeo
e670f348bd
Smoke tests: split compaction test and use debug build (#6984)
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2026-02-08 02:35:58 +00:00
Elias Posen
8631caa890
Use Port of Context (pctx) for code mode (#6765)
Signed-off-by: Elias Posen <elias@posen.ch>
Signed-off-by: Adrian Cole <adrian@tetrate.io>
Co-authored-by: Adrian Cole <adrian@tetrate.io>
2026-02-03 12:15:49 -05:00
Jack Amadeo
eae5a47788
More providers for testing (#6849) 2026-02-03 16:21:59 +00:00
Jack Amadeo
a398a77682
allow skipping providers in test_providers.sh (#6778) 2026-01-28 13:30:26 -05:00
Zane
e7bfdf8fa2
smoke test allow pass for flaky providers (#6638) 2026-01-22 16:10:41 -08:00
Bradley Axen
38f5f338cb
fix: improve smoke test prompt for reliable tool calling (#6281)
Co-authored-by: Michael Neale <michael.neale@gmail.com>
2025-12-31 15:52:37 -08:00
Michael Neale
8ec6332738
fix: adding more open models (#6300) 2025-12-31 09:48:05 +11:00
Michael Neale
5ca7eb2305
chore: Update gemini versions in test_providers.sh (#6246) 2025-12-23 11:12:19 +11:00
Alex Hancock
7134e89c4b
feat: improved UX for tool calls via execute_code (#6205) 2025-12-22 10:42:20 -05:00
Michael Neale
d4814042e6
chore: cover code mode with end to end provider tests (#6183) 2025-12-19 12:02:06 +08:00
Salvatore Testa
cfdf01567d
fix: support Gemini 3's thought signatures (#5806)
Some checks are pending
Canary / Prepare Version (push) Waiting to run
Canary / build-cli (push) Blocked by required conditions
Canary / Upload Install Script (push) Blocked by required conditions
Canary / bundle-desktop (push) Blocked by required conditions
Canary / bundle-desktop-linux (push) Blocked by required conditions
Canary / bundle-desktop-windows (push) Blocked by required conditions
Canary / Release (push) Blocked by required conditions
CI / Check Rust Code Format (push) Blocked by required conditions
CI / changes (push) Waiting to run
CI / Build and Test Rust Project (push) Blocked by required conditions
CI / Lint Rust Code (push) Blocked by required conditions
CI / Check OpenAPI Schema is Up-to-Date (push) Blocked by required conditions
CI / Test and Lint Electron Desktop App (push) Blocked by required conditions
Live Provider Tests / check-fork (push) Waiting to run
Live Provider Tests / changes (push) Blocked by required conditions
Live Provider Tests / Build Release Binary (push) Blocked by required conditions
Live Provider Tests / Smoke Tests (push) Blocked by required conditions
Documentation Site Preview / deploy (push) Waiting to run
Publish Docker Image / docker (push) Waiting to run
Signed-off-by: Salvatore Testa <sal@withpersona.com>
2025-11-20 16:28:27 +11:00
Michael Neale
2bef034303
feat: trying grok for live test (#5732)
Some checks are pending
Canary / bundle-desktop (push) Blocked by required conditions
Canary / bundle-desktop-linux (push) Blocked by required conditions
Canary / bundle-desktop-windows (push) Blocked by required conditions
Canary / Release (push) Blocked by required conditions
Canary / Prepare Version (push) Waiting to run
Canary / build-cli (push) Blocked by required conditions
Canary / Upload Install Script (push) Blocked by required conditions
CI / Check OpenAPI Schema is Up-to-Date (push) Blocked by required conditions
CI / changes (push) Waiting to run
CI / Check Rust Code Format (push) Blocked by required conditions
CI / Build and Test Rust Project (push) Blocked by required conditions
CI / Lint Rust Code (push) Blocked by required conditions
CI / Test and Lint Electron Desktop App (push) Blocked by required conditions
Live Provider Tests / check-fork (push) Waiting to run
Live Provider Tests / changes (push) Blocked by required conditions
Live Provider Tests / Build Release Binary (push) Blocked by required conditions
Live Provider Tests / Smoke Tests (push) Blocked by required conditions
Documentation Site Preview / deploy (push) Waiting to run
Publish Docker Image / docker (push) Waiting to run
2025-11-17 09:37:43 +11:00
Zane
89f7384d57
add clippy warning for string_slice (#5422)
Some checks are pending
Canary / bundle-desktop (push) Blocked by required conditions
Canary / bundle-desktop-linux (push) Blocked by required conditions
Canary / bundle-desktop-windows (push) Blocked by required conditions
Canary / Release (push) Blocked by required conditions
Canary / Prepare Version (push) Waiting to run
Canary / build-cli (push) Blocked by required conditions
Canary / Upload Install Script (push) Blocked by required conditions
CI / changes (push) Waiting to run
CI / Check Rust Code Format (push) Blocked by required conditions
CI / Build and Test Rust Project (push) Blocked by required conditions
CI / Test and Lint Electron Desktop App (push) Blocked by required conditions
Live Provider Tests / changes (push) Blocked by required conditions
Live Provider Tests / Build Release Binary (push) Blocked by required conditions
Live Provider Tests / Smoke Tests (push) Blocked by required conditions
Live Provider Tests / check-fork (push) Waiting to run
Documentation Site Preview / deploy (push) Waiting to run
Publish Docker Image / docker (push) Waiting to run
Co-authored-by: Douwe Osinga <douwe@squareup.com>
2025-11-04 17:46:25 -05:00
Michael Neale
7511a533d6
we should run this on main and also test open models at least via ope… (#5556)
Some checks failed
Canary / Release (push) Blocked by required conditions
Canary / Prepare Version (push) Waiting to run
Canary / build-cli (push) Blocked by required conditions
Canary / Upload Install Script (push) Blocked by required conditions
Canary / bundle-desktop (push) Blocked by required conditions
Canary / bundle-desktop-linux (push) Blocked by required conditions
Canary / bundle-desktop-windows (push) Blocked by required conditions
CI / changes (push) Waiting to run
CI / Check Rust Code Format (push) Blocked by required conditions
CI / Build and Test Rust Project (push) Blocked by required conditions
CI / Test and Lint Electron Desktop App (push) Blocked by required conditions
Live Provider Tests / check-fork (push) Waiting to run
Live Provider Tests / changes (push) Blocked by required conditions
Live Provider Tests / Build Release Binary (push) Blocked by required conditions
Live Provider Tests / Smoke Tests (push) Blocked by required conditions
Documentation Site Preview / deploy (push) Waiting to run
Publish Docker Image / docker (push) Waiting to run
Deploy Documentation / deploy (push) Has been cancelled
adds qwen3-code and GLM 4.6 to test_providers for open model coverage
2025-11-04 09:06:23 +11:00
Alex Hancock
38e7dc8f30
fix: remove qwen3-coder from provider/mcp smoke tests (#5551) 2025-11-03 14:33:49 -05:00
Michael Neale
b94535b679
testing tetrate with sonnet (#5428) 2025-10-29 11:40:02 +11:00
Douwe Osinga
6b6c50976c
Gemini again (#5390)
Co-authored-by: Douwe Osinga <douwe@squareup.com>
2025-10-27 16:41:00 -04:00
Michael Neale
3c975bb358
live testing script (#5263)
Co-authored-by: Jack Amadeo <jackamadeo@squareup.com>
2025-10-21 16:39:58 +11:00
Douwe Osinga
64b37339e0
Skip subagents for gemini (#5257)
Co-authored-by: Douwe Osinga <douwe@squareup.com>
2025-10-18 17:35:29 -04:00