Commit graph

222 commits

Author SHA1 Message Date
Peter Steinberger
cc9dcd3d69 fix(gateway): prefer linux child OOM victims
Raise eligible Linux child processes' own oom_score_adj from a child-side /bin/sh exec shim so cgroup memory pressure prefers transient workers over the long-lived gateway. Cover supervisor children, PTY shells, MCP stdio servers, and OpenClaw-launched browser processes through the shared process runtime seam.

Harden the wrapper for distroless images, shell startup env, per-child and process-level opt-outs, dash-compatible exec, and leading-dash command names. Document Linux verification and OOM behavior.

Fixes #70404.

Co-authored-by: Neerav Makwana <261249544+neeravmakwana@users.noreply.github.com>
2026-04-23 05:23:40 +01:00
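The child-side shim described in cc9dcd3d69 can be illustrated roughly as follows. This is a minimal sketch, not the code from the commit: the helper name `wrapWithOomShim`, the `SpawnPlan` type, the default score of 1000, and the exact shell script are all assumptions made for illustration. It relies only on documented Linux behavior: a process may raise its own `/proc/self/oom_score_adj` without privileges, and the value survives `exec`. The hardening cases listed in the commit (distroless images, opt-outs, leading-dash command names) are not attempted here.

```typescript
// Hypothetical helper, not taken from the repository.
type SpawnPlan = { command: string; args: string[] };

// Rewrites (command, args) so that, on Linux, the child first raises its own
// oom_score_adj and then replaces itself with the real command via exec.
function wrapWithOomShim(
  command: string,
  args: readonly string[],
  platform: string,
  scoreAdj: number = 1000, // assumed default; 1000 is the kernel's maximum
): SpawnPlan {
  // Other platforms have no /proc/self/oom_score_adj: pass through unchanged.
  if (platform !== "linux") {
    return { command, args: [...args] };
  }
  if (!Number.isInteger(scoreAdj) || scoreAdj < 0 || scoreAdj > 1000) {
    throw new RangeError("scoreAdj must be an integer between 0 and 1000");
  }
  // The write is best-effort: if /proc is unavailable the child still starts.
  // `exec "$@"` keeps the same PID, so the adjusted score applies to the
  // real command. The literal "sh" below fills $0 for the inline script.
  const script =
    `{ echo ${scoreAdj} > /proc/self/oom_score_adj; } 2>/dev/null || true; exec "$@"`;
  return { command: "/bin/sh", args: ["-c", script, "sh", command, ...args] };
}
```

The returned plan could be handed to whatever spawn call the process runtime already uses. Doing the write inside the child means the parent never needs permission to touch another process's `/proc` entry.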
Peter Steinberger
0195da6b0e refactor: cache optional runtime imports 2026-04-18 20:45:26 +01:00
Peter Steinberger
4fa961d4f1 refactor(lint): enable map spread rule 2026-04-18 20:37:12 +01:00
Peter Steinberger
c035c5c0d2 refactor: cache lazy runtime imports 2026-04-18 16:18:26 +01:00
Vincent Koc
b22bbf5660 test(process): share shimmed windows success assertions 2026-04-12 09:37:06 +01:00
Vincent Koc
e1e20c424b test(process): share supervisor sigkill wait assertions 2026-04-12 04:52:29 +01:00
Vincent Koc
d262b1c688 fix(logging): split queue diagnostic runtime 2026-04-12 03:45:35 +01:00
Vincent Koc
b9a0052dd0 fix(cycles): split embedded runner and setup leaf types 2026-04-11 14:49:48 +01:00
Peter Steinberger
ebfd468ee0 refactor: simplify typed conversions 2026-04-11 01:01:30 +01:00
Vincent Koc
78d2e9e2a8 fix(ci): repair main type drift 2026-04-10 08:13:02 +01:00
Ayaan Zaidi
c003e982a2 fix(process): drain Windows stdio before exit fallback settle 2026-04-10 10:09:25 +05:30
Ayaan Zaidi
063049c0d4 fix(process): wait for close after Windows exit fallback 2026-04-10 10:09:25 +05:30
Ayaan Zaidi
4b6b1a3ed3 fix(process): settle Windows supervisor waits from exit state 2026-04-10 10:09:25 +05:30
Peter Steinberger
552b5d3859 test: speed up cli and process tests 2026-04-08 00:30:22 +01:00
Peter Steinberger
c3074bd513 refactor: dedupe path lowercase helpers 2026-04-07 15:53:50 +01:00
Peter Steinberger
a20d96ae31 test: stabilize isolated runtime and config suites 2026-04-07 11:41:02 +01:00
Peter Steinberger
371c4147f3 fix: restore ci after rebase drift 2026-04-07 07:36:11 +01:00
Peter Steinberger
0a6fd459f9 refactor: dedupe channel and cli readers 2026-04-07 07:36:11 +01:00
openperf
e777a2b230 fix(process): migrate legacy command-queue singleton missing activeTaskWaiters
After a SIGUSR1 in-process restart following an npm upgrade from v2026.4.2
to v2026.4.5, the globalThis singleton created by the old code version
lacks the activeTaskWaiters field added in v2026.4.5.  resolveGlobalSingleton
returns the stale object as-is, causing notifyActiveTaskWaiters() to call
Array.from(undefined) and crash the gateway in a loop.

Add a schema migration step in getQueueState() that patches the missing
field on legacy singleton objects.  Add a regression test that plants a
v2026.4.2-shaped state object and verifies resetAllLanes() and
waitForActiveTasks() succeed without throwing.

Fixes #61905
2026-04-06 15:41:14 +01:00
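The failure mode and fix described in e777a2b230 can be sketched as below. This is an illustrative reconstruction under stated assumptions, not the repository's code: the global key `__exampleCommandQueueState`, the `lanes` field, and the shape of `QueueState` are hypothetical; only the names `getQueueState`, `notifyActiveTaskWaiters`, and `activeTaskWaiters` come from the commit message. The point is that a singleton stored on `globalThis` outlives an in-process restart, so the accessor has to patch fields that an older code version never created.

```typescript
// Hypothetical state shape; only activeTaskWaiters is named in the commit.
type QueueState = {
  lanes: Map<string, unknown>;
  activeTaskWaiters: Set<() => void>;
};

// Assumed key under which the singleton lives on globalThis.
const QUEUE_STATE_KEY = "__exampleCommandQueueState";

function getQueueState(): QueueState {
  const globals = globalThis as unknown as Record<
    string,
    Partial<QueueState> | undefined
  >;
  let state = globals[QUEUE_STATE_KEY];
  if (!state) {
    state = { lanes: new Map(), activeTaskWaiters: new Set() };
    globals[QUEUE_STATE_KEY] = state;
  }
  // Migration step: an object created by an older version may lack fields
  // added later, so fill them in place instead of trusting the stale shape.
  if (!state.lanes) state.lanes = new Map();
  if (!state.activeTaskWaiters) state.activeTaskWaiters = new Set();
  return state as QueueState;
}

// Without the migration above, Array.from(undefined) would throw a TypeError
// when the singleton predates the activeTaskWaiters field.
function notifyActiveTaskWaiters(): number {
  const state = getQueueState();
  const waiters = Array.from(state.activeTaskWaiters);
  state.activeTaskWaiters.clear();
  for (const waiter of waiters) waiter();
  return waiters.length;
}
```

Patching the existing object in place, rather than replacing it, keeps any references that other modules already hold to the singleton valid.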
Peter Steinberger
edab013e51 fix: support corepack cmd shim on windows 2026-04-06 03:48:47 +01:00
Peter Steinberger
f4fa53de3f fix(ci): repair zalouser sdk path and exec timeout kill 2026-04-04 04:51:33 +01:00
Peter Steinberger
ab318de8b7 test(plugins): finish moving contract coverage 2026-04-04 00:11:39 +01:00
Peter Steinberger
5b29483ab1 test(ci): type-safe exec timeout stub 2026-04-03 22:14:59 +01:00
Peter Steinberger
5a94909654 test(ci): stabilize exec timeout tests 2026-04-03 22:12:08 +01:00
Peter Steinberger
0204b8dd28 fix: stabilize live and docker test lanes 2026-04-03 21:43:36 +01:00
Peter Steinberger
fa6e6603fa test(ci): harden cli and exec tests for shared workers 2026-04-03 21:30:47 +01:00
Vincent Koc
0464435777 fix(ci): align windows builtin mock types 2026-04-04 03:57:48 +09:00
Peter Steinberger
bc23db501b test: trim more core importOriginal usage 2026-04-03 19:49:43 +01:00
Peter Steinberger
3edfc494df test: expand builtin mock helper usage 2026-04-03 18:53:34 +01:00
Peter Steinberger
636a23b73e test: extract node builtin mock helpers 2026-04-03 18:40:28 +01:00
Peter Steinberger
e0580e6863 test: harden shared-worker runtime setup 2026-04-03 18:18:56 +01:00
Vincent Koc
f575bc2bfe test(ci): harden proxy-sensitive and timeout unit tests 2026-04-04 02:12:00 +09:00
Shakker
2fa3a09137 test: harden command queue timer cleanup 2026-04-04 01:07:28 +09:00
Peter Steinberger
ffd34f8896 test: reduce agent test import churn 2026-04-03 04:41:09 +01:00
Peter Steinberger
f03d7c5a4c refactor: centralize Windows exec invocation 2026-04-02 18:27:53 +01:00
lawrence3699
2fd7f7ca52 fix(exec): hide windows console windows 2026-04-03 02:19:32 +09:00
Peter Steinberger
b87f33c920 test(ci): deflake windows npm exec coverage 2026-03-31 19:28:11 +01:00
Vincent Koc
37099dae3e fix(ci): restore matrix monitor import guards and windows npm exit codes 2026-03-31 22:04:35 +09:00
Vincent Koc
11590eb6ce fix(ci): restore dotenv trust boundary and windows npm exit handling 2026-03-31 21:51:17 +09:00
Vincent Koc
25a3d37970 fix(ci): restore matrix guardrails and windows exec shim 2026-03-31 21:27:43 +09:00
Peter Steinberger
6b6ddcd2a6 test: speed up core runtime suites 2026-03-31 02:25:02 +01:00
Shakker
81e65e119f test: mock supervisor timeout flows 2026-03-31 01:40:55 +01:00
Shakker
5fb19f296a test: complete exec timeout child lifecycle 2026-03-31 01:40:55 +01:00
Shakker
82681ba215 test: stabilize exec timeout assertions on windows 2026-03-31 01:40:55 +01:00
Shakker
56c9e2493b test: harden windows timeout-sensitive suites 2026-03-31 01:40:55 +01:00
Shakker
9590e2ccae test: stabilize windows task registry and exec timeouts 2026-03-31 01:40:55 +01:00
Gustavo Madeira Santana
6a37ecad82 Supervisor: unblock waits after forced child kill 2026-03-30 00:45:22 -04:00
George Zhang
e133924047 [codex] harden clawhub plugin publishing and install (#56870)
* fix: harden clawhub plugin publishing and install

* fix(process): preserve windows shim exit success
2026-03-29 11:59:19 -07:00
Vignesh Natarajan
4d54376483 Tests: stabilize shard-2 queue and channel state 2026-03-29 01:12:58 -07:00
Vignesh Natarajan
4b137da582 Test: harden command queue test isolation (CI chore) 2026-03-28 23:43:03 -07:00