Commit graph

440 commits

Author SHA1 Message Date
rcourtman
4120d75359 Surface shared cluster-only storage in alerts (#1341) 2026-03-30 19:25:54 +01:00
rcourtman
48689137ec Migrate Docker metadata on observed container recreation (#1054) 2026-03-27 22:50:19 +00:00
rcourtman
8c1d4dcc04 Honor discovery subnet policy for cluster endpoints (#1319) 2026-03-27 16:30:21 +00:00
rcourtman
d11e3d8f2d Use Ceph monmap and mgrmap counts in cluster summaries (#1319) 2026-03-27 16:23:57 +00:00
rcourtman
03b620a429 Parallelize legacy Proxmox VM guest-agent polling (#1319) 2026-03-27 16:20:48 +00:00
rcourtman
0abbb8ba92 Rotate legacy guest-agent VM priority across polls (#1319) 2026-03-27 16:17:48 +00:00
rcourtman
3d214c80a7 Preserve guest NICs when only filtered addresses are reported (#1319) 2026-03-27 16:06:38 +00:00
rcourtman
01f916dcb5 Use linked host-agent disk data for guest fallback (#1319) 2026-03-27 15:56:20 +00:00
rcourtman
ad10e1f116 Discover controller-backed SMART wearout paths (#1368) 2026-03-27 15:42:44 +00:00
rcourtman
3609fb676b Preserve named guest interfaces without MACs (#1319) 2026-03-27 15:25:52 +00:00
rcourtman
d4242d9a13 Fix ZFS pool attachment in storage frontend (discussion #1351) 2026-03-27 14:59:52 +00:00
rcourtman
2a4432048a Continue guest-agent polling after transient status failures (#1319) 2026-03-27 14:50:28 +00:00
rcourtman
01e4227ec7 Preserve cached guest metadata in legacy PVE VM poll (#1319) 2026-03-27 14:35:40 +00:00
rcourtman
be20b22047 Use balloon-aware total for Proxmox freemem fallback (#1319) 2026-03-27 14:23:10 +00:00
rcourtman
fe66af273e Use freemem when Proxmox VM status memory disagrees (#1319) 2026-03-27 14:02:58 +00:00
rcourtman
e508bc3380 Prefer sane VM free-mem fallback over false full-usage samples (#1319) 2026-03-27 13:55:07 +00:00
rcourtman
8fc41f774c Keep normalized Windows guest disks in efficient VM polling (#1319) 2026-03-27 13:51:55 +00:00
rcourtman
c1fa084100 Stabilize low-trust 100 percent VM memory across mixed sizes (#1319) 2026-03-27 11:50:51 +00:00
rcourtman
bce800e95d Retry incomplete guest network metadata sooner (#1319) 2026-03-27 11:41:56 +00:00
rcourtman
176228381b Preserve guest NIC identity before IP assignment (#1319) 2026-03-27 11:27:15 +00:00
rcourtman
2afde29a48 Retry partial guest metadata sooner (#1319) 2026-03-27 11:15:17 +00:00
rcourtman
51abca6421 Treat available guest agents as healthy for VM memory carry-forward (#1319) 2026-03-27 11:04:07 +00:00
rcourtman
963670f01c Serve fresh alert snapshots from monitor state reads (#1365) 2026-03-27 10:47:56 +00:00
rcourtman
ae66647eb1 Preserve VM memory when healthy guests fall back to false 100% usage (#1319) 2026-03-27 08:27:14 +00:00
rcourtman
f344938403 Retry Linux guest meminfo sooner after transient failures (#1319) 2026-03-26 23:27:54 +00:00
rcourtman
5bd85dd5ae Preserve cached guest metadata when VM status calls fail (#1319) 2026-03-26 23:25:08 +00:00
rcourtman
d91f21b961 Preserve fresh guest metadata across transient agent misses (#1319) 2026-03-26 23:20:38 +00:00
rcourtman
627181566a Allow SSH temperature fallback when host agent lacks SMART 2026-03-26 22:40:43 +00:00
rcourtman
ae6b663e95 Attach ZFS pools for dataset-backed storages 2026-03-26 22:29:32 +00:00
rcourtman
d9b7c99f02 Rotate guest-agent poll priority across QEMU polls (#1319) 2026-03-26 22:20:27 +00:00
rcourtman
6f9da1b70c Retry empty guest metadata sooner (#1319) 2026-03-26 21:03:51 +00:00
rcourtman
fcd2384dd5 Stabilize transient VM disk fallbacks (#1319) 2026-03-26 11:12:23 +00:00
rcourtman
92e6075ee4 Fix ZFS pool matching for local-zfs storages 2026-03-26 09:09:17 +00:00
rcourtman
e9bbc35bae Stabilize repeated low-trust VM memory fallbacks (#1319) 2026-03-26 00:23:29 +00:00
rcourtman
2196327769 Preserve VM guest metadata across transient agent gaps (#1319) 2026-03-26 00:12:19 +00:00
rcourtman
0f70aa053e Honor disk-exclude for sleeping and Proxmox disks (#1142) 2026-03-26 00:01:59 +00:00
rcourtman
333e66a8e9 Reject shared Docker token host identity collisions (#1366) 2026-03-25 23:36:57 +00:00
rcourtman
48f4438d23 Scale v5 Proxmox guest disk polling 2026-03-25 18:24:47 +00:00
rcourtman
4ad7e51875 Prefer linked host disk metrics for v5 Proxmox nodes 2026-03-25 16:54:00 +00:00
rcourtman
b4e4f6b92d Treat vm meminfo total-used fallback as low-trust (#1319) 2026-03-25 12:49:31 +00:00
rcourtman
fba1fadccd Make alert node display name resolution instance-aware (#1218) 2026-03-25 12:44:22 +00:00
rcourtman
9c2a56d351 Respect quiet hours for recovery notifications (#1068) 2026-03-25 12:27:35 +00:00
rcourtman
ffaeea18d6 Scope cluster TLS fingerprints to their own endpoints (#1199) 2026-03-25 12:10:09 +00:00
rcourtman
40249947ed Fix template backup orphan detection race (#1352) 2026-03-25 10:36:33 +00:00
rcourtman
2fe22c3308 fix(backups): prevent template backups from being flagged as orphaned
Proxmox VM/LXC templates are intentionally excluded from the monitored
guest list, but their backup files exist on storage. The orphan-detection
logic was firing for every template backup because the VMID was never
in the guest lookup maps.

Fix: track template VMID→node pairs in State.templateVMIDs (unexported,
not serialised to API/frontend) during the resources poll loop, expose
via StateSnapshot.TemplateVMIDs, and use in both buildGuestLookups() and
the storage backup node-resolution map so orphan detection treats template
backups as valid. Also preserves the template map through the cluster
health grace-period path (zero-resource preservation), the partial-node
grace-period path, and clears it on instance removal.

Closes #1352
2026-03-17 09:04:22 +00:00
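
The orphan-detection change described in 2fe22c3308 can be illustrated with a minimal sketch. This is not Pulse's code: the `Backup` type, the `find_orphaned_backups` function, and the sample VMIDs and volume names are all hypothetical, and the sketch only assumes that a backup counts as orphaned when its VMID matches neither a monitored guest nor a tracked template.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Backup:
    """Hypothetical backup record: VMID, node, and volume identifier."""
    vmid: int
    node: str
    volid: str


def find_orphaned_backups(backups, guest_vmids, template_vmids):
    """Return backups whose VMID is neither a monitored guest nor a template.

    guest_vmids and template_vmids are VMID -> node mappings (assumed shape).
    """
    known = set(guest_vmids) | set(template_vmids)
    return [b for b in backups if b.vmid not in known]


# Illustrative data only.
backups = [
    Backup(100, "pve1", "local:backup/vzdump-qemu-100.vma.zst"),
    Backup(9000, "pve1", "local:backup/vzdump-qemu-9000.vma.zst"),
    Backup(555, "pve2", "local:backup/vzdump-lxc-555.tar.zst"),
]
guest_vmids = {100: "pve1"}      # monitored guests
template_vmids = {9000: "pve1"}  # templates, excluded from the guest list

orphans = find_orphaned_backups(backups, guest_vmids, template_vmids)
print([b.vmid for b in orphans])
```

Without the template map, the backup for VMID 9000 would also be reported, which is the false positive the commit message describes.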
rcourtman
caff845c1a fix(ui): use Proxmox tag colours from datacenter config
Pulse was generating tag colours from a hash of the tag name instead
of using the colours configured in Proxmox. Now polls /cluster/options
once per PVE instance and merges the tag-style colour map into state,
which the frontend uses as the first-priority colour source for tag
badges. Falls back to the existing special-tag and hash-based colours
when Proxmox hasn't set a custom colour for a tag.
2026-03-15 19:49:46 +00:00
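
The colour priority described in caff845c1a (Proxmox-configured colour first, then special-tag colours, then a hash-based colour) can be sketched as below. The function names, the `SPECIAL_TAG_COLOURS` table, and the SHA-1 based hash are assumptions made for illustration; the commit message does not say how Pulse derives its hash-based colours.

```python
import hashlib

# Hypothetical built-in colours for special tags.
SPECIAL_TAG_COLOURS = {"production": "#d32f2f"}


def hash_colour(tag):
    """Derive a stable hex colour from the tag name (illustrative scheme only)."""
    digest = hashlib.sha1(tag.encode("utf-8")).hexdigest()
    return "#" + digest[:6]


def resolve_tag_colour(tag, proxmox_colours):
    """Pick a badge colour, preferring the colour configured in Proxmox."""
    if tag in proxmox_colours:
        return proxmox_colours[tag]
    if tag in SPECIAL_TAG_COLOURS:
        return SPECIAL_TAG_COLOURS[tag]
    return hash_colour(tag)


# proxmox_colours stands in for the tag-style colour map merged into state.
proxmox_colours = {"db": "#00ff00"}
print(resolve_tag_colour("db", proxmox_colours))          # configured in Proxmox
print(resolve_tag_colour("production", proxmox_colours))  # special-tag fallback
print(resolve_tag_colour("misc", proxmox_colours))        # hash-based fallback
```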
rcourtman
da982d0fca Prepare v5.1.24 release 2026-03-14 16:43:26 +00:00
rcourtman
d05a00b931 fix(monitoring): smooth transient VM memory fallback spikes 2026-03-10 23:06:17 +00:00
rcourtman
afcfb23a30 fix(monitoring): retain intermittent FreeBSD SMART data 2026-03-10 22:52:25 +00:00
rcourtman
7dab977d91 Add split memory bar showing Used | Cache | Free segments (#1302)
Show reclaimable buff/cache as a distinct amber segment between used
(green) and free (gray) in the memory bar. This explains why Pulse's
memory percentage differs from Proxmox: Pulse reports cache-aware
usage (MemAvailable) while Proxmox includes cache as used (Total-Free).

Backend: add Cache field to Memory model, derived from MemInfo
(Available - Free). Only uses MemInfo.Free (not FreeMem fallback) to
avoid inflating cache by the balloon gap on ballooned VMs.

Frontend: StackedMemoryBar renders three segments with tooltip
breakdown. Tooltip Free accounts for balloon limit when active.
Percentage label and alerts remain cache-aware (unchanged).
2026-03-10 10:16:14 +00:00
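
The arithmetic behind 7dab977d91 can be worked through in a short sketch. The function names and sample figures are hypothetical; the formulas follow the commit message: cache is Available minus Free, cache-aware usage is based on MemAvailable, and the Proxmox-style figure is Total minus Free. Balloon handling is left out.

```python
def split_memory(total, free, available):
    """Split memory into used, cache, and free segments (all in the same unit).

    cache = available - free (reclaimable buff/cache), clamped at zero.
    used  = total - available (cache-aware usage), clamped at zero.
    """
    cache = max(available - free, 0)
    used = max(total - available, 0)
    return {"used": used, "cache": cache, "free": free}


def cache_aware_percent(total, available):
    """Usage percentage that treats reclaimable cache as not used."""
    return 100 * (total - available) / total


def proxmox_style_percent(total, free):
    """Usage percentage that counts cache as used (Total - Free)."""
    return 100 * (total - free) / total


# Illustrative values in MiB.
total, free, available = 16384, 2048, 10240

segments = split_memory(total, free, available)
print(segments)                               # {'used': 6144, 'cache': 8192, 'free': 2048}
print(cache_aware_percent(total, available))  # 37.5
print(proxmox_style_percent(total, free))     # 87.5
```

In this example the same guest shows 37.5% under the cache-aware calculation and 87.5% under the Total-Free calculation, which is the discrepancy the split bar is meant to explain.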