Commit graph

38 commits

Author SHA1 Message Date
rcourtman
e93c8b40ae Fix CodeQL integer and audit findings 2026-03-28 13:33:48 +00:00
rcourtman
e306c0a461 Tolerate partial guest network address payloads (#1319)
2026-03-27 17:09:09 +00:00
rcourtman
81b0a567ce Harden guest network interface parsing (#1319) 2026-03-27 17:05:34 +00:00
rcourtman
2ed4253573 Accept object-style single guest fsinfo results (#1319) 2026-03-27 16:33:41 +00:00
rcourtman
3d27c8f006 Accept object-style guest fsinfo disk metadata (#1319) 2026-03-27 15:24:40 +00:00
rcourtman
fcfa0c2903 Skip malformed guest fsinfo entries (#1319) 2026-03-27 15:23:13 +00:00
rcourtman
d4242d9a13 Fix ZFS pool attachment in storage frontend (discussion #1351) 2026-03-27 14:59:52 +00:00
rcourtman
b5629fb1df Normalize Windows volume GUID fsinfo mountpoints (#1319) 2026-03-27 14:04:58 +00:00
rcourtman
b05d2b0489 Handle Windows fsinfo name fallback for guest disks (#1319) 2026-03-27 11:39:22 +00:00
rcourtman
1f332bee52 Support privileged fsinfo totals for guest disks (#1319) 2026-03-27 11:18:53 +00:00
rcourtman
1885bd02c0 Fix Proxmox tag color parsing (#1348) 2026-03-25 10:40:31 +00:00
rcourtman
caff845c1a fix(ui): use Proxmox tag colours from datacenter config
Pulse was generating tag colours from a hash of the tag name instead
of using the colours configured in Proxmox. It now polls /cluster/options
once per PVE instance and merges the tag-style colour map into state,
which the frontend uses as the first-priority colour source for tag
badges, falling back to the existing special-tag and hash-based colours
when Proxmox hasn't set a custom colour for a tag.
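
A minimal Go sketch of the colour-map parse, assuming the tag-style value
format `color-map=tag:bg[:text];...` that /cluster/options returns;
`parseTagColorMap` is an illustrative name, not Pulse's actual code:

```go
package main

import (
	"fmt"
	"strings"
)

// parseTagColorMap extracts per-tag background colours from a tag-style
// value such as "color-map=prod:ff0000:ffffff;home:00ff00,shape=full".
func parseTagColorMap(tagStyle string) map[string]string {
	colors := make(map[string]string)
	for _, part := range strings.Split(tagStyle, ",") {
		if !strings.HasPrefix(part, "color-map=") {
			continue
		}
		for _, entry := range strings.Split(strings.TrimPrefix(part, "color-map="), ";") {
			fields := strings.SplitN(entry, ":", 3)
			if len(fields) >= 2 && fields[0] != "" {
				colors[fields[0]] = fields[1] // fields[2], if present, is the text colour
			}
		}
	}
	return colors
}

func main() {
	fmt.Println(parseTagColorMap("color-map=prod:ff0000:ffffff;home:00ff00,shape=full"))
}
```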
2026-03-15 19:49:46 +00:00
rcourtman
0ae2806f18 fix(memory): add guest agent /proc/meminfo fallback to avoid VM memory inflation (#1270)
Proxmox status.Mem includes page cache as "used" memory, inflating
reported VM usage. The existing fallbacks (balloon meminfo, RRD, linked
host agent) were frequently unavailable, causing most VMs to fall
through to the inflated status-mem source.

Adds a new last-resort fallback that reads /proc/meminfo via the QEMU
guest agent file-read endpoint to get accurate MemAvailable. Results
are cached (60s positive, 5min negative backoff for unsupported VMs).

Also fixes: RRD memavailable fallback missing from traditional polling
path, cache key collisions in multi-PVE setups, FreeMem underflow
guard inconsistency, and integer overflow in kB-to-bytes conversion.
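
A minimal sketch of the MemAvailable parse plus the overflow-guarded
kB-to-bytes conversion, assuming the agent file-read endpoint hands back
raw /proc/meminfo text; `parseMemAvailable` is an illustrative name:

```go
package main

import (
	"bufio"
	"fmt"
	"math"
	"strconv"
	"strings"
)

// parseMemAvailable returns MemAvailable in bytes, or false if the field
// is absent or malformed.
func parseMemAvailable(meminfo string) (uint64, bool) {
	scanner := bufio.NewScanner(strings.NewReader(meminfo))
	for scanner.Scan() {
		line := scanner.Text()
		if !strings.HasPrefix(line, "MemAvailable:") {
			continue
		}
		fields := strings.Fields(line) // e.g. ["MemAvailable:", "16323412", "kB"]
		if len(fields) < 2 {
			return 0, false
		}
		kb, err := strconv.ParseUint(fields[1], 10, 64)
		if err != nil || kb > math.MaxUint64/1024 { // guard the kB-to-bytes overflow
			return 0, false
		}
		return kb * 1024, true
	}
	return 0, false
}

func main() {
	fmt.Println(parseMemAvailable("MemTotal: 32768000 kB\nMemAvailable: 16323412 kB\n"))
}
```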
2026-02-20 13:31:52 +00:00
rcourtman
a54d71117b fix(proxmox): prevent guest agent errors from marking endpoints unhealthy
Backport of v6 commits a87c9950 and 347d7db1.

Part 1 (a87c9950): Wrap the four guest agent c.get() errors with
fmt.Errorf("guest agent ...: %w", err) so isVMSpecificError() correctly
scopes them to the VM rather than the cluster endpoint.

Part 2 (347d7db1): Replace the 20+ pattern blocklist in
executeWithFailover with an allowlist via isEndpointConnectivityError().
Only true TCP/DNS/TLS failures mark an endpoint unhealthy. Any HTTP
response from Proxmox — including 500 — proves the node is reachable
and returns the error without affecting endpoint health.
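
A minimal sketch of the allowlist approach, assuming only transport-level
failures should mark an endpoint unhealthy; the exact error set Pulse
checks may differ:

```go
package proxmox

import (
	"crypto/tls"
	"errors"
	"net"
)

// isEndpointConnectivityError reports whether err means the endpoint itself
// is unreachable (TCP/DNS/TLS), as opposed to an HTTP-level error returned
// by a reachable Proxmox node.
func isEndpointConnectivityError(err error) bool {
	var dnsErr *net.DNSError
	if errors.As(err, &dnsErr) {
		return true
	}
	var opErr *net.OpError // connection refused, reset, dial timeout, ...
	if errors.As(err, &opErr) {
		return true
	}
	var tlsErr *tls.CertificateVerificationError
	if errors.As(err, &tlsErr) {
		return true
	}
	return false // any parsed HTTP response proves the node is reachable
}
```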
2026-02-18 12:59:20 +00:00
rcourtman
efa916ee2a fix(memory): correct memory reporting for Linux VMs and FreeBSD ZFS ARC
Linux VM page cache (#1270): QEMU VM memory now falls back to Proxmox
RRD's memavailable metric (which excludes reclaimable page cache) when
the qemu-guest-agent doesn't provide MemInfo.Available. Previously the
fallback was detailedStatus.Mem (total - MemFree), inflating usage to
80%+ on VMs with normal Linux page cache. Mirrors the existing LXC
rrd-memavailable path.

FreeBSD ZFS ARC (#1264, #1051): The host agent now reads
kstat.zfs.misc.arcstats.size via SysctlRaw on FreeBSD and subtracts
the ARC size from reported memory usage. ZFS ARC is reclaimable under
memory pressure (like Linux SReclaimable) but gopsutil counts it as
wired/non-reclaimable, causing false 90%+ memory alerts on TrueNAS
and FreeBSD hosts. Build-tagged so it compiles cleanly on all platforms.

Fixes #1270
Fixes #1264
Fixes #1051

(cherry picked from commit 94502f83ff9ffc6da28aaadc946a2f7d8b4e9bac)
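
A minimal sketch of the FreeBSD ARC adjustment, assuming
golang.org/x/sys/unix; the subtraction site in the host agent is
simplified here:

```go
//go:build freebsd

package memstats

import "golang.org/x/sys/unix"

// zfsARCSize returns the current ZFS ARC size in bytes, or 0 when the
// sysctl is unavailable (e.g. ZFS not loaded).
func zfsARCSize() uint64 {
	size, err := unix.SysctlUint64("kstat.zfs.misc.arcstats.size")
	if err != nil {
		return 0
	}
	return size
}

// adjustedUsed subtracts the reclaimable ARC from the "used" figure that
// gopsutil reports, guarding against underflow if ARC exceeds the sample.
func adjustedUsed(used uint64) uint64 {
	if arc := zfsARCSize(); arc < used {
		return used - arc
	}
	return 0
}
```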
2026-02-18 12:56:53 +00:00
rcourtman
815c990e85 fix(proxmox): avoid 403 on apt update checks 2026-02-09 20:28:09 +00:00
rcourtman
13a6f7750c Minor updates to main and proxmox client 2026-01-28 16:52:50 +00:00
rcourtman
ebc29b4fdb feat: show pending apt updates for Proxmox nodes (#1083)
- Add PendingUpdates and PendingUpdatesCheckedAt fields to Node model
- Add GetNodePendingUpdates method to Proxmox client (calls /nodes/{node}/apt/update)
- Add 30-minute polling cache to avoid excessive API calls
- Add pendingUpdates to frontend Node type
- Add color-coded badge in NodeSummaryTable (yellow: 1-9, orange: 10+)
- Update test stubs for interface compliance

Requires Sys.Audit permission on Proxmox API token to read apt updates.
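
A minimal sketch of the 30-minute polling cache, assuming a fetch callback
that wraps the /nodes/{node}/apt/update call; names are illustrative, not
Pulse's actual code:

```go
package updatescache

import (
	"sync"
	"time"
)

type cachedCount struct {
	count     int
	checkedAt time.Time
}

type UpdateCache struct {
	mu      sync.Mutex
	ttl     time.Duration
	entries map[string]cachedCount // keyed by node name
}

func NewUpdateCache() *UpdateCache {
	return &UpdateCache{ttl: 30 * time.Minute, entries: make(map[string]cachedCount)}
}

// PendingUpdates returns the cached count for node, refreshing via fetch
// (e.g. a call to /nodes/{node}/apt/update) when the entry is stale.
func (c *UpdateCache) PendingUpdates(node string, fetch func(string) (int, error)) (int, time.Time, error) {
	c.mu.Lock()
	defer c.mu.Unlock()
	if e, ok := c.entries[node]; ok && time.Since(e.checkedAt) < c.ttl {
		return e.count, e.checkedAt, nil
	}
	count, err := fetch(node)
	if err != nil {
		return 0, time.Time{}, err
	}
	e := cachedCount{count: count, checkedAt: time.Now()}
	c.entries[node] = e
	return e.count, e.checkedAt, nil
}
```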
2026-01-21 10:53:36 +00:00
rcourtman
bd1df9f942 feat: automatic subnet preference for cluster node discovery
When discovering cluster nodes, Pulse now automatically prefers IPs
on the same subnet as the initial connection. This fixes the common
issue where Pulse used internal cluster network IPs (e.g., 172.x.x.x)
instead of management network IPs (e.g., 10.x.x.x).

How it works:
1. Extract subnet from initial connection URL (assumes /24 for IPv4)
2. For each discovered node, query /nodes/{node}/network for all IPs
3. If cluster-reported IP is on a different subnet, find an IP on
   the preferred subnet and set it as IPOverride
4. Manual IPOverride settings are preserved and take precedence

This eliminates the need for manual IPOverride configuration in most
multi-network Proxmox setups.

Refs #929, #1066
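
A minimal sketch of step 3's subnet preference, assuming the /24 heuristic
described above; `preferSubnetIP` and its inputs are illustrative names:

```go
package discovery

import "net"

// preferSubnetIP returns the first candidate IP on the same /24 as the
// initial connection host, or "" if none match (the caller keeps the
// cluster-reported IP in that case).
func preferSubnetIP(initialHost string, candidates []string) string {
	ip4 := net.ParseIP(initialHost).To4()
	if ip4 == nil {
		return "" // non-IP hostname or IPv6: no subnet preference applied
	}
	mask := net.CIDRMask(24, 32)
	subnet := &net.IPNet{IP: ip4.Mask(mask), Mask: mask}
	for _, c := range candidates {
		if ip := net.ParseIP(c); ip != nil && subnet.Contains(ip) {
			return c
		}
	}
	return ""
}
```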
2026-01-08 23:12:30 +00:00
rcourtman
3fd20340d1 fix: increase PBS storage content timeout to 60s
PBS storage content queries with encrypted backups can take 10-20+ seconds
to enumerate. The previous 30s timeout was causing intermittent failures
when polling backup data from PBS storage configured in PVE.

This increases the timeout to 60s to accommodate slow PBS backends while
still preventing indefinite hangs on unavailable NFS/network storage.
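
A minimal sketch of the bounded request, assuming a context-based helper;
the constant and wrapper are illustrative, not Pulse's actual code:

```go
package pbs

import (
	"context"
	"time"
)

// storageContentTimeout bounds PBS storage content enumeration: long enough
// for slow encrypted-backup listings, short enough to avoid hanging on
// unavailable NFS/network storage.
const storageContentTimeout = 60 * time.Second

func fetchStorageContent(ctx context.Context, fetch func(context.Context) error) error {
	ctx, cancel := context.WithTimeout(ctx, storageContentTimeout)
	defer cancel()
	return fetch(ctx)
}
```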
2025-12-26 00:21:17 +00:00
rcourtman
c439a83fba chore: remove additional dead code
Remove 241 lines of unreachable code across internal and pkg:
- internal/crypto/crypto.go: unused NewCryptoManager wrapper
- internal/monitoring/scheduler.go: unused fixedIntervalSelector type
- internal/ssh/knownhosts/manager.go: unused hostKeyExists function
- internal/updates/manager.go: unused getLatestRelease wrapper
- internal/updates/updater.go: unused GetAll method
- pkg/discovery/discovery.go: unused scanWorker and runPhase (legacy compat)
- pkg/proxmox/client.go: unused post, getTaskStatus, waitForTaskCompletion, getTaskLog
- pkg/proxmox/cluster_client.go: unused markUnhealthy wrapper
2025-11-27 05:13:26 +00:00
rcourtman
01f7d81d38 style: fix gofmt formatting inconsistencies
Run gofmt -w to fix tab/space inconsistencies across 33 files.
2025-11-26 23:44:36 +00:00
rcourtman
7c895df1f3 Fix Proxmox 9.x VM status endpoint incompatibility
Proxmox VE 9.x removed support for the "full" parameter in the
/nodes/{node}/qemu/{vmid}/status/current endpoint. When Pulse sent
GetVMStatus() requests with ?full=1, Proxmox responded with:

  API error 400: {"errors":{"full":"property is not defined in schema..."}}

This caused the cluster client to mark ALL endpoints as unhealthy, which
cascaded into multiple failures:
- VM status checks failed
- Guest agent queries were blocked
- Filesystem data collection stopped working
- All Windows VMs showed disk:-1 (unknown) instead of actual disk usage

The fix removes the ?full=1 parameter since Proxmox 9.x returns all data
by default without needing this parameter. This maintains backward
compatibility with older Proxmox versions while fixing the issue in 9.x.

After this fix:
- Cluster endpoints are correctly marked as healthy
- Guest agent queries work properly
- Windows VMs report actual disk usage (e.g., 26% on C:\ drive)
- VM monitoring functions normally on Proxmox 9.x
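
For reference, a minimal sketch of the corrected request path; the helper
is illustrative, not Pulse's actual client code:

```go
package proxmox

import "fmt"

// vmStatusPath builds the status URL without the "full" parameter: PVE 9.x
// rejects it with a 400 schema error, and the full payload is returned by
// default without it.
func vmStatusPath(node string, vmid int) string {
	return fmt.Sprintf("/nodes/%s/qemu/%d/status/current", node, vmid)
}
```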
2025-11-13 11:22:36 +00:00
rcourtman
f61b850179 Ensure VM status requests always return meminfo (Related to #694) 2025-11-12 17:30:10 +00:00
rcourtman
9199892115 Fix Windows VM disk accumulation bug by normalizing drive letters
Related to #656

Windows guest agents can return multiple directory mountpoints (C:\, C:\Users,
C:\Windows) all on the same physical drive. When the QEMU guest agent omits
disk[] metadata, commit 5325ef481 falls back to using the mountpoint string
as the disk identifier. This causes every Windows directory to be treated as
a separate disk, accumulating to inflated totals (e.g., 1TB reported for a
250GB drive).

Root cause:
The fallback logic in pkg/proxmox/client.go:1585-1594 assigns fs.Disk =
fs.Mountpoint when disk[] is missing. On Windows, every directory path is
unique, so the deduplication guard in
internal/monitoring/monitor_polling.go:619-635 never triggers, causing all
directories to be summed.

Changes:
- Detect Windows-style mountpoints (drive letter + colon + backslash)
- Normalize to drive root when disk[] is missing (e.g., C:\Users → C:)
- Preserve existing behavior for Linux/BSD and VMs with disk[] metadata
- Add debug logging for synthesized Windows drive identifiers

This fix maintains backward compatibility with commit 5325ef481 while
preventing the Windows directory accumulation issue. LXC containers are
unaffected as they use a different code path.
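
A minimal sketch of the normalization, assuming drive-letter mountpoints of
the form "C:\" or "C:\Users"; the function name is illustrative:

```go
package guestdisk

import "regexp"

// winDrive matches a Windows drive letter followed by a colon and backslash.
var winDrive = regexp.MustCompile(`^([A-Za-z]):\\`)

// normalizeWindowsMountpoint collapses any path on a Windows drive to its
// drive root ("C:\Users" -> "C:") so directory mountpoints dedupe to one
// disk; non-Windows mountpoints pass through unchanged.
func normalizeWindowsMountpoint(mountpoint string) string {
	if m := winDrive.FindStringSubmatch(mountpoint); m != nil {
		return m[1] + ":"
	}
	return mountpoint
}
```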
2025-11-07 12:27:11 +00:00
rcourtman
1a78dcbba2 Fix guest agent disk data regression on Proxmox 8.3+
Related to #630

Proxmox 8.3+ changed the VM status API to return the `agent` field as an
object ({"enabled":1,"available":1}) instead of an integer (0 or 1). This
caused Pulse to incorrectly treat VMs as having no guest agent, resulting
in missing disk usage data (disk:-1) even when the guest agent was running
and functional.

The issue manifested as:
- VMs showing "Guest details unavailable" or missing disk data
- Pulse logs showing no "Guest agent enabled, querying filesystem info" messages
- `pvesh get /nodes/<node>/qemu/<vmid>/agent/get-fsinfo` working correctly
  from the command line, confirming the agent was functional

Root cause:
The VMStatus struct defined `Agent` as an int field. When Proxmox 8.3+ sent
the new object format, JSON unmarshaling silently left the field at zero,
causing Pulse to skip all guest agent queries.

Changes:
- Created VMAgentField type with custom UnmarshalJSON to handle both formats:
  * Legacy (Proxmox <8.3): integer (0 or 1)
  * Modern (Proxmox 8.3+): object {"enabled":N,"available":N}
- Updated VMStatus.Agent from `int` to `VMAgentField`
- Updated all references to `detailedStatus.Agent` to use `.Agent.Value`
- The unmarshaler prioritizes the "available" field over "enabled" to ensure
  we only query when the agent is actually responding

This fix maintains backward compatibility with older Proxmox versions while
supporting the new format introduced in Proxmox 8.3+.
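
A minimal sketch of the dual-format unmarshaler described above; type and
field names follow the message, but details may differ from Pulse's actual
code:

```go
package proxmox

import "encoding/json"

// VMAgentField accepts both the legacy integer form (0/1) and the
// Proxmox 8.3+ object form {"enabled":N,"available":N}.
type VMAgentField struct {
	Value int
}

func (a *VMAgentField) UnmarshalJSON(data []byte) error {
	var n int
	if err := json.Unmarshal(data, &n); err == nil { // legacy: plain integer
		a.Value = n
		return nil
	}
	var obj struct {
		Enabled   int `json:"enabled"`
		Available int `json:"available"`
	}
	if err := json.Unmarshal(data, &obj); err != nil {
		return err
	}
	// Prefer "available": only query when the agent is actually responding.
	if obj.Available != 0 {
		a.Value = obj.Available
	} else {
		a.Value = obj.Enabled
	}
	return nil
}
```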
2025-11-06 18:42:46 +00:00
rcourtman
af55362009 Fix inflated RAM usage reporting for LXC containers
Related to #553

## Problem

LXC containers showed inflated memory usage (e.g., 90%+ when actual usage was 50-60%,
96% when actual was 61%) because the code used the raw `mem` value from Proxmox's
`/cluster/resources` API endpoint. This value comes from cgroup `memory.current` which
includes reclaimable cache and buffers, making memory appear nearly full even when
plenty is available.

## Root Cause

- **Nodes**: Had sophisticated cache-aware memory calculation with RRD fallbacks
- **VMs (qemu)**: Had detailed memory calculation using guest agent meminfo
- **LXCs**: Naively used `res.Mem` directly without any cache-aware correction

The Proxmox cluster resources API's `mem` field for LXCs includes cache/buffers
(from cgroup memory accounting), which should be excluded for accurate "used" memory.

## Solution

Implement cache-aware memory calculation for LXC containers by:

1. Adding `GetLXCRRDData()` method to fetch RRD metrics for LXC containers from
   `/nodes/{node}/lxc/{vmid}/rrddata`
2. Using RRD `memavailable` to calculate actual used memory (total - available)
3. Falling back to RRD `memused` if `memavailable` is not available
4. Only using cluster resources `mem` value as last resort

This matches the approach already used for nodes and VMs, providing consistent
cache-aware memory reporting across all resource types.

## Changes

- Added `GuestRRDPoint` type and `GetLXCRRDData()` method to pkg/proxmox
- Added `GetLXCRRDData()` to ClusterClient for cluster-aware operations
- Modified LXC memory calculation in `pollPVEInstance()` to use RRD data when available
- Added guest memory snapshot recording for LXC containers
- Updated test stubs to implement the new interface method

## Testing

- Code compiles successfully
- Follows the same proven pattern used for nodes and VMs
- Includes diagnostic snapshot recording for troubleshooting
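
A minimal sketch of the fallback order from the Solution section above,
assuming an RRD point carrying memavailable/memused; names are
illustrative, not the monitor's actual code:

```go
package memcalc

// GuestRRDPoint mirrors the fields the fallback relies on (zero means absent).
type GuestRRDPoint struct {
	MemTotal     uint64
	MemAvailable uint64
	MemUsed      uint64
}

// lxcUsedMemory picks the most cache-aware figure available: RRD
// memavailable first, then RRD memused, then the raw cluster-resources mem.
func lxcUsedMemory(rrd *GuestRRDPoint, clusterMem uint64) uint64 {
	if rrd != nil {
		if rrd.MemAvailable > 0 && rrd.MemTotal >= rrd.MemAvailable {
			return rrd.MemTotal - rrd.MemAvailable
		}
		if rrd.MemUsed > 0 {
			return rrd.MemUsed
		}
	}
	return clusterMem // last resort: includes cache/buffers
}
```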
2025-11-06 00:16:18 +00:00
rcourtman
6eb1a10d9b Refactor: Code cleanup and localStorage consolidation
This commit includes comprehensive codebase cleanup and refactoring:

## Code Cleanup
- Remove dead TypeScript code (types/monitoring.ts, 194 duplicate lines)
- Remove unused Go functions (GetClusterNodes, MigratePassword, GetClusterHealthInfo)
- Clean up commented-out code blocks across multiple files
- Remove unused TypeScript exports (helpTextClass, private tag color helpers)
- Delete obsolete test files and components

## localStorage Consolidation
- Centralize all storage keys into STORAGE_KEYS constant
- Update 5 files to use centralized keys:
  * utils/apiClient.ts (AUTH, LEGACY_TOKEN)
  * components/Dashboard/Dashboard.tsx (GUEST_METADATA)
  * components/Docker/DockerHosts.tsx (DOCKER_METADATA)
  * App.tsx (PLATFORMS_SEEN)
  * stores/updates.ts (UPDATES)
- Benefits: Single source of truth, prevents typos, better maintainability

## Previous Work Committed
- Docker monitoring improvements and disk metrics
- Security enhancements and setup fixes
- API refactoring and cleanup
- Documentation updates
- Build system improvements

## Testing
- All frontend tests pass (29 tests)
- All Go tests pass (15 packages)
- Production build successful
- Zero breaking changes

Total: 186 files changed, 5825 insertions(+), 11602 deletions(-)
2025-11-04 21:50:46 +00:00
rcourtman
a885fb5472 Surface LXC interface IPs via PVE interfaces API (#596) 2025-10-23 08:07:32 +00:00
rcourtman
b95c01066e Capture dynamic LXC IP metrics (#596) 2025-10-23 07:50:45 +00:00
rcourtman
be85459db2 Add LXC config metadata for guest drawers (#596) 2025-10-23 07:30:32 +00:00
rcourtman
aac3dacd63 Improve LXC guest metrics visibility (#596) 2025-10-22 22:24:33 +00:00
rcourtman
c9543e8a7e Add qemu guest agent version metadata 2025-10-22 15:24:07 +00:00
rcourtman
13e2577c57 Handle FreeBSD guest agent disk counters
Refs #580
2025-10-22 14:06:45 +00:00
rcourtman
32421b36b8 Refs #533: add total-minus-used memory fallback 2025-10-15 18:19:54 +00:00
rcourtman
7e5fa9a147 fix: restore cache-aware node memory on PVE 8.4 2025-10-14 16:40:45 +00:00
rcourtman
2163d6f5a8 Use guest meminfo available for VM memory usage 2025-10-12 11:03:56 +00:00
rcourtman
f46ff1792b Fix settings security tab navigation 2025-10-11 23:29:47 +00:00