Pulse

vrr/Pulse

mirror of https://github.com/rcourtman/Pulse.git synced 2026-04-30 04:20:20 +00:00

Author	SHA1	Message	Date
rcourtman	884c85c2ab	chore: Remove debug logging that exposed config JSON Removed two DEBUG log statements that were logging full nodes config JSON at Info level. This was verbose and potentially exposed sensitive configuration data (credentials, tokens) in logs.	2025-12-02 15:32:02 +00:00
courtmanr@gmail.com	f4c2bd7c35	Implement UI toggle for Hide Local Login (related to issue #750 )	2025-11-25 08:14:19 +00:00
rcourtman	2207642fa9	Related to #727 : normalize persisted Proxmox hosts	2025-11-20 19:58:05 +00:00
courtmanr@gmail.com	11477546f8	Update config persistence, crypto, and dev script	2025-11-20 11:46:20 +00:00
rcourtman	51b368ddc1	feat: make PVE polling interval configurable (related to #467 )	2025-11-18 21:30:04 +00:00
rcourtman	5d99fc2f2d	Fix dark mode toggle wiping API tokens (related to #685 ) Root cause: SaveSystemSettings calls updateEnvFile which rewrites .env on any setting change, triggering the config watcher. The watcher sees API_TOKEN in .env and replaces all UI-created tokens with "Environment token" records, wiping out host-agent scoped tokens. Fix: updateEnvFile now compares the new content with existing content and skips the write if nothing changed. Since dark mode (and other UI settings) are stored in system.json, not .env, toggling theme no longer triggers unnecessary .env rewrites. This prevents the config watcher from being triggered unnecessarily and preserves UI-created API tokens when changing cosmetic settings. Future improvement: Deprecate API_TOKEN/API_TOKENS from .env entirely and make api_tokens.json the single source of truth (requires migration logic).	2025-11-11 00:11:41 +00:00
rcourtman	1b221cca71	feat: Add configurable allowlist for webhook private IP targets (addresses #673 ) Allow homelab users to send webhooks to internal services while maintaining security defaults. Changes: - Add webhookAllowedPrivateCIDRs field to SystemSettings (persistent config) - Implement CIDR parsing and validation in NotificationManager - Convert ValidateWebhookURL to instance method to access allowlist - Add UI controls in System Settings for configuring trusted CIDR ranges - Maintain strict security by default (block all private IPs) - Keep localhost, link-local, and cloud metadata services blocked regardless of allowlist - Re-validate on both config save and webhook delivery (DNS rebinding protection) - Add comprehensive tests for CIDR parsing and IP matching Backend: - UpdateAllowedPrivateCIDRs() parses comma-separated CIDRs with validation - Support for bare IPs (auto-converts to /32 or /128) - Thread-safe allowlist updates with RWMutex - Logging when allowlist is updated or used - Validation errors prevent invalid CIDRs from being saved Frontend: - New "Webhook Security" section in System Settings - Input field with examples and helpful placeholder text - Real-time unsaved changes tracking - Loads and saves allowlist via system settings API Security: - Default behavior unchanged (all private IPs blocked) - Explicit opt-in required via configuration - Localhost (127/8) always blocked - Link-local (169.254/16) always blocked - Cloud metadata services always blocked - DNS resolution checked at both save and send time Testing: - Tests for CIDR parsing (valid/invalid inputs) - Tests for IP allowlist matching - Tests for bare IP address handling - Tests for security boundaries (localhost, link-local remain blocked) Related to #673 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-09 08:31:12 +00:00
rcourtman	431769024f	Fix P1: Config Persistence transaction field synchronization Problem: writeConfigFileLocked() accessed c.tx field without synchronization - Function reads c.tx to check if transaction is active (line 109) - c.tx modified by begin/endTransaction under lock, but read without lock - Race condition: c.tx could change between check and use Impact: - Inconsistent transaction handling - File could be written directly when it should be staged - Or staged when it should be written directly - Data corruption risk during config imports Fix (lines 108-128): - Added documentation that caller MUST hold c.mu lock - Read c.tx into local variable tx while lock is held - Use local copy for transaction check - Safe because all callers hold c.mu when calling writeConfigFileLocked - Transaction field only modified while holding c.mu in begin/endTransaction This maintains the existing contract (callers hold lock) while making the transaction read safe and explicit.	2025-11-07 10:00:31 +00:00
rcourtman	6ca4d9b750	Fix P1/P2 infrastructure issues: panic recovery and optimizations This commit addresses 4 P1 important issues and 1 P2 optimization in infrastructure components: P1-1: Missing Panic Recovery in Discovery Service (service.go:172-195, 499-542) - Problem: No panic recovery in Start(), ForceRefresh(), SetSubnet() goroutines - Impact: Silent service death if scan panics, broken discovery with no monitoring - Fix: - Wrapped initial scan goroutine with defer/recover (lines 172-182) - Wrapped scanLoop goroutine with defer/recover (lines 185-195) - Wrapped ForceRefresh scan with defer/recover (lines 499-509) - Wrapped SetSubnet scan with defer/recover (lines 532-542) - All log panics with stack traces for debugging P1-2: Missing Panic Recovery in Config Watcher Callback (watcher.go:546-556) - Problem: User-provided onMockReload callback could panic and crash watcher - Impact: Panicking callback kills watcher goroutine, no config updates - Fix: Wrapped callback invocation with defer/recover and stack trace logging P1-3: Session Store Stop() Using Send Instead of Close (session_store.go:16-84) - Problem: Stop() used channel send which blocks if nobody reads - Impact: Stop() hangs if backgroundWorker already exited - Fix: - Added sync.Once field stopOnce (line 22) - Changed Stop() to use close() within stopOnce.Do() (lines 80-84) - Prevents double-close panic and ensures all readers are signaled P2-1: Backup Cleanup Inefficient O(n²) Sort (persistence.go:1424-1427) - Problem: Bubble sort used to sort backups by modification time - Impact: Inefficient for large backup counts (>100 files) - Fix: - Replaced bubble sort with sort.Slice() using O(n log n) algorithm - Added "sort" import (line 9) - Maintains same oldest-first ordering for deletion logic All fixes add defensive programming without changing external behavior. Panic recovery ensures services continue operating even with bugs, while optimization reduces cleanup time for backup-heavy environments.	2025-11-07 09:55:22 +00:00
rcourtman	ba6d934204	Fix critical P0 infrastructure concurrency issues This commit addresses 3 critical P0 race conditions and resource leaks in core infrastructure: P0-1: Discovery Service Goroutine Leak (service.go:468, 488) - Problem: ForceRefresh() and SetSubnet() spawned unbounded goroutines without checking if scan already in progress - Impact: Rapid API calls create goroutine explosion, resource exhaustion - Fix: - ForceRefresh: Check isScanning before spawning goroutine (lines 470-476) - SetSubnet: Check isScanning, defer scan if already running (lines 491-504) - Both now log when skipping to aid debugging P0-2: Config Persistence Unlock/Relock Race (persistence.go:1177-1206) - Problem: LoadNodesConfig() unlocked RLock, called SaveNodesConfig (acquires Lock), then relocked - Impact: Another goroutine could modify config between unlock/relock, causing migrated data loss - Fix: - Copy instance slices while holding RLock to ensure consistency (lines 1189-1194) - Release lock, save copies, then return without relocking (lines 1196-1205) - Prevents TOCTOU vulnerability where migrations could be overwritten P0-3: Config Watcher Channel Close Race (watcher.go:19-178) - Problem: Stop() used select-check-close pattern vulnerable to concurrent calls - Impact: Multiple Stop() calls panic on double-close - Fix: - Added sync.Once field stopOnce to ConfigWatcher struct (line 26) - Changed Stop() to use stopOnce.Do() ensuring single execution (lines 175-178) - Removed racy select-based guard All fixes maintain backwards compatibility and add defensive logging for operational visibility.	2025-11-07 09:49:55 +00:00
rcourtman	9257071ca1	Add encryption status to notification health endpoint (P2) Backend: - Add IsEncryptionEnabled() method to ConfigPersistence - Include encryption status in /api/notifications/health response - Allows frontend to warn when credentials are stored in plaintext Frontend: - Update NotificationHealth type to include encryption.enabled field - Frontend can now display warnings when encryption is disabled This addresses the P2 requirement for encryption visibility, allowing operators to know when notification credentials are not encrypted at rest.	2025-11-07 08:36:55 +00:00
rcourtman	e21a72578f	Add configurable SSH port for temperature monitoring Related to #595 This change adds support for custom SSH ports when collecting temperature data from Proxmox nodes, resolving issues for users who run SSH on non-standard ports. Why SSH is still needed: Temperature monitoring requires reading /sys/class/hwmon sensors on Proxmox nodes, which is not exposed via the Proxmox API. Even when using API tokens for authentication, Pulse needs SSH access to collect temperature data. Changes: - Add `sshPort` configuration to SystemSettings (system.json) - Add `SSHPort` field to Config with environment variable support (SSH_PORT) - Add per-node SSH port override capability for PVE, PBS, and PMG instances - Update TemperatureCollector to accept and use custom SSH port - Update SSH known_hosts manager to support non-standard ports - Add NewTemperatureCollectorWithPort() constructor with port parameter - Maintain backward compatibility with NewTemperatureCollector() (uses port 22) - Update frontend TypeScript types for SSH port configuration Configuration methods: 1. Environment variable: SSH_PORT=2222 2. system.json: {"sshPort": 2222} 3. Per-node override in nodes.enc (future UI support) Default behavior: - Defaults to port 22 if not configured - Maintains full backward compatibility - No changes required for existing deployments The implementation includes proper ssh-keyscan port handling and known_hosts management for non-standard ports using [host]:port notation per SSH standards.	2025-11-05 20:03:29 +00:00
rcourtman	c93581e1aa	Add DNS caching to reduce excessive DNS queries Related to #608 Implements DNS caching using rs/dnscache to dramatically reduce DNS query volume for frequently accessed Proxmox hosts. Users were reporting 260,000+ DNS queries in 37 hours for the same hostnames. Changes: - Added rs/dnscache dependency for DNS resolution caching - Created pkg/tlsutil/dnscache.go with DNS cache wrapper - Updated HTTP client creation to use cached DNS resolver - Added DNSCacheTimeout configuration option (default: 5 minutes) - Made DNS cache timeout configurable via: - system.json: dnsCacheTimeout field (seconds) - Environment variable: DNS_CACHE_TIMEOUT (duration string) - DNS cache periodically refreshes to prevent stale entries Benefits: - Reduces DNS query load on local DNS servers by ~99% - Reduces network traffic and DNS query log volume - Maintains fresh DNS entries through periodic refresh - Configurable timeout for different network environments Default behavior: 5-minute cache timeout with automatic refresh	2025-11-05 18:25:38 +00:00
rcourtman	d52ac6d8b5	Fix CSRF token validation and improve token management - Add Access-Control-Expose-Headers to allow frontend to read X-CSRF-Token response header - Implement proactive CSRF token issuance on GET requests when session exists but CSRF cookie is missing - Ensures frontend always has valid CSRF token before making POST requests - Fixes 403 Forbidden errors when toggling system settings This resolves CSRF validation failures that occurred when CSRF tokens expired or were missing while valid sessions existed.	2025-11-05 09:23:44 +00:00
rcourtman	5a2d808aa1	Harden setup token flow and enforce encrypted persistence	2025-10-25 16:00:37 +00:00
rcourtman	d643dcf0bc	perf: reduce polling allocations and guest metadata load	2025-10-25 13:12:47 +00:00
rcourtman	5c54685f04	Add API token scopes and standalone host agent Introduces granular permission scopes for API tokens (docker:report, docker:manage, host-agent:report, monitoring:read/write, settings:read/write) allowing tokens to be restricted to minimum required access. Legacy tokens default to full access until scopes are explicitly configured. Adds standalone host agent for monitoring Linux, macOS, and Windows servers outside Proxmox/Docker estates. New Servers workspace in UI displays uptime, OS metadata, and capacity metrics from enrolled agents. Includes comprehensive token management UI overhaul with scope presets, inline editing, and visual scope indicators.	2025-10-23 11:40:31 +00:00
rcourtman	be26f957c0	Add snapshot size alert thresholds (#585 )	2025-10-22 13:30:40 +00:00
rcourtman	ff4dc49ae4	Update Pulse install flow and related components	2025-10-21 19:58:53 +00:00
rcourtman	999da6d900	feat: production-ready import/export with API tokens and transactional rollback Export/import payload bumped to v4.1 to include API tokens alongside existing config bundle, eliminating blind spots in disaster recovery scenarios. ## Key Features API Tokens in Exports (v4.1) - Exports now include API token metadata (ID, name, hash, prefix, suffix, timestamps) - Export format version bumped from 4.0 to 4.1 - Fixes gap where API tokens were lost during config migrations Transactional Atomic Imports - New importTransaction helper stages all writes before committing - On failure, automatic rollback restores original configs - Prevents partial/corrupted imports that could break running systems - All config writes (nodes, alerts, email, webhooks, apprise, system, OIDC, API tokens, guest metadata) now transaction-aware Backward Compatibility - Version 4.0 exports (without API tokens) still import successfully - System logs notice but proceeds, leaving existing API tokens untouched - No breaking changes to existing export/import workflows ## Implementation Files Added: - internal/config/import_transaction.go - Transaction helper with staging/rollback Files Modified: - internal/config/export.go - v4.1 export, transactional ImportConfig wrapper - internal/config/persistence.go - Transaction-aware Save* methods, beginTransaction/endTransaction helpers - internal/config/persistence_test.go - 4 comprehensive unit tests Testing: - TestExportConfigIncludesAPITokens - Verifies API tokens in v4.1 exports - TestImportConfigTransactionalSuccess - Validates atomic import success path - TestImportConfigRollbackOnFailure - Confirms rollback on mid-import failure - TestImportAcceptsVersion40Bundle - Ensures backward compatibility with v4.0 All tests passing ✅ ## Migration Notes - No manual migration required - Users can re-export to generate v4.1 bundles with API tokens - Existing 4.0 bundles remain valid for import - Recommended: Re-run export after upgrade to ensure API tokens are captured Co-authored-by: Codex (implementation) Co-authored-by: Claude (coordination and testing)	2025-10-21 14:37:44 +00:00
rcourtman	56c6c0cc0c	feat: improve discovery with progress tracking, validation, and structured errors Significantly enhanced network discovery feature to eliminate false positives, provide real-time progress updates, and better error reporting. Key improvements: - Require positive Proxmox identification (version data, auth headers, or certificates) instead of reporting any service on ports 8006/8007 - Add real-time progress tracking with phase/target counts and completion percentage - Implement structured error reporting with IP, phase, type, and timestamp details - Fix TLS timeout handling to prevent hangs on unresponsive hosts - Expose progress and structured errors via WebSocket for UI consumption - Reduce log verbosity by moving discovery logs to debug level - Fix duplicate IP counting to ensure progress reaches 100% Breaking changes: None (backward compatible with legacy API methods)	2025-10-20 22:29:30 +00:00
rcourtman	5ebb32ce10	feat: enhance runtime configuration and system settings management Improves configuration handling and system settings APIs to support v4.24.0 features including runtime logging controls, adaptive polling configuration, and enhanced config export/persistence. Changes: - Add config override system for discovery service - Enhance system settings API with runtime logging controls - Improve config persistence and export functionality - Update security setup handling - Refine monitoring and discovery service integration These changes provide the backend support for the configuration features documented in the v4.24.0 release.	2025-10-20 17:41:19 +00:00
rcourtman	57429900a6	feat: add adaptive polling scheduler infrastructure (Phase 2 Tasks 1-3) Implements adaptive scheduling foundation for Phase 2: - Poll cycle metrics: duration, staleness, queue depth, in-flight counters - Adaptive scheduler with pluggable staleness/interval/enqueue interfaces - Config support: ADAPTIVE_POLLING_ENABLED flag + min/max/base intervals - Feature flag defaults to disabled for safe rollout - Scheduler wiring into Monitor with conditional instantiation Tasks 1-3 of 10 complete. Ready for staleness tracker implementation.	2025-10-20 15:13:37 +00:00
Pulse Automation Bot	cfdfe896be	Adjust backup and snapshot alert handling	2025-10-18 20:11:01 +00:00
Pulse Automation Bot	80b9d0602a	Add Apprise notification integration (#570 )	2025-10-18 16:39:39 +00:00
Pulse Automation Bot	0b4e4f9c59	Add configurable backup polling interval	2025-10-18 13:06:41 +00:00
rcourtman	4838793677	feat: enhance alerts system with tests and improved thresholds - Add comprehensive test coverage for alerts package with 285+ new tests - Implement ThresholdsTable component with metric thresholds display - Enhance Alerts page UI with improved layout and metric filtering - Add frontend component tests for Alerts page and ThresholdsTable - Set up Vitest testing infrastructure for SolidJS components - Improve config persistence with better validation - Expand discovery tests with 333+ test cases - Update API, configuration, and Docker monitoring documentation	2025-10-15 22:25:04 +00:00
rcourtman	5c79d2516d	feat: streamline docker agent onboarding	2025-10-14 09:45:32 +00:00
rcourtman	f46ff1792b	Fix settings security tab navigation	2025-10-11 23:29:47 +00:00

29 commits