Pulse

vrr/Pulse

mirror of https://github.com/rcourtman/Pulse.git synced 2026-04-29 03:50:18 +00:00

Author	SHA1	Message	Date
rcourtman	d554c9dbb2	fix(sensor-proxy): eliminate all uncoordinated config writers Remove all code paths that manipulate config files without Phase 2 locking: 1. Installer: Remove ensure_allowed_nodes_file_reference() call (line 1674) - Migration now handled exclusively by config migrate-to-file 2. Installer: Make migration failures fatal in update_allowed_nodes() - Prevents fallback to unsafe Python manipulation 3. Daemon sanitizer: Remove os.WriteFile() call - Now only sanitizes in-memory copy, doesn't write back to disk - Logs warning instructing admin to run `config migrate-to-file` 4. Self-heal script: Replace 132 lines of Python with CLI call - sanitize_allowed_nodes() now calls `config migrate-to-file` - Eliminates uncoordinated Python-based config rewriting All config mutations now flow exclusively through Phase 2 CLI with atomic operations and file locking. No code paths remain that can create duplicate allowed_nodes blocks. Addresses Codex review feedback on Phase 2 gaps.	2025-11-19 10:55:01 +00:00
rcourtman	0565781655	feat(sensor-proxy): Phase 2 - atomic config management with CLI Implements bullet-proof configuration management to completely eliminate allowed_nodes corruption by design. This builds on Phase 1 (file-only mode) by replacing all shell/Python config manipulation with proper Go tooling. New Features: - `pulse-sensor-proxy config validate` - parse and validate config files - `pulse-sensor-proxy config set-allowed-nodes` - atomic node list updates - File locking via flock prevents concurrent write races - Atomic writes (temp file + rename) ensure consistency - systemd ExecStartPre validation prevents startup with bad config Architectural Changes: 1. Installer now calls config CLI instead of embedded Python/shell scripts 2. All config mutations go through single authoritative writer 3. Deduplication and normalization handled in Go (reuses existing logic) 4. Sanitizer kept as noisy failsafe (warns if corruption still occurs) Implementation Details: - New cmd/pulse-sensor-proxy/config_cmd.go with cobra commands - withLockedFile() wrapper ensures exclusive access - atomicWriteFile() uses temp + rename pattern - Installer update_allowed_nodes() simplified to CLI calls - Both systemd service modes include ExecStartPre validation Why This Works: - Single code path for all writes (no shell/Python divergence) - File locking serializes self-heal timer + manual installer runs - Validation gate prevents proxy from starting with corrupt config - CLI uses same YAML parser as the daemon (guaranteed compatibility) Phase 2 Benefits: - Corruption impossible by design (not just detected and fixed) - No more Python dependency for config management - Atomic operations prevent partial writes - Clear error messages on validation failures The defensive sanitizer remains active but now logs loudly if triggered, allowing us to confirm Phase 2 eliminates corruption in production before removing the safety net entirely. This completes the fix for the recurring temperature monitoring outages. Related to Phase 1 commit `53dec6010`	2025-11-19 09:37:49 +00:00
rcourtman	509e87ca35	Sanitize duplicate allowed_nodes blocks	2025-11-18 19:33:26 +00:00
rcourtman	eca1f272ca	Move allowed_nodes to managed file	2025-11-16 10:06:58 +00:00
rcourtman	47d5c14aef	Improve temperature proxy control-plane flow	2025-11-15 21:49:51 +00:00
rcourtman	2ee693cc63	Add HTTP mode to pulse-sensor-proxy for multi-instance temperature monitoring This implements HTTP/HTTPS support for pulse-sensor-proxy to enable temperature monitoring across multiple separate Proxmox instances. Architecture changes: - Dual-mode operation: Unix socket (local) + HTTPS (remote) - Unix socket remains default for security/performance (no breaking change) - HTTP mode enables temps from external PVE hosts Backend implementation: - Add HTTPS server with TLS + Bearer token authentication to sensor-proxy - Add TemperatureProxyURL and TemperatureProxyToken fields to PVEInstance - Add HTTP client (internal/tempproxy/http_client.go) for remote proxy calls - Update temperature collector to prefer HTTP proxy when configured - Fallback logic: HTTP proxy → Unix socket → direct SSH (if not containerized) Configuration: - pulse-sensor-proxy config: http_enabled, http_listen_addr, http_tls_cert/key, http_auth_token - PVEInstance config: temperature_proxy_url, temperature_proxy_token - Environment variables: PULSE_SENSOR_PROXY_HTTP_* for all HTTP settings Security: - TLS 1.2+ with modern cipher suites - Constant-time token comparison (timing attack prevention) - Rate limiting applied to HTTP requests (shared with socket mode) - Audit logging for all HTTP requests Next steps: - Update installer script to support HTTP mode + auto-registration - Add Pulse API endpoint for proxy registration - Generate TLS certificates during installation - Test multi-instance temperature collection Related to #571 (multi-instance architecture)	2025-11-13 16:13:53 +00:00
rcourtman	7062b07411	feat(security): Add node allowlist validation to prevent SSRF attacks Implements comprehensive node validation system to prevent SSRF attacks via the temperature proxy. Addresses critical vulnerability where proxy would SSH to any hostname/IP passing format validation. Features: - Configurable allowed_nodes list (hostnames, IPs, CIDR ranges) - Automatic Proxmox cluster membership validation - 5-minute cluster membership cache to reduce pvecm overhead - strict_node_validation option for strict vs permissive modes - New metric: pulse_proxy_node_validation_failures_total{node,reason} - Logs blocked attempts at WARN level with 'potential SSRF attempt' Configuration: - allowed_nodes: [] (empty = auto-discover from cluster) - strict_node_validation: true (require cluster membership) Default behavior: Empty allowlist + Proxmox host = validate cluster members (secure by default, backwards compatible). Related to security audit 2025-11-07. Co-authored-by: Codex <codex@openai.com>	2025-11-07 17:08:28 +00:00
rcourtman	930ad20921	Add configurable log level for pulse-sensor-proxy Users can now control logging verbosity through: - YAML config file: log_level: "debug\|info\|warn\|error" - Environment variable: PULSE_SENSOR_PROXY_LOG_LEVEL Default log level is set to "info" instead of debug, reducing verbose output. Supported levels: trace, debug, info, warn, error, fatal, panic, disabled Related to #629	2025-11-05 19:48:00 +00:00
rcourtman	44d5f91e92	feat: make pulse-sensor-proxy rate limits configurable Add support for configuring rate limits via config.yaml to allow administrators to tune the proxy for different deployment sizes. Changes: - Add RateLimitConfig struct to config.go with per_peer_interval_ms and per_peer_burst - Update newRateLimiter() to accept optional RateLimitConfig parameter - Load rate limit config from YAML and apply overrides to defaults - Update tests to pass nil for default behavior - Add comprehensive config.example.yaml with documentation Configuration examples: - Small (1-3 nodes): 1000ms interval, burst 5 (default) - Medium (4-10 nodes): 500ms interval, burst 10 - Large (10+ nodes): 250ms interval, burst 20 Defaults remain conservative (1 req/sec, burst 5) to support most deployments while allowing customization for larger environments. Related: #`46b8b8d08` (rate limit fix for multi-node support)	2025-10-21 11:25:21 +00:00
rcourtman	e4c3b06f14	Automate sensor proxy container mount and auth	2025-10-14 12:41:48 +00:00
rcourtman	b952444837	refactor: Rename pulse-temp-proxy to pulse-sensor-proxy The name "temp-proxy" implied a temporary or incomplete implementation. The new name better reflects its purpose as a secure sensor data bridge for containerized Pulse deployments. Changes: - Renamed cmd/pulse-temp-proxy/ to cmd/pulse-sensor-proxy/ - Updated all path constants and binary references - Renamed environment variables: PULSE_TEMP_PROXY_* to PULSE_SENSOR_PROXY_* - Updated systemd service and service account name - Updated installation, rotation, and build scripts - Renamed hardening documentation - Maintained backward compatibility for key removal during upgrades	2025-10-13 13:17:05 +00:00

11 commits