Pulse

vrr/Pulse

mirror of https://github.com/rcourtman/Pulse.git synced 2026-04-28 11:30:15 +00:00

Author	SHA1	Message	Date
rcourtman	3fce14469c	chore: remove legacy proxy handlers and unused functions Remove legacy V1 handlers replaced by V2 versions: - sendError (replaced by sendErrorV2) - handleGetStatus (replaced by handleGetStatusV2) - handleEnsureClusterKeys (replaced by handleEnsureClusterKeysV2) - handleRegisterNodes (replaced by handleRegisterNodesV2) - handleGetTemperature (replaced by handleGetTemperatureV2) Also remove related unused functions: - getPublicKey wrapper (only getPublicKeyFrom is used) - pushSSHKey wrapper (only pushSSHKeyFrom is used) - nodeValidator.ipAllowed method (standalone ipAllowed is used) - validateConfigFile (never called) - runServiceDebug (Windows debug mode, never called)	2025-11-27 08:41:28 +00:00
courtmanr@gmail.com	37b1517bd8	feat: implement atomic config management in sensor proxy	2025-11-20 19:01:24 +00:00
rcourtman	4419d8be87	fix(sensor-proxy): sanitize duplicate blocks before migration The migrate-to-file command now calls sanitizeDuplicateAllowedNodesBlocks before parsing the config, allowing it to handle corrupted configs with duplicate allowed_nodes blocks. This ensures migration works even on hosts that were affected by the original corruption issue.	2025-11-19 10:38:04 +00:00
rcourtman	28cd487889	feat(sensor-proxy): complete Phase 2 with CLI-based config migration Add `config migrate-to-file` command and update installer to eliminate all shell/Python config manipulation, ensuring atomic operations throughout. Changes: - Add `config migrate-to-file` command to atomically migrate inline allowed_nodes blocks to file-based configuration - Update installer's update_allowed_nodes() to call CLI exclusively - Simplify migrate_inline_allowed_nodes_to_file() to use CLI - Remove dependency on Python/sed for config manipulation - Implement dual-file locking (config.yaml + allowed_nodes.yaml) to prevent race conditions during migration All config mutations now flow through the Phase 2 CLI with: - File locking (flock) - Atomic writes (temp + rename + fsync) - Proper YAML parsing/generation This completes Phase 2 architecture and eliminates the root cause of config corruption issues. Related to prior commits: `53dec6010`, `3dc073a28`, `804a638ea`, `131666bc1`	2025-11-19 10:35:49 +00:00
rcourtman	d99a855ee7	fix(sensor-proxy): lock file permissions and deadlock prevention Final security hardening based on second Codex review: Lock File Permission Fix (Security) - Lock file now created with 0600 instead of 0644 - Prevents unprivileged users from opening lock and holding LOCK_EX - Without this, any local user could DoS the installer/self-heal - Added f.Chmod(0600) to fix permissions on existing lock files Deadlock Prevention (Future-Proofing) - Added documentation for future multi-file locking scenarios - Specifies consistent lock ordering requirement (config.yaml.lock before allowed_nodes.yaml.lock) - Prevents potential deadlocks if future commands modify multiple files - Current implementation only locks one file, so no immediate issue Testing: ✅ Lock file created as `-rw-------` (0600) ✅ Existing lock files with wrong perms get fixed ✅ Unprivileged users can no longer DoS the lock Codex Validation: - Locking is now correct (persistent .lock file, held during entire operation) - Atomic writes complete while lock is held - Validation honors actual config paths - Empty lists supported for operational flexibility - Error propagation prevents silent failures - No remaining race conditions or security issues Phase 2 is now complete and Codex-verified as secure. Related to Phase 2 fixes commit `804a638ea`	2025-11-19 09:51:20 +00:00
rcourtman	1162a208cc	fix(sensor-proxy): critical Phase 2 locking and validation fixes Fixes critical issues found by Codex code review: 1. Fixed file locking race condition (CRITICAL) - Lock file was being replaced by atomic rename, invalidating the lock - New approach: lock a separate `.lock` file that persists across renames - Ensures concurrent writers (installer + self-heal timer) are properly serialized - Without this fix, corruption was still possible despite Phase 2 2. Fixed validation to honor configured allowed_nodes_file path - validate command now uses loadConfig() to read actual config - Respects allowed_nodes_file setting instead of assuming default path - Prevents false positives/negatives when path is customized 3. Allow empty allowed_nodes lists - Empty lists are valid (admin may clear for security, or rely on IPC validation) - validate no longer fails on empty lists - set-allowed-nodes --replace with zero nodes now supported - Critical for operational flexibility 4. Installer error propagation - update_allowed_nodes failures now exit installer with error - Prevents silent failures that leave stale allowlists - Self-heal will abort instead of masking CLI errors Technical Details: - withLockedFile() now locks `<path>.lock` instead of target file - Lock held for entire duration of read-modify-write-rename - atomicWriteFile() completes while lock is still held - Empty lists represented as `allowed_nodes: []` in YAML Testing: ✅ Lock file created and persists across operations ✅ Empty list can be written with --replace ✅ Validation passes with empty lists ✅ Config path from allowed_nodes_file honored ✅ Concurrent operations properly serialized These fixes ensure Phase 2 actually eliminates corruption by design. Identified by Codex code review Related to Phase 2 commit `3dc073a28`	2025-11-19 09:47:43 +00:00
rcourtman	0565781655	feat(sensor-proxy): Phase 2 - atomic config management with CLI Implements bullet-proof configuration management to completely eliminate allowed_nodes corruption by design. This builds on Phase 1 (file-only mode) by replacing all shell/Python config manipulation with proper Go tooling. New Features: - `pulse-sensor-proxy config validate` - parse and validate config files - `pulse-sensor-proxy config set-allowed-nodes` - atomic node list updates - File locking via flock prevents concurrent write races - Atomic writes (temp file + rename) ensure consistency - systemd ExecStartPre validation prevents startup with bad config Architectural Changes: 1. Installer now calls config CLI instead of embedded Python/shell scripts 2. All config mutations go through single authoritative writer 3. Deduplication and normalization handled in Go (reuses existing logic) 4. Sanitizer kept as noisy failsafe (warns if corruption still occurs) Implementation Details: - New cmd/pulse-sensor-proxy/config_cmd.go with cobra commands - withLockedFile() wrapper ensures exclusive access - atomicWriteFile() uses temp + rename pattern - Installer update_allowed_nodes() simplified to CLI calls - Both systemd service modes include ExecStartPre validation Why This Works: - Single code path for all writes (no shell/Python divergence) - File locking serializes self-heal timer + manual installer runs - Validation gate prevents proxy from starting with corrupt config - CLI uses same YAML parser as the daemon (guaranteed compatibility) Phase 2 Benefits: - Corruption impossible by design (not just detected and fixed) - No more Python dependency for config management - Atomic operations prevent partial writes - Clear error messages on validation failures The defensive sanitizer remains active but now logs loudly if triggered, allowing us to confirm Phase 2 eliminates corruption in production before removing the safety net entirely. This completes the fix for the recurring temperature monitoring outages. Related to Phase 1 commit `53dec6010`	2025-11-19 09:37:49 +00:00

7 commits