Pulse

vrr/Pulse

mirror of https://github.com/rcourtman/Pulse.git synced 2026-05-11 04:43:59 +00:00

Author	SHA1	Message	Date
rcourtman	397871629c	fix: cluster-aware guest deduplication and multi-agent token binding - Add cluster-aware guest ID generation (clusterName-VMID instead of instanceName-VMID) to prevent duplicate VMs/containers when multiple cluster nodes are monitored - Add cluster deduplication at registration time - when a node is added that belongs to an already-configured cluster, merge as endpoint instead of creating duplicate - Add startup consolidation to automatically merge duplicate cluster instances - Change host agent token binding from agent GUID to hostname, allowing: - Multiple host agents to share a token (each bound by hostname) - Agent reinstalls on same host without token conflicts - Remove 12-character password minimum requirement - Remove emoji from auto-registration success message - Fix grouped view node lookup to support both cluster-aware node IDs (clusterName-nodeName) and legacy guest grouping keys (instance-nodeName) Fixes duplicate guests appearing when agents are installed on multiple cluster nodes. Also improves multi-agent UX by allowing shared tokens.	2025-12-14 10:16:17 +00:00
rcourtman	1dc573e209	feat: support token query parameter for WebSocket authentication Allows WebSocket connections to authenticate via ?token= query parameter when headers cannot be sent (browser WebSocket limitations).	2025-12-13 21:28:50 +00:00
rcourtman	8919281718	fix: clear agents that connected during unauthenticated setup window When no auth is configured (fresh install), CheckAuth allows all requests. This creates a race condition where existing agents from a previous setup can report data before the wizard completes security configuration. This fix clears all host agents and docker hosts when /api/security/quick-setup is called, ensuring the wizard shows a clean state after security is configured. Added: - State.ClearAllHosts() - removes all host agents - State.ClearAllDockerHosts() - removes all docker hosts - Monitor.ClearUnauthenticatedAgents() - clears both and resets token bindings - Call to ClearUnauthenticatedAgents() in handleQuickSecuritySetupFixed()	2025-12-13 21:22:04 +00:00
rcourtman	5c4069cdbf	feat: add persistent metrics history API endpoint - Add GET /api/metrics-store/history endpoint for querying SQLite-backed metrics - Support flexible time ranges: 1h, 6h, 12h, 24h, 7d, 30d, 90d - Return aggregated data with min/max values for longer time ranges - Add TypeScript types and ChartsAPI.getMetricsHistory() client method This enables frontend charts to visualize long-term trends using the tiered retention system (raw → minute → hourly → daily averages).	2025-12-13 14:18:16 +00:00
rcourtman	a259b67348	feat: add Kubernetes platform support	2025-12-12 21:31:11 +00:00
rcourtman	8b077f69ce	feat: AI security and policy improvements for 5.0 - Add DOMPurify sanitization for AI chat markdown rendering (XSS fix) - Configure DOMPurify to add target=_blank and rel=noopener to links - Update system prompt to align with command approval policy - Clarify safe vs destructive commands in prompt - Improve patrol auto-fix mode guidance with safe operation list - Add verification requirements for auto-fix actions - Update observe-only mode to be clearer about read-only restrictions	2025-12-12 17:38:55 +00:00
rcourtman	6f0379f879	feat(api): Add AI intelligence API endpoints Expose learned AI intelligence data via REST API: New endpoints: - GET /api/ai/intelligence/patterns - Detected failure patterns - GET /api/ai/intelligence/predictions - Failure predictions - GET /api/ai/intelligence/correlations - Resource correlations - GET /api/ai/intelligence/changes - Recent infrastructure changes - GET /api/ai/intelligence/baselines - Learned baselines All endpoints support ?resource_id filter for per-resource queries. Changes endpoint supports ?hours filter (default: 24). Backend additions: - ai_intelligence_handlers.go - Handler implementations - baseline.Store.GetAllBaselines() - Flat baseline export - patrol.GetChangeDetector() - Access change detector This enables frontend to display: - 'OOM expected in 3 days based on pattern' - 'When storage-1 is full, database VM restarts' - 'VM memory baseline: 60-75%' All tests passing.	2025-12-12 14:49:46 +00:00
rcourtman	d36ad0945f	feat(settings): Add separate Auto-Fix Model setting for remediation Add configurable model specifically for automatic remediation actions: Backend (internal/config/ai.go): - Add AutoFixModel field to AIConfig - Add GetAutoFixModel() getter with fallback chain: AutoFixModel -> PatrolModel -> Model Frontend (AISettings.tsx, types/ai.ts): - Add auto_fix_model to AISettings types - Add Auto-Fix Model dropdown (only shows when patrol_auto_fix enabled) - Falls back to patrol model if not set API (ai_handlers.go): - Add auto_fix_model to response and update request - Handle saving/loading the new field Rationale: - Auto-fix takes real actions, may warrant a more capable model - Patrol observation can use cheaper models for cost savings - Gives users granular control over model costs vs reliability - Model hierarchy: Chat > AutoFix > Patrol > Default	2025-12-12 14:35:28 +00:00
rcourtman	9539ddaa6b	feat(ai): Add multi-resource correlation detection (Phase 6) Create internal/ai/correlation package: 1. Correlation Detector (detector.go): - Tracks events across resources - Detects when events on one resource follow events on another - Calculates average delay between correlated events - Confidence scoring based on occurrence count - Persists to ai_correlations.json 2. Features: - GetCorrelations() - All detected relationships - GetCorrelationsForResource() - Relationships for one resource - GetDependencies() - What resources depend on this one - GetDependsOn() - What this resource depends on - PredictCascade() - Predict what will be affected - FormatForContext() - AI-consumable summary 3. Integration: - Wire to alert history in router startup - Map alert types to correlation event types - Add correlation context to enriched AI context Example AI context now includes: 'When local-zfs experiences high usage, database often follows within 5 minutes' This enables the AI to understand infrastructure dependencies and predict cascade failures. All tests passing.	2025-12-12 14:26:10 +00:00
rcourtman	9c92bb49df	feat(ai): Wire alert history to pattern detector for event tracking Connect alert system to failure prediction: 1. Add AlertCallback to HistoryManager: - OnAlert() method to register callbacks - Callbacks invoked when alerts are added - Called outside lock to prevent deadlocks 2. Expose OnAlertHistory() on alerts.Manager: - Pass-through to HistoryManager.OnAlert() - Enables external systems to track alerts 3. Wire pattern detector in router startup: - Register callback when pattern detector is created - Convert alert types to trackable events - Pattern detector now learns from production alerts Now every alert (memory_warning, cpu_critical, etc.) is recorded as a historical event for pattern analysis. The AI can predict: 'High memory usage typically occurs every ~3 days (next expected in ~1 day)' All tests passing.	2025-12-12 14:16:03 +00:00
rcourtman	e76e86b298	feat(ai): Add failure pattern detection for predictive intelligence (Phase 5) Create internal/ai/patterns package: 1. Pattern Detector (detector.go): - Records historical events (high memory, OOM, restarts, etc.) - Detects recurring failure patterns - Calculates average interval between occurrences - Computes confidence based on pattern consistency - Predicts when failures will occur again - Persists to ai_patterns.json 2. Event types tracked: - high_memory, high_cpu, disk_full - oom, restart, unresponsive - backup_failed 3. Integration: - Wire PatternDetector into router startup - Add to AI context in buildEnrichedContext - FormatForContext generates failure predictions Example AI context now includes: 'OOM events typically occurs every ~10 days (next expected in ~3 days)' This enables proactive alerts before problems recur. All tests passing.	2025-12-12 14:11:28 +00:00
rcourtman	c63d7828a0	feat(ai): Wire operational memory into router startup Complete Phase 3 integration: - Initialize ChangeDetector and RemediationLog in StartPatrol - Add SetChangeDetector/SetRemediationLog to handler chain: Router -> AISettingsHandler -> Service -> PatrolService - Persist change history to ai_changes.json - Persist remediation log to ai_remediations.json - Both use the Pulse config directory for storage Operational memory is now fully integrated: - Change detector tracks infrastructure changes on each patrol - Recent changes (24h) are appended to AI context - Remediation log ready for command execution logging All tests passing.	2025-12-12 13:54:38 +00:00
rcourtman	21abb6ef01	Clarify AI cost estimates with pricing coverage	2025-12-12 13:19:03 +00:00
rcourtman	4aea5ed730	Unify provider/model normalization for AI cost export	2025-12-12 13:04:42 +00:00
rcourtman	c598069da3	Add AI cost export and top target rollups	2025-12-12 12:55:39 +00:00
rcourtman	39eb94e067	Backup AI usage history on reset	2025-12-12 12:14:13 +00:00
rcourtman	54a3c3c47d	Persist AI cost budget and allow history reset	2025-12-12 12:10:58 +00:00
rcourtman	8310974634	feat(ai): Wire baseline learning loop into router startup Complete Phase 2 baseline integration: - Add baseline_exports.go for clean type aliasing - Wire baseline store initialization into StartPatrol - Implement startBaselineLearning background loop - Runs initial learning after 5 min delay - Updates baselines every hour from metrics history - Learns from 7 days of data for nodes, VMs, containers - Add SetBaselineStore methods throughout the chain (Router -> AIHandler -> Service -> PatrolService) - Persists baselines to data directory as JSON The baseline learning loop: 1. Starts automatically when AI patrol starts 2. Queries metrics history for all resources 3. Computes mean, stddev, percentiles for cpu/memory/disk 4. Saves baselines to disk for durability 5. Anomaly detection uses these baselines in context builder All tests passing.	2025-12-12 11:29:47 +00:00
rcourtman	60c980a921	Show AI cost refresh errors and harden log redaction	2025-12-12 11:05:24 +00:00
rcourtman	88d419dd5b	feat(ai): Add enriched context with historical trends and predictions Phase 1 of Pulse AI differentiation: - Create internal/ai/context package with types, trends, builder, formatter - Implement linear regression for trend computation (growing/declining/stable/volatile) - Add storage capacity predictions (predicts days until 90% and 100%) - Wire MetricsHistory from monitor to patrol service - Update patrol to use buildEnrichedContext instead of basic summary - Update patrol prompt to reference trend indicators and predictions This gives the AI awareness of historical patterns, enabling it to: - Identify resources with concerning growth rates - Predict capacity exhaustion before it happens - Distinguish between stable high usage vs growing problems - Provide more actionable, time-aware insights All tests passing. Falls back to basic summary if metrics history unavailable.	2025-12-12 09:45:57 +00:00
rcourtman	fcea3c11ee	feat(ai): Replace patrol frequency dropdown with custom minutes input - Changed patrol schedule from preset dropdown to freeform number input - Users can now set any interval (min 10 minutes, max 7 days, or 0 to disable) - Added patrol_interval_minutes to API request/response (preset is now deprecated) - Backend validates: min 10 minutes when enabled, max 10080 (7 days) - Frontend shows human-readable duration next to input (e.g., '6h', '2h 30m') Also improved Auto-Fix Mode safety: - Removed '(recommended)' from preset options (was subjective) - Added 'I understand the risks' acknowledgement checkbox - Toggle is disabled until user explicitly acknowledges the risks - Shows prominent warning when Auto-Fix is enabled - Acknowledgement is session-based (must re-acknowledge on page reload)	2025-12-11 23:24:33 +00:00
rcourtman	b3f3fc95c4	feat: Add clear credentials button for each AI provider - Add clear_anthropic_key, clear_openai_key, clear_deepseek_key, clear_ollama_url flags to API - Backend handles clearing with confirmation prompt - Each provider accordion shows Test and Clear buttons when configured - Clear button requires confirmation before removing credentials - Frontend automatically refreshes settings after clearing	2025-12-11 18:24:25 +00:00
rcourtman	df2e36e5e4	feat: Add per-provider test buttons and documentation links - Add /api/ai/test/{provider} endpoint for testing individual providers - Add 'Test' button to each provider accordion (visible when configured) - Shows test result inline (success/error message) - Update help links with direct URLs to API key pages: - Anthropic: console.anthropic.com/settings/keys - OpenAI: platform.openai.com/api-keys - DeepSeek: platform.deepseek.com/api_keys - Ollama: ollama.ai	2025-12-11 18:11:31 +00:00
rcourtman	e842f523b7	feat: Implement multi-provider AI support Backend: - Add per-provider API key fields to AIConfig (AnthropicAPIKey, OpenAIAPIKey, DeepSeekAPIKey, OllamaBaseURL, OpenAIBaseURL) - Add NewForProvider() and NewForModel() factory functions for multi-provider instantiation - Update ListModels() to aggregate models from all configured providers with provider:model format - Update Execute/ExecuteStream to dynamically create provider based on selected model - Update TestConnection to use multi-provider aware provider creation - Add helper functions: HasProvider(), GetConfiguredProviders(), GetAPIKeyForProvider(), GetBaseURLForProvider(), ParseModelString(), FormatModelString() Frontend: - Remove legacy single-provider UI (provider grid, single API key input, single base URL) - Add accordion-style UI for configuring all providers independently - Add model grouping by provider in selectors using optgroup - Update AIChat model dropdown with grouped provider sections - Add helper functions for parsing provider from model ID and grouping models API: - Add multi-provider fields to AISettingsResponse and AISettingsUpdateRequest - Add /api/ai/models endpoint for dynamic model listing - Update settings handlers for per-provider credential management	2025-12-11 16:00:45 +00:00
rcourtman	40236317fb	feat(ai): Add suppression rules management API and UI Users can now: 1. View all suppression rules (both from dismissed findings and manually created) 2. Create manual rules like 'ignore performance issues on debian-go' 3. Delete rules when they want alerts to come back Backend: - Added SuppressionRule type for user-defined rules - Added suppressionRules storage to FindingsStore - Added AddSuppressionRule/GetSuppressionRules/DeleteSuppressionRule methods - Added isSuppressedInternal check for manual rules - Added API handlers and routes for /api/ai/patrol/suppressions Frontend: - Added SuppressionRule interface - Added getSuppressionRules/addSuppressionRule/deleteSuppressionRule API functions - Added getDismissedFindings for viewing dismissed findings Example usage: POST /api/ai/patrol/suppressions { 'resource_id': 'debian-go', 'category': 'performance', 'description': 'Dev container runs hot - expected' }	2025-12-11 00:12:18 +00:00
rcourtman	7350e64f3e	feat(ai): Add LLM memory system for patrol findings Implements a comprehensive feedback system that allows the LLM to 'remember' user decisions about findings, preventing repetitive/annoying alerts. Backend changes: - Extended Finding struct with dismissed_reason, user_note, times_raised, suppressed - Added Dismiss(), Suppress(), SetUserNote(), IsSuppressed() methods to FindingsStore - Added GetDismissedForContext() to format dismissed findings for LLM context - Enhanced buildPatrolPrompt() to inject user feedback context - Added POST /api/ai/patrol/dismiss and /api/ai/patrol/suppress endpoints - Updated IsActive() to exclude suppressed findings Frontend changes: - Added Dismiss dropdown with options: Not an Issue, Expected Behavior, Will Fix Later - Added Never Alert Again option for permanent suppression - Expected Behavior prompts for optional note to help LLM understand context - Added visual badges: recurrence count (×N), dismissed status, suppressed indicator - Display user notes in expanded finding view Also fixes: - Fixed 403 error on Run Patrol (compilation errors from partial refactoring) - Removed non-LLM patrol checks - patrol now uses LLM analysis only - Fixed function signature mismatches in alert_triggered.go The LLM now receives context about previously dismissed findings and is instructed not to re-raise them unless severity has significantly worsened.	2025-12-10 22:55:34 +00:00
rcourtman	1e3fdb6f63	feat(ai): Enhanced AI patrol system with alert triggers and history persistence - Add alert-triggered AI analysis for real-time incident response - Implement patrol history persistence across restarts - Add patrol schedule configuration UI in AI Settings - Enhance AIChat with patrol status and manual trigger controls - Add resource store improvements for AI context building - Expand Alerts page with AI-powered analysis integration - Add Vite proxy config for AI API endpoints - Support both Anthropic and OpenAI providers with streaming	2025-12-10 21:08:22 +00:00
rcourtman	ae7b66ecff	refactor(ai): Remove over-engineered URL discovery service Keep only the simple AI-powered approach: - set_resource_url tool lets AI save discovered URLs - Users ask AI directly: 'Find URLs for my containers' - AI uses its intelligence to discover and set URLs Removed: - URLDiscoveryService (rigid port scanning) - Bulk discovery API endpoints - Frontend discovery button The AI itself is smart enough to iterate through resources and discover URLs when asked.	2025-12-10 08:35:24 +00:00
rcourtman	5a8791d97d	feat(ai): Add bulk URL discovery service - Add URLDiscoveryService for scanning all resources at once - Scans common web ports (80, 443, 8080, 8096, 3000, etc.) - Automatically saves discovered URLs to resource metadata - Add API endpoints for start/status/cancel discovery - Progress tracking with results reporting Endpoints: - POST /api/ai/discover-urls/start - Start bulk discovery - GET /api/ai/discover-urls/status - Check progress - POST /api/ai/discover-urls/cancel - Cancel discovery	2025-12-10 08:30:56 +00:00
rcourtman	387ae309cc	feat(ai): Add URL discovery tool - AI can find and set resource URLs - Add MetadataProvider interface for AI to update resource URLs - Add set_resource_url tool to AI service - Wire up metadata stores to AI service via router - Add URL discovery guidance to AI system prompt - AI can now inspect guests/containers/hosts for web services and automatically save discovered URLs to Pulse metadata Usage: Ask the AI 'Find the web URL for this container' and it will: 1. Check for listening ports and web servers 2. Get the IP address 3. Verify the URL works 4. Save it to Pulse for quick dashboard access	2025-12-10 00:29:07 +00:00
rcourtman	c8adbb7ae5	Add AI monitoring enhancements and host metadata features - Add host metadata API for custom URL editing on hosts page - Enhance AI routing with unified resource provider lookup - Add encryption key watcher script for debugging key issues - Improve AI service with better command timeout handling - Update dev environment workflow with key monitoring docs - Fix resource store deduplication logic	2025-12-09 16:27:46 +00:00
rcourtman	927ac76bad	feat: AI integration, Docker metrics, RAID display, and infrastructure improvements - Add Claude OAuth authentication support with hybrid API key/OAuth flow - Implement Docker container historical metrics in backend and charts API - Add CEPH cluster data collection and new Ceph page - Enhance RAID status display with detailed tooltips and visual indicators - Fix host deduplication logic with Docker bridge IP filtering - Fix NVMe temperature collection in host agent - Add comprehensive test coverage for new features - Improve frontend sparklines and metrics history handling - Fix navigation issues and frontend reload loops	2025-12-09 09:29:27 +00:00
rcourtman	e1e83c8295	fix: complete unified resources WebSocket integration Backend: - Call SetMonitor after router creation to inject resource store - Add debug logging for resource population and broadcast Frontend: - Add resources array to WebSocket store initial state - Handle resources in WebSocket message processing - Use reconcile for efficient state updates The unified resources are now properly: 1. Populated from StateSnapshot on each broadcast cycle 2. Converted to frontend format (ResourceFrontend) 3. Included in WebSocket state messages 4. Received and stored in frontend state 5. Consumed by migrated route components Console now shows '[DashboardView] Using unified resources: VMs: X' confirming the migration is working end-to-end.	2025-12-07 23:52:00 +00:00
rcourtman	f7735cca62	fix: Populate resources on-demand when /api/resources is called The Resources page was showing 0 resources because the store was only populated when /api/state was called (from the dashboard). Now the resources are populated on-demand when /api/resources is accessed. Changes: - Added StateProvider interface to ResourceHandlers - SetStateProvider() method for injecting the monitor - HandleGetResources now calls PopulateFromSnapshot before querying - Router injects monitor as state provider during SetMonitor() This ensures the /resources page works even when accessed directly without visiting the main dashboard first.	2025-12-07 14:47:29 +00:00
rcourtman	f34c28f970	feat: Complete Unified Resource Architecture (Phases 1-3) This commit implements the Unified Resource Architecture for AI-first infrastructure management. Key features: Phase 1 - Backend Unification: - New unified Resource type with 9 resource types, 7 platforms, 7 statuses - Resource store with identity-based deduplication (hostname, machineID, IP) - 8 converter functions (FromNode, FromVM, FromContainer, etc.) - REST API endpoints: /api/resources, /api/resources/stats, /api/resources/{id} - 28 comprehensive unit tests Phase 2 - AI Context Enhancement: - Unified context builder for AI system prompts - Cross-platform query methods: GetTopByCPU, GetTopByMemory, GetTopByDisk - Resource correlation: GetRelated (parent, children, siblings, cluster) - Infrastructure summary: GetResourceSummary with health status counts - AI context now includes top consumers and infrastructure overview Phase 3 - Agent Preference & Hybrid Mode: - Polling optimization methods in resource store - ResourceStoreInterface added to Monitor - SetResourceStore() and shouldSkipNodeMetrics() helper methods - Store automatically wired into Monitor via Router.SetMonitor() - Foundation ready for reduced API polling when agents are active Files added: - internal/resources/resource.go - Core Resource type - internal/resources/store.go - Store with deduplication - internal/resources/converters.go - Type converters - internal/resources/platform_data.go - Platform-specific data - internal/resources/store_test.go - 28 tests - internal/resources/converters_test.go - Converter tests - internal/api/resource_handlers.go - REST API handlers - internal/ai/resource_context.go - AI context builder - .gemini/docs/unified-resource-architecture.md - Architecture docs All tests pass.	2025-12-07 13:49:00 +00:00
rcourtman	28ea85a0a0	Additional updates	2025-12-07 10:22:42 +00:00
rcourtman	bcd7b550d4	AI Problem Solver implementation and various fixes - Implement 'Show Problems Only' toggle combining degraded status, high CPU/memory alerts, and needs backup filters - Add 'Investigate with AI' button to filter bar for problematic guests - Fix dashboard column sizing inconsistencies between bars and sparklines view modes - Fix PBS backups display and polling - Refine AI prompt for general-purpose usage - Fix frontend flickering and reload loops during initial load - Integrate persistent SQLite metrics store with Monitor - Fortify AI command routing with improved validation and logging - Fix CSRF token handling for note deletion - Debug and fix AI command execution issues - Various AI reliability improvements and command safety enhancements	2025-12-06 23:46:08 +00:00
rcourtman	8948e84fe5	feat: AI features, agent improvements, and host monitoring enhancements AI Chat Integration: - Multi-provider support (Anthropic, OpenAI, Ollama) - Streaming responses with markdown rendering - Agent command execution for remote troubleshooting - Context-aware conversations with host/container metadata Agent Updates: - Add --enable-proxmox flag for automatic PVE/PBS token setup - Improve auto-update with semver comparison (prevents downgrades) - Add updatedFrom tracking to report previous version after update - Reduce initial update check delay from 30s to 5s - Add agent version column to Hosts page table Host Metrics: - Add DiskIO stats collection (read/write bytes, ops, time) - Improve disk filtering to exclude Docker overlay mounts - Add RAID array monitoring via mdadm - Enhanced temperature sensor parsing Frontend: - New Agent Version column on Hosts overview table - Improved node modal with agent-first installation flow - Add DiskIO display in host drawer - Better responsive handling for metric bars	2025-12-05 10:37:02 +00:00
rcourtman	53d7776d6b	wip: AI chat integration with multi-provider support - Add AI service with Anthropic, OpenAI, and Ollama providers - Add AI chat UI component with streaming responses - Add AI settings page for configuration - Add agent exec framework for command execution - Add API endpoints for AI chat and configuration	2025-12-04 20:16:53 +00:00
rcourtman	d0d989289a	Refactor alert system: fix race conditions, memory leaks, and improve code quality - Rename checkFlapping to checkFlappingLocked to clarify lock contract - Replace goto statements with structured control flow - Wire up unused recordAlertFired/recordAlertResolved metric hooks - Add trackingMapCleanup goroutine to prevent memory leaks from stale entries - Tighten alert ID validation to alphanumeric + safe punctuation - Fix history save error handling to properly manage backup lifecycle - Add auto-migration for deprecated GroupingWindow field - Refactor 300+ line UpdateConfig into focused helper functions - Unify duplicate evaluateVMCondition/evaluateContainerCondition - Add constants for magic numbers (thresholds, timing, flapping) - Update tests to match new backup behavior	2025-12-02 23:31:36 +00:00
rcourtman	bda8056e48	Add refresh-cluster button to detect new Proxmox cluster members When new nodes are added to a Proxmox cluster after Pulse was initially configured, they weren't showing up in Settings. The existing "Refresh" button only triggered network discovery, not cluster membership re-detection. Changes: - Add POST /api/config/nodes/{id}/refresh-cluster endpoint - Add "Refresh" button in cluster node panel in Settings - Re-detect cluster membership and update stored endpoints Related to #799	2025-12-02 22:01:00 +00:00
rcourtman	4f824ab148	style: Apply gofmt to 37 files Standardize code formatting across test files and monitor.go. No functional changes.	2025-12-02 17:21:48 +00:00
rcourtman	cf26ed7f12	security: Add request body size limits to remaining API handlers Add http.MaxBytesReader to 8 additional handlers to complete API hardening against memory exhaustion attacks: - guest_metadata.go: HandleUpdateMetadata (16KB) - notification_queue.go: RetryDLQItem, DeleteDLQItem (8KB each) - temperature_proxy.go: HandleRegister (8KB) - host_agents.go: HandleReport (256KB) - updates.go: HandleApplyUpdate (8KB) - docker_metadata.go: HandleUpdateMetadata (16KB) - system_settings.go: UpdateSystemSettings (64KB) All API handlers that decode JSON request bodies now have size limits.	2025-12-02 16:47:13 +00:00
rcourtman	b4d497ce3b	security: Add request body size limits to API handlers Add http.MaxBytesReader to 16 additional handlers to prevent memory exhaustion attacks via oversized request bodies: - docker_agents.go: HandleReport (512KB), HandleCommandAck (8KB), HandleSetCustomDisplayName (8KB) - alerts.go: UpdateAlertConfig (64KB), BulkAcknowledgeAlerts (32KB), BulkClearAlerts (32KB) - config_handlers.go: HandleAddNode, HandleTestConnection, HandleUpdateNode, HandleTestNodeConfig (32KB each), HandleVerifyTemperatureSSH, HandleExportConfig, HandleDiscoverServers, HandleSetupScriptURL (8KB each), HandleImportConfig (1MB), HandleUpdateMockMode (16KB)	2025-12-02 16:43:13 +00:00
rcourtman	6eb7f06df1	security: Add request body size limits to notification handlers Add http.MaxBytesReader limits to prevent memory exhaustion attacks: - UpdateEmailConfig: 32KB limit - UpdateAppriseConfig: 64KB limit - CreateWebhook: 64KB limit - UpdateWebhook: 64KB limit This follows the pattern already used in system_settings.go for SSH config validation.	2025-12-02 16:37:30 +00:00
rcourtman	c05817f9de	docs: Add godoc comments to exported functions Add missing godoc comments to: - NewRateLimiter and Allow in ratelimit.go - SnapshotSyncStatus in temperature_proxy.go - NewClient and GetVersion in pkg/pmg/client.go	2025-12-02 15:58:59 +00:00
rcourtman	097976321b	perf: Cache hostname lowercase in temperature proxy lookups Pre-compute strings.ToLower(hostname) before loops that search for matching PVE instances. Avoids repeated lowercasing in two functions.	2025-12-02 15:43:41 +00:00
rcourtman	98d170e087	perf: Cache err.Error() in cluster node validation Cache err.Error() result in two locations in config_handlers.go: - TLS mismatch detection (3x calls to 1) - Standalone node detection (2x calls to 1)	2025-12-02 15:41:18 +00:00
rcourtman	158669296e	refactor: Remove unreachable dead code branches - firstForwardedValue: strings.Split always returns at least one element - shouldRunBackupPoll: remaining is always >= 1 by math - convertContainerDiskInfo: lowerLabel is never empty for non-rootfs All three functions now at 100% coverage.	2025-12-02 14:41:53 +00:00
rcourtman	69bcd6ab0f	test: Add SessionStore.load legacy format tests for API package	2025-12-02 14:12:32 +00:00

... 11 12 13 14 15 ...

979 commits