Pulse/docs/operations
rcourtman e0396c1362 docs: update documentation for diagnostics improvements
Add comprehensive operator documentation for the new observability features
introduced in the previous commit.

**New Documentation:**
- docs/monitoring/PROMETHEUS_METRICS.md - Complete reference for all 18 new
  Prometheus metrics with alert suggestions

**Updated Documentation:**
- docs/API.md - Document X-Request-ID and X-Diagnostics-Cached-At headers,
  explain diagnostics endpoint caching behavior
- docs/TROUBLESHOOTING.md - Add section on correlating API calls with logs
  using request IDs
- docs/operations/ADAPTIVE_POLLING_ROLLOUT.md - Update monitoring checklists
  with new per-node and scheduler metrics
- docs/CONFIGURATION.md - Clarify LOG_FILE dual-output behavior and rotation
  defaults

These updates ensure operators understand:
- How to set up monitoring/alerting for new metrics
- How to configure file logging with rotation
- How to troubleshoot using request correlation
- What metrics are available for dashboards

Related to: 495e6c794 (feat: comprehensive diagnostics improvements)
2025-10-21 12:45:19 +00:00
..
ADAPTIVE_POLLING_MANAGEMENT_ENDPOINTS.md docs: comprehensive v4.24.0 documentation audit and updates 2025-10-20 17:20:13 +00:00
ADAPTIVE_POLLING_ROLLOUT.md docs: update documentation for diagnostics improvements 2025-10-21 12:45:19 +00:00
audit-log-rotation.md docs: comprehensive v4.24.0 documentation audit and updates 2025-10-20 17:20:13 +00:00
pulse-sensor-proxy-runbook.md docs: comprehensive documentation for rate limit fix and configurability 2025-10-21 11:36:07 +00:00