Fix showing errors#294
Conversation
Audit of PR #294The approach is sound — comparing error vs running timestamps per slot is the right way to detect failed rolling updates where old replicas mask the failure. A few findings: 1. Duplicated function
2. No time cutoffThe existing error-detection code uses a 5-minute cutoff ( 3. Empty
|
Extracts taskKeyForService, latestTasksByServiceKey, and activeDeploymentErrorsByService into views/taskutil so both services and stacks views share one implementation. Fixes: - Add 5-minute cutoff to activeDeploymentErrorsByService to avoid surfacing stale historical errors - Handle empty errMsg on failed tasks (fallback to "task <state>") - Pick most recent error when multiple slots fail for same service - Add unit tests for all three functions Co-authored-by: eldara-cruncher <[email protected]> Co-authored-by: Claude Opus 4.6 (1M context) <[email protected]>
Fixes #292