action #105804
closed
- Related to coordination #102882: [epic] All OSD PPC64LE workers except malbec appear to have horribly broken cache service added
- Priority changed from High to Urgent
That ppc jobs are piling up is no surprise due to #102882 . I suggest to pause the alert and track this ticket as blocked as long as #102882 is not solved.
- Status changed from New to Blocked
- Assignee set to mkittler
- Priority changed from Urgent to High
Since I've already checked the performance of the power workers today I'll track this as blocked. I'm also lowering the prio because we have this issue now for quite a while and the figures from the performance test don't suggest it has gotten worse. Considering the alert turned off again after 2 hours I assume the impact is still not that high.
Although this ticket is assigned to mkittler, as okurz asked me directly to pause the alerts, I did that now and set both Job age (scheduled) (max)
and Job age (scheduled) (median)
to paused.
Please remember when resolving, I will also try to.
- Description updated (diff)
- Subject changed from Job age (scheduled) (median) alert to Job age (scheduled) (median) alert size:S
- Description updated (diff)
- Status changed from Blocked to Resolved
It looks good so far so I'm resolving the ticket.
- Related to action #135008: Max job age graphs use mean aggregation when max would make more sense added
Also available in: Atom
PDF