Project

General

Profile

Actions

action #106543

closed

coordination #102882: [epic] All OSD PPC64LE workers except malbec appear to have horribly broken cache service

Conduct rollback steps and check impact for "All OSD PPC64LE workers except malbec appear to have horribly broken cache service" size:M

Added by okurz about 2 years ago. Updated about 2 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Target version:
Start date:
2022-02-10
Due date:
% Done:

0%

Estimated time:

Description

Acceptance criteria

  • AC1: No masked worker instances on all our OSD ppc workers
  • AC2: No alerts relating to ppc job queue or other failures related to ppc machines

Suggestions

  • Conduct the rollback steps described in the epic
  • Crosscheck that all ppc64 OSD worker instances are fully online and are able to work on openQA jobs
  • Monitor some openQA tests running on these instances, e.g. over https://openqa.suse.de
  • Monitor https://monitor.qa.suse.de for related failures
  • Ensure that there are no paused alerts relating to ppc that we had previously disabled
  • Optional: Crosscheck with EngInfra that all ppc machines are actually connected to rack switches and not anymore to core switches
  • Read all comments in the epic to make sure we haven't overlooked something
Actions

Also available in: Atom PDF