Project

General

Profile

Actions

tickets #153160

closed

20240105 postmortem

Added by crameleon 4 months ago. Updated 4 months ago.

Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
Servers hosted in PRG
Target version:
-
Start date:
2024-01-05
Due date:
% Done:

0%

Estimated time:

Description

What/Problem: various openSUSE services were offline
When: 2024-01-05, approximately 00:01:20 to 08:45 UTC
Why:

  • critical cluster services on the Falkor KVM cluster were restarted as part of an automatic OS update
  • the services were excluded from being restarted, but the os-update package overwrote its configuration as part of a previous operating system update, due to a missing %config attribute in its spec file:
              diff:
                  - UPDATE_CMD=auto
                  + UPDATE_CMD="auto"
                  - REBOOT_CMD=auto
                  + REBOOT_CMD="rebootmgr"
                  - RESTART_SERVICES=yes
                  + RESTART_SERVICES="yes"
                  - IGNORE_SERVICES_FROM_RESTART="dbus virtlockd"
                  + IGNORE_SERVICES_FROM_RESTART="dbus virtlockd virtlogd corosync pacemaker sbd"
Actions #1

Updated by crameleon 4 months ago

  • Status changed from New to Resolved
  • Private changed from Yes to No
Actions #2

Updated by crameleon 4 months ago

  • Subject changed from 20240104 postmortem to 20240105 postmortem
Actions #3

Updated by crameleon 4 months ago

  • Description updated (diff)
Actions

Also available in: Atom PDF