Project

General

Profile

Actions

action #160098

closed

openQA Project - coordination #157969: [epic] Upgrade all our infrastructure, e.g. o3+osd workers+webui and production workloads, to openSUSE Leap 15.6

After the upgrade to Leap 15.6 osiris showed no proper mount points again for libvirt VMs size:S

Added by okurz 2 months ago. Updated about 2 months ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
Regressions/Crashes
Target version:
Start date:
2024-05-08
Due date:
% Done:

0%

Estimated time:
Tags:

Description

Observation

Similar as in #125087 now after I upgraded osiris to Leap 15.6 virt-manager only showed a single VM called "first-test-vm", no other machines like "okurz". I manually recovered by logging in over ssh and called

systemctl stop libvirtd
drbdadm up r0
systemctl restart etc-libvirt.mount
systemctl start libvirtd

but we should ensure this does not happen anymore and also we should not even try to start libvirtd if those dependencies are not fulfilled

Acceptance criteria

  • AC1: osiris shows expected production VMs consistently after multiple reboots

Suggestions

  • Look into what happened and what we did in the past in related tickets
  • Check if this is reproducible on reboots
  • Take a look into logs of drbd to see what the problem was (storage shared with seth)
  • Maybe restarts of systemd services can be enough

Rollback actions

Remove silence from https://stats.openqa-monitor.qa.suse.de/alerting/silences alertname=Failed systemd services alert (not openqa)


Related issues 2 (0 open2 closed)

Has duplicate openQA Infrastructure - action #160493: Failed systemd services alert (osiris-1 drbd)Rejectedokurz2024-05-17

Actions
Copied from openQA Project - action #157996: Upgrade all other LSG QE salt controlled machines to openSUSE Leap 15.6Resolvedokurz

Actions
Actions

Also available in: Atom PDF