action #169939
openQA Project (public) - coordination #157969: [epic] Upgrade all our infrastructure, e.g. o3+osd workers+webui and production workloads, to openSUSE Leap 15.6
Upgrade Power8 o3 workers to openSUSE Leap 15.6
Status: New
Priority: Normal
Assignee: -
Category: Organisational
Target version:
Start date: 2024-11-14
Due date:
% Done: 0%
Estimated time:
Tags:
Description
Motivation
- We need to upgrade the workers before the EOL of Leap 15.5 and keep a consistent environment
- Upgrading the power8 workers kerosene.qe.nue2.suse.org and qa-power8-3.openqanet.opensuse.org ran into issues, see #157972
Acceptance criteria
- AC1: All power8 o3 worker machines run a cleanly upgraded openSUSE Leap 15.6 (no failed systemd services, no leftover .rpmnew files, etc.)
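The two conditions named in AC1 can be checked from a shell on the worker. The following is a minimal sketch, not an official check: the function names `list_rpm_leftovers` and `list_failed_units` are hypothetical, and it assumes that leftover package config files live under /etc and carry the usual rpm suffixes (.rpmnew, .rpmsave, .rpmorig).

```shell
#!/bin/sh
# Sketch of an AC1 check; both function names are hypothetical helpers.

# Print leftover config files from package updates below a directory
# (default: /etc). Prints nothing if there are none.
list_rpm_leftovers() {
    find "${1:-/etc}" -type f \
        \( -name '*.rpmnew' -o -name '*.rpmsave' -o -name '*.rpmorig' \) \
        2>/dev/null
}

# Print failed systemd units, one per line. Prints nothing if no unit failed.
list_failed_units() {
    systemctl --failed --no-legend --plain
}
```

After sourcing the snippet, run `list_failed_units` and `list_rpm_leftovers` on each worker; empty output from both suggests the host meets the two conditions spelled out in AC1. Any listed .rpmnew file still needs to be merged or removed by hand.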
Suggestions
- Go through comment #157972-53 to understand the issue
- Read https://progress.opensuse.org/projects/openqav3/wiki#Distribution-upgrades
- Reserve some time when the workers are only executing a few or no openQA test jobs
- Keep the IPMI interface ready and test that Serial-over-LAN works for potential recovery
- Apply the workaround for #162296, i.e.
zypper al -m "boo#1227616" *firewall*
- Try using older package versions of the kernel and util-linux, see #119008-14
- After the upgrade, reboot and check that everything works as expected; additionally use https://github.com/os-autoinst/scripts/blob/master/reboot-stability-check. If something does not work, roll back, e.g. with
transactional-update rollback
Further details
- Don't worry, everything can be repaired :) If the worker gets misconfigured, there are btrfs snapshots to recover from, the IPMI Serial-over-LAN is available, a reinstall is possible and not hard, there is no important data on the host (it's only an openQA worker), and there are other machines that can run jobs while one host is down for a little longer. And okurz can hold your hand :)