action #183110
open coordination #180716: [epic] Support our software and upgrade all our infrastructure, e.g. o3+osd workers+webui and production workloads, to openSUSE Leap 16.0
Upgrade o3 container workers to openSUSE Leap 15.6 size:S
0%
Description
Motivation
It seems we overlooked o3 container workers during our 15.5->15.6 upgrade.
https://suse.slack.com/archives/C02CANHLANP/p1748346789850819?thread_ts=1748249338.816829&cid=C02CANHLANP
openqaworker23_container also still pulls the 15.5 images from devel:openqa, whose builds are disabled as well.
Acceptance criteria
- AC1: All o3 container worker machines referenced in https://progress.opensuse.org/projects/openqav3/wiki/#o3-s390-and-other-eg-bare-metal-workers run a cleanly upgraded openSUSE Leap 15.6 (no failed systemd services, no leftover .rpmnew files, etc.)
Suggestions
- Read https://progress.opensuse.org/projects/openqav3/wiki/#o3-s390-and-other-eg-bare-metal-workers
- Read https://progress.opensuse.org/projects/openqav3/wiki#Distribution-upgrades
- Use the instructions from above
- After the upgrade, reboot and check that everything works as expected
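The post-upgrade check from AC1 could be sketched roughly as below. This is only a minimal sketch, not the procedure from the wiki: the function names, the scan root `/etc` and the suffix list are assumptions for illustration, and the script would have to run wherever the upgraded system lives (host or container).

```python
import os
import subprocess

# Suffixes rpm uses for config files it set aside or did not overwrite.
LEFTOVER_SUFFIXES = (".rpmnew", ".rpmsave", ".rpmorig")


def find_leftover_config_files(root="/etc"):
    """Return a sorted list of files below root that end in an rpm leftover suffix."""
    hits = []
    # os.walk silently skips directories it cannot read.
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if name.endswith(LEFTOVER_SUFFIXES):
                hits.append(os.path.join(dirpath, name))
    return sorted(hits)


def failed_systemd_units():
    """Return the names of systemd units currently in the failed state."""
    result = subprocess.run(
        ["systemctl", "--failed", "--no-legend", "--plain"],
        capture_output=True,
        text=True,
        check=False,
    )
    # With --plain and --no-legend, the unit name is the first column of each line.
    return [line.split()[0] for line in result.stdout.splitlines() if line.strip()]
```

A machine would pass this part of AC1 if both `find_leftover_config_files()` and `failed_systemd_units()` return empty lists after the reboot.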
Updated by okurz 6 days ago
- Copied from action #183107: Upgrade o3 container workers to openSUSE Leap 16.0 added
Updated by okurz 6 days ago
- Related to action #183104: os-autoinst-distri-opensuse-deps did not update since long added
Updated by nicksinger 3 days ago
I had a brief look at this on Friday. I simply changed the URL in /etc/systemd/system/container-openqaworker23_container_101.service
from registry.opensuse.org/devel/openqa/containers15.5/openqa_worker_os_autoinst_distri_opensuse:latest
to registry.opensuse.org/devel/openqa/containers15.6/openqa_worker_os_autoinst_distri_opensuse:latest
and stopped all other 15.5 workers. The newer containers come up fine, connect to OSD and start executing jobs. x3270 changed a bit, so I had to re-create the needle used in https://openqa.opensuse.org/tests/5072530#step/bootloader_s390/1
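For the remaining worker units, the same image switch could be scripted along these lines. This is a hedged sketch only: `switch_image` and `update_unit_file` are hypothetical helper names, the edit above was done by hand, and it assumes the other unit files reference the image with exactly the same string.

```python
from pathlib import Path

OLD_IMAGE = (
    "registry.opensuse.org/devel/openqa/containers15.5/"
    "openqa_worker_os_autoinst_distri_opensuse:latest"
)
NEW_IMAGE = (
    "registry.opensuse.org/devel/openqa/containers15.6/"
    "openqa_worker_os_autoinst_distri_opensuse:latest"
)


def switch_image(unit_text, old=OLD_IMAGE, new=NEW_IMAGE):
    """Return (new_text, count): unit_text with every old image reference replaced."""
    count = unit_text.count(old)
    return unit_text.replace(old, new), count


def update_unit_file(path):
    """Rewrite the unit file in place; return how many references were replaced."""
    unit = Path(path)
    new_text, count = switch_image(unit.read_text())
    if count:
        # Only touch the file when something actually changed.
        unit.write_text(new_text)
    return count
```

After changing a unit file, systemd still needs a `systemctl daemon-reload` and a restart of the affected service before the new image is used.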
Liv helped me to schedule a bunch of test jobs (https://openqa.opensuse.org/tests/overview?version=Tumbleweed&distri=opensuse&build=20250523lemmejusttestthis) and they all show similar problems:
1. Unstable initial connection to the SUT: the test keeps sending "cp i cms" but either gets no response or fails to parse it, resulting in timeouts after several retries: https://openqa.opensuse.org/tests/5072530#step/bootloader_s390/18
2. The test fails to restart the SUT: https://openqa.opensuse.org/tests/5072547#step/reconnect_mgmt_console/1
   2.1. Maybe this is linked to 1. as well