action #180098
closedMulti-Machine tests failing on network access
0%
Description
Observation¶
This is being observed for the last 2 days - all MM machines seem to fail to access network resources
openQA test in scenario microos-Tumbleweed-DVD-x86_64-remote_ssh_target@64bit-2G fails in
networking
Test suite description¶
Maintainer: jrivera Boot with ssh=1 parameter and wait for parallel job (remote_ssh_controller) to install the system.
Reproducible¶
Fails since (at least) Build 20250404
Expected result¶
Last good: 20250403 (or more recent)
Further details¶
Always latest result in this scenario: latest
Files
Updated by livdywan 10 days ago
Hum. I was just taking a look, the error seems clear:
curl: (6) Could not resolve host: openqa.opensuse.org
However just now there's a passing test: https://openqa.opensuse.org/tests/4976935
Updated by okurz 10 days ago
Maybe related to https://github.com/os-autoinst/openQA/pull/6366 ?
Updated by okurz 10 days ago
- Tags set to reactive work
- Project changed from openQA Tests (public) to openQA Project (public)
- Category changed from Bugs in existing tests to Regressions/Crashes
- Priority changed from Immediate to Urgent
- Target version set to Ready
According to rfan1 and dzedro in https://suse.slack.com/archives/C02CANHLANP/p1744022074232239?thread_ts=1744022074.232239&cid=C02CANHLANP no relevant changes from core or osado side so likely something regressed in the o3 infra. Please also see changes applied by @favogt as mitigation.
Updated by mkittler 9 days ago ยท Edited
Note that we enable ip forwarding in os-autoinst/script/os-autoinst-setup-multi-machine
. The setting is nevertheless only present on a few hosts. So I re-executed this line on all hosts mentioned on https://progress.opensuse.org/projects/openqav3/wiki/#Manual-command-execution-on-o3-workers - just to be sure. So we should now have forwarding enabled on all hosts in a persistent way. Considering our setup script already does this I don't see anything to improve upstream. I have no idea how this was working before.
Note that I haven't seen the "one host only" setting on o3 workers (except on the power pc worker). I do remember configuring this. Either I misremember or this configuration was lost. The latter would explain how this was working before but it is probably the former.