[qe-core][qem] Problems with aarch64 RAID 15SP1/SP2 QU tests - **Suggested Backport**
Passed for 15SP1 16 days ago:
Failed in a later build:
(same build, aarch64 specific for playground)
But associated with change last Thursday like:
that led to setup_libyui error:
and with manual schedule omitting the setup_libyui to raid_gpt error:
Then Rodion mentioned YAML schedule should not be used and moved back to non-YAML (even though the passing tests 16 days earlier were using the YAML schedule):
But that lead to different errors, which in turn were partially fixed by:
Before now George started looking at, at least RAID 0, 5 and 10 were proven to have passed at least once for 15SP1 Build 47.3, while 1 and 6 remained problemtic.
For 15SP2, everything passed 5 days ago at https://openqa.suse.de/tests/overview?distri=sle&version=15-SP2&build=375.8&groupid=321 but similarly failures with latest build (rerun can get it further though): https://openqa.suse.de/tests/overview?distri=sle&version=15-SP2&build=376.1&groupid=321
SP2 is still using YAML.
- Subject changed from [qe-core][qem] Problems with aarch64 RAID 15SP1/SP2 QU tests to [qe-core][qem] Problems with aarch64 RAID 15SP1/SP2 QU tests - **Suggested Backport**
- Status changed from In Progress to Resolved
- % Done changed from 0 to 100
Some reneedling was required; The aarch64 RAID jobs of 15SP1 and 15SP2 QUs should be passing consistently now.
Concerning the irregular
reconnect_mgmt_console failure, this issue is actually caused in
The reboot message at the end of the installation has a default timeout of 10 seconds.
In some archs like aarch64 and s390, it happens that await_install module's needle check is not catching up with the 10 second timeout, and reboot is not cancelled.
This results in the machine rebooting when it should not, and failing in the next module,
In order to fix this issue, PR_1 and PR2 were introduced.
This allows for the following usage, as seen in
push @params, 'reboot_timeout=' . get_var('REBOOT_TIMEOUT', 0) unless (is_leap('<15.2') || is_sle('<15-SP2'));
The above line, by default, pushes in the list of bootparams the
reboot_timeout=0 which, for 15-SP2 that contains the two aforementioned PR changes, removes the timeout on the reboot message and openQA will have time to catch up.
However, in 15-SP1 this boot parameter is not checked, so there is no straightforward way of changing or disabling the timeout.
The suggested approach here is to request a backport of this for yast in SLE 15-SP1.
Since it is not likely that there will ever be a new 15-SP1 QU release, the backport approach remains a suggestion for now.