action #12106
closed
s390 sporadic hang in bootloader_s390 on lsdasd
Description
observation
https://openqa.suse.de/tests/396323/modules/bootloader_s390/steps/4
output is there but wait_serial fails to find the output in time
steps to reproduce
TBC
problem
H1. sporadic everywhere but not every time
H2. something worker specific
H3. multiple s390 zVM instances at the same time influence each other
suggestion
DONE: at least improve the code, which we did with https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/1389, even if this issue never reappears.
As this is a timeout issue I suggest to record a soft failure in case of timeout with reference to this issue and then retry.
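The retry suggestion above could look roughly like the following shell sketch. It is only an illustration under assumptions: the function name `retry_with_soft_failure`, its arguments and the message format are hypothetical and are not taken from the referenced pull requests or from the actual test code.

```shell
# Hypothetical helper, not the code from the pull requests in this ticket.
# Usage: retry_with_soft_failure MAX_TRIES REFERENCE COMMAND [ARGS...]
# Runs COMMAND up to MAX_TRIES times. Every failed attempt (for example a
# timeout) is logged to stderr as a soft failure that mentions REFERENCE.
# Returns 0 on the first success and 1 if all attempts failed.
retry_with_soft_failure() {
    local max_tries="$1" reference="$2" try=1
    shift 2
    while [ "$try" -le "$max_tries" ]; do
        if "$@"; then
            return 0
        fi
        echo "soft failure: attempt $try of $max_tries failed, see $reference" >&2
        try=$((try + 1))
    done
    return 1
}
```

A call such as `retry_with_soft_failure 3 "action #12106" some_check` would run the placeholder command `some_check` up to three times. The soft failure stays visible in the log, so a passing retry does not hide the underlying problem.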
Updated by okurz over 8 years ago
PR merged. Let's see if the issue reappears, although mgriessmeier and I could not really reproduce it before trying to improve the source code.
Updated by okurz over 8 years ago
- Description updated (diff)
- Status changed from In Progress to Resolved
I triggered 20 runs on my testing machine (lord.arch -> LINUX157):
for i in {1..20}; do sudo -u _openqa-worker /usr/share/openqa/script/clone_job.pl --from http://… … ; done
All of them succeeded, so I regard this as resolved for now.
Updated by okurz over 8 years ago
- Description updated (diff)
- Status changed from Resolved to In Progress
- Assignee changed from okurz to mgriessmeier
- Priority changed from Low to High
not fixed, still appearing often enough to be regarded as "high": https://openqa.suse.de/tests/404250/modules/bootloader_s390/steps/4
H1 REJECTED: I could not reproduce it locally despite running the same test many times, so H2-4 were added.
Updated by okurz over 8 years ago
more fails:
https://openqa.suse.de/tests/406041/file/autoinst-log.txt
https://openqa.suse.de/tests/406123/file/autoinst-log.txt
https://openqa.suse.de/tests/406203/file/autoinst-log.txt
https://openqa.suse.de/tests/406153/file/autoinst-log.txt
so this one now looks more severe than #10584
Updated by okurz over 8 years ago
- Related to action #10584: [s390] sporadic fail in bootloader_s390, e.g. downloading kernel and initrd added
Updated by okurz over 8 years ago
- Status changed from In Progress to Feedback
- Assignee deleted (mgriessmeier)
intended to be fixed by PR https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/1420, let's wait for feedback
Updated by okurz over 8 years ago
- Status changed from Feedback to In Progress
- Assignee set to mgriessmeier
Updated by okurz over 8 years ago
- Assignee changed from mgriessmeier to okurz
now I can also reproduce it locally, e.g. see http://lord.arch/tests/311?limit_previous=30#previous
All three failures in bootloader_s390 are at the same step, the wait_serial after lsdasd. I am wondering why the serial output of the command is missing here.
Anyway, slightly changing the code and retrying multiple times. Let's see if this makes a difference.
Updated by okurz over 8 years ago
should be fixed by https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/1445
Updated by okurz over 8 years ago
- Status changed from In Progress to Resolved
considered done, e.g. see https://openqa.suse.de/tests/436688 . Did not see a related issue in recent builds.
Updated by okurz over 8 years ago
- Copied to action #12300: [s390] can fail during formatting/wait_serial added