action #155758
closed
coordination #151816: [epic] Handle openQA fixes and job group setup
[sporadic] Reboot takes too long for ppc64le
Added by syrianidou_sofia 9 months ago.
Updated 9 months ago.
Description
For some ppc tests, we experience a sporadic reboot failure at different points of the test. As the failure doesn't always happen at the same point, it could be an infrastructure issue that might be fixed by increasing the timeout.
Failure examples:
https://openqa.suse.de/tests/13549614#next_previous
https://openqa.suse.de/tests/13554127#next_previous
Acceptance criteria
AC1: Check whether the ppc failures go away when the reboot timeout for ppc64le is increased.
Additional information
- In case the issue persists, check if it could be a bug.
- In case there are no indications of a bug, ask the tools team for more information.
Note: it is not clear how the two suggestions above would play out, as power KVM is not officially supported and the tools squad does not take care of exotic architectures.
Most likely, if the scenario is unstable, it is better to consider disabling it; we'll see...
- Project changed from openQA Tests to qe-yam
- Category deleted (Bugs in existing tests)
- Tags set to qe-yam-feb-sprint
- Description updated (diff)
- Status changed from New to Workable
- Parent task set to #151816
- Status changed from Workable to In Progress
- Assignee set to leli
From the failed job's serial0.txt:
[ 44.022735][ T3924] block vda: the capability attribute has been deprecated.
[ 46.704481][ T5157] NET: Registered PF_ALG protocol family
[ 51.041195] wickedd-dhcp6[1094]: eth0: DHCPv6 is disabled by IPv6 router RA
[ 52.480523] wickedd-dhcp6[1094]: eth0: DHCPv6 is disabled by IPv6 router RA
[ 52.960305] wickedd-dhcp6[1094]: eth0: DHCPv6 is disabled by IPv6 router RA
[ 56.211725] load.sh[2018]: Starting kdump kernel load; kexec cmdline: /sbin/kexec -p /var/lib/kdump/kernel --append=" plymouth.ignore-serial-consoles console=hvc0 console=tty fadump= mitigations=auto sysrq=yes reset_devices acpi_no_memhotplug cgroup_disable=memory nokaslr numa=off irqpoll maxcpus=1 root=kdump rootflags=bind rd.udev.children-max=8 panic=1" --initrd=/var/lib/kdump/initrd -a
[ 56.242270][ T5413] memfd_create() without MFD_EXEC nor MFD_NOEXEC_SEAL, pid=5413 'kexec'
[ 56.980248] load.sh[2018]: Loaded kdump kernel.
It seems the disk has some issue and kdump was loaded, which may have made the reboot very slow. I will run the test with the new support image to check whether the issue can be reproduced.
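To judge how much time a given step contributes, the bracketed timestamps in the serial log can be compared directly. The following is only a rough sketch, not part of the test code: the helper name `first_timestamp` is hypothetical, the sample lines are copied from the excerpt above with the kexec command line abbreviated, and it assumes each line of interest starts with a timestamp in seconds.

```python
import re

# Matches a leading kernel-style timestamp such as "[ 56.980248]".
TIMESTAMP = re.compile(r"^\[\s*(\d+\.\d+)\]")


def first_timestamp(lines, needle):
    """Return the timestamp in seconds of the first line containing needle, or None."""
    for line in lines:
        if needle in line:
            match = TIMESTAMP.match(line)
            if match:
                return float(match.group(1))
    return None


# Lines taken from the serial log excerpt; the kexec cmdline is abbreviated here.
sample = [
    "[ 44.022735][ T3924] block vda: the capability attribute has been deprecated.",
    "[ 56.211725] load.sh[2018]: Starting kdump kernel load; kexec cmdline: (abbreviated)",
    "[ 56.980248] load.sh[2018]: Loaded kdump kernel.",
]

start = first_timestamp(sample, "Starting kdump kernel load")
done = first_timestamp(sample, "Loaded kdump kernel")

# Elapsed time between the two kdump messages, rounded to milliseconds.
kdump_load_seconds = round(done - start, 3)
print(kdump_load_seconds)
```

Running the same comparison on the full serial log of a passing and a failing job would show where the extra time is actually spent before deciding on a timeout value.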
- Tags changed from qe-yam-feb-sprint to qe-yam-mar-sprint
- Status changed from In Progress to Resolved
Thanks for the investigation, feel free to pick the follow-up ticket or leave it for anyone else. Let's resolve this one.