action #58805
closed[infra]Severe storage performance issue on openqa.suse.de workers
0%
Description
Last week on Thursday, a handful of tests in two LTP testsuites started timing out. I've initially reported it as a kernel performance regression: https://bugzilla.suse.com/show_bug.cgi?id=1155018
However, I've tried to reproduce the problem on a released kernel version which didn't have the issue 3 weeks ago and succeeded: https://openqa.suse.de/tests/overview?build=15ga_mdoucha_bsc_1155018&version=15&distri=sle
This successful reproduction on a known good kernel indicates that the problem is somewhere in OpenQA infrastructure, possibly a bug introduced during the weekly deployment on Wednesday, October 23rd. The timeout continues to appear in kernel-of-the-day LTP tests: https://openqa.suse.de/tests/3533819#step/DOR000/7
Both PPC64LE and x86_64 are affected. Reproducibility on aarch64 and s390 is currently unknown because we don't run the affected testsuites on those two platforms. The failing tests mostly belong to the async & direct I/O stress testsuite.