Project

General

Profile

Actions

action #58805

closed

[infra]Severe storage performance issue on openqa.suse.de workers

Added by MDoucha over 4 years ago. Updated about 4 years ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
-
Target version:
-
Start date:
2019-10-29
Due date:
% Done:

0%

Estimated time:

Description

Last week on Thursday, a handful of tests in two LTP testsuites started timing out. I've initially reported it as a kernel performance regression: https://bugzilla.suse.com/show_bug.cgi?id=1155018

However, I've tried to reproduce the problem on a released kernel version which didn't have the issue 3 weeks ago and succeeded: https://openqa.suse.de/tests/overview?build=15ga_mdoucha_bsc_1155018&version=15&distri=sle

This successful reproduction on a known good kernel indicates that the problem is somewhere in OpenQA infrastructure, possibly a bug introduced during the weekly deployment on Wednesday, October 23rd. The timeout continues to appear in kernel-of-the-day LTP tests: https://openqa.suse.de/tests/3533819#step/DOR000/7

Both PPC64LE and x86_64 are affected. Reproducibility on aarch64 and s390 is currently unknown because we don't run the affected testsuites on those two platforms. The failing tests mostly belong to the async & direct I/O stress testsuite.


Related issues 2 (0 open2 closed)

Related to openQA Infrastructure - action #20914: [tools] configure vm settings for workers with rotating discsResolvedokurz2017-07-282019-11-05

Actions
Related to openQA Project - coordination #64746: [saga][epic] Scale up: Efficient handling of large storage to be able to run current tests efficiently but keep big archives of old resultsResolvedokurz2020-03-18

Actions
Actions

Also available in: Atom PDF