Project

General

Profile

Actions

action #135773

closed

[tools] many multi-machine test failures in "ovs-client+ovs-server" test scenario when tests are run across different workers size:M

Added by okurz over 1 year ago. Updated about 1 year ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
-
Start date:
2023-08-15
Due date:
2023-10-07
% Done:

0%

Estimated time:
Tags:

Description

Observation

See #134282-1

There is something wrong with multimachine network when tests are run across different workers. If is multimachine job forced to run on same worker, it is fine.

There are fails in core group: https://openqa.suse.de/tests/11843205#next_previous
Kernel group: https://openqa.suse.de/tests/11846943#next_previous
HPC: https://openqa.suse.de/tests/11845897#next_previous

The scenario is https://openqa.suse.de/tests/latest?arch=x86_64&distri=sle&flavor=Server-DVD-Updates&machine=64bit&test=ovs-client&version=15-SP5

Acceptance criteria

  • AC1: The "ovs-client+ovs-server" test scenario passes consistently when running on multiple OSD workers with "tap" class

Suggestions

Out of scope

  • Anything that already fails when the multi-machine cluster runs on a single physical host
  • #135035 "Pin multimachine jobs to a single worker"
  • Any other test than "ovs-client+server"
  • Try to minimize the reproducer, e.g. skip test modules in openQA -> #135818

Workaround

Pin to a single physical machine


Related issues 1 (0 open1 closed)

Copied from openQA Infrastructure (public) - action #134282: [tools] network protocols failures on multimachine tests on HA/SAP size:S auto_review:"no candidate.*iscsi-target-overview-service-tab|yast2.+firewall.+services.+add.+zone":retryResolvednicksinger2023-08-15

Actions
Actions

Also available in: Atom PDF