Project

General

Profile

Actions

action #162374

closed

openQA Project (public) - coordination #112862: [saga][epic] Future ideas for easy multi-machine handling: MM-tests as first-class citizens

openQA Project (public) - coordination #111929: [epic] Stable multi-machine tests covering multiple physical workers

Limit number of OSD PRG2 x86_64 tap multi-machine workers until stabilized

Added by okurz 6 months ago. Updated 4 months ago.

Status:
Resolved
Priority:
Low
Assignee:
Category:
Feature requests
Start date:
2024-06-17
Due date:
% Done:

0%

Estimated time:

Description

Motivation

We (again) have multiple multi-machine issues within OSD, e.g. latest #162320, and our monitoring can not always catch those issues, e.g. if only certain combination of hosts are problematic. Hence we should limit the workers that can run multi-machine tests until we have at least applied some improvements or had stable operations for a longer time.

Acceptance criteria

  • AC1: Only a limited number of OSD workers run multi-machine tests

Suggestions


Related issues 3 (1 open2 closed)

Related to openQA Project (public) - action #162320: multi-machine test failures 2024-06-14+, auto_review:"ping with packet size 100 failed.*can be GRE tunnel setup issue":retryResolvedokurz2024-06-15

Actions
Copied from openQA Infrastructure (public) - action #160652: Secondary TAP worker class in different zones size:SResolvedybonatakis

Actions
Copied to openQA Infrastructure (public) - action #165192: Enable all OSD PRG2 x86_64 machines for multi-machine use againNew

Actions
Actions #1

Updated by okurz 6 months ago

  • Copied from action #160652: Secondary TAP worker class in different zones size:S added
Actions #2

Updated by okurz 6 months ago

  • Related to action #162320: multi-machine test failures 2024-06-14+, auto_review:"ping with packet size 100 failed.*can be GRE tunnel setup issue":retry added
Actions #3

Updated by okurz 6 months ago

  • Status changed from In Progress to Feedback
  • Target version changed from Ready to Tools - Next
Actions #4

Updated by okurz 5 months ago

  • Related to action #162293: SMART errors on bootup of worker31, worker32 and worker34 size:M added
Actions #5

Updated by okurz 5 months ago

  • Related to deleted (action #162293: SMART errors on bootup of worker31, worker32 and worker34 size:M)
Actions #6

Updated by okurz 4 months ago

  • Copied to action #165192: Enable all OSD PRG2 x86_64 machines for multi-machine use again added
Actions #7

Updated by okurz 4 months ago

  • Status changed from Feedback to Resolved
  • Target version changed from Tools - Next to Ready

Created #165192 for re-enablement

Actions

Also available in: Atom PDF