Actions
action #162374
closedopenQA Project (public) - coordination #112862: [saga][epic] Future ideas for easy multi-machine handling: MM-tests as first-class citizens
openQA Project (public) - coordination #111929: [epic] Stable multi-machine tests covering multiple physical workers
Limit number of OSD PRG2 x86_64 tap multi-machine workers until stabilized
Status:
Resolved
Priority:
Low
Assignee:
Category:
Feature requests
Target version:
Start date:
2024-06-17
Due date:
% Done:
0%
Estimated time:
Tags:
Description
Motivation¶
We (again) have multiple multi-machine issues within OSD, e.g. latest #162320, and our monitoring can not always catch those issues, e.g. if only certain combination of hosts are problematic. Hence we should limit the workers that can run multi-machine tests until we have at least applied some improvements or had stable operations for a longer time.
Acceptance criteria¶
- AC1: Only a limited number of OSD workers run multi-machine tests
Suggestions¶
- Disable all but one host for "x86_64,tap" in https://gitlab.suse.de/openqa/salt-pillars-openqa/-/blob/master/openqa/workerconf.sls until at least #162320 was resolved, more later.
Updated by okurz 6 months ago
- Copied from action #160652: Secondary TAP worker class in different zones size:S added
Updated by okurz 6 months ago
- Related to action #162320: multi-machine test failures 2024-06-14+, auto_review:"ping with packet size 100 failed.*can be GRE tunnel setup issue":retry added
Updated by okurz 6 months ago
- Status changed from In Progress to Feedback
- Target version changed from Ready to Tools - Next
https://gitlab.suse.de/openqa/salt-pillars-openqa/-/merge_requests/843 merged. Monitoring.
Updated by okurz 5 months ago
- Related to action #162293: SMART errors on bootup of worker31, worker32 and worker34 size:M added
Updated by okurz 5 months ago
- Related to deleted (action #162293: SMART errors on bootup of worker31, worker32 and worker34 size:M)
Updated by okurz 4 months ago
- Copied to action #165192: Enable all OSD PRG2 x86_64 machines for multi-machine use again added
Actions