Project

General

Profile

action #135035

Updated by okurz 12 months ago

## Motivation 
 Multi-machine jobs have been failing since 20230814, because of a misconfiguration of the MTU/GRE tunnels. A workaround 
 has been found in forcing the complete multi-machine tests to run in the same worker. 

 The purpose of this ticket is to have all multi-machine runs be scheduled on the same well-configured worker. 

 The change doesn't need to be permanent but it does need to be applied until proper networking between multi-machine nodes can 
 be guaranteed. 

 ## Acceptance Criteria 
 * **AC1:** If configured accordingly all jobs nodes of a multi-machine parallel cluster job must be scheduled to run on the *same* worker host 
 * **AC2:** By default jobs of a multi-machine parallel cluster can still be scheduled covering multiple different hosts 

 ## Suggestions 
 * Have a look at https://github.com/Martchus/openQA/pull/new/dependency-pinning for how this could be enabled and documented.

Back