Project

General

Profile

action #136154

Updated by tinita 5 months ago

## Observation 

 I started noticing multiple jobs that are MM missing one or more dependencies: 

 Normally this job is a MM one, with two jobs https://openqa.suse.de/tests/12210430 -> should look like https://openqa.suse.de/tests/12207579#dependencies 

 In this case, the RETRY=1 makes the situation worse, causing blocked updates, due to jobs that should not have ever been restarted automatically, see https://openqa.suse.de/tests/12207609 

 ## Suggestions 
 * Find a reproducing scenario with multi-machine clusters using RETRY=1 
 * Create a simple MM cluster locally (maybe within unit tests are by adjusting the local database manually) and invoke the code that is done on an automatic retry (via `RETRY=…`), e.g. in t/10-jobs.t where we already use `RETRY` and take a look into t/05-scheduler-dependencies.t 
 * Only then solve this problem in a mob session since only Marius is currently aware of how to do it

Back