action #20002
closed[tools] openqa sometimes doesn't update job_dependencies table
0%
Description
For multi-machine jobs (caasp and slenkins) openQA time to time doesn't schedule all child jobs by triggering its parent CaaSP-controller or slenkins--control job.
It seems that not all child jobs are running because **there are missing entries for that jobs in job_dependencies SQL table*.
Example of broken job https://openqa.suse.de/tests/1016423 CaaSP-controller (In this case we miss admin node so then the whole test failed)
`# select count(child_job_id) from job_dependencies where parent_job_id=1016423;
count
-------
22
(1 row)`
If you try examine some successful CaaSP-controller job (eg. id=1015418) you should get count=25 (1x controller, 1x admin, 1x master, 22x workers).
I'm not able to reproduce the issue on request but the problem sometimes occurs in my local openqa instance using sqlite and also o.s.d using postgresql. The broken job dependency could be solved by posting iso again.
Maybe it has something to do with scheduler which just skips some db insert queries.
I'm sorry being so brief but I really don't know more.