action #20002

closed

[tools] openqa sometimes doesn't update job_dependencies table

Added by thehejik almost 7 years ago. Updated over 6 years ago.

Status: Resolved
Priority: Normal
Assignee: -
Category: Feature requests
Target version: -
Start date: 2017-06-22
Due date:
% Done: 0%
Estimated time:
Description

For multi-machine jobs (CaaSP and slenkins), openQA from time to time doesn't schedule all child jobs when their parent CaaSP-controller or slenkins--control job is triggered.
It seems that not all child jobs are running because **entries for those jobs are missing from the job_dependencies SQL table**.

Example of a broken job: https://openqa.suse.de/tests/1016423 (CaaSP-controller). In this case the admin node is missing, so the whole test failed.

`# select count(child_job_id) from job_dependencies where parent_job_id=1016423;
count 
-------
22
(1 row)`

If you examine a successful CaaSP-controller job (e.g. id=1015418), you should get count=25 (1x controller, 1x admin, 1x master, 22x workers).
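The manual comparison above could probably be automated. Below is a rough sketch, not part of openQA: it runs the same count per parent job and lists parents with fewer children than expected. It assumes a table with only the two columns used in the query above (`parent_job_id`, `child_job_id`); the in-memory SQLite database, the helper name `find_incomplete_parents` and the child job ids are made up for illustration.

```python
import sqlite3


def find_incomplete_parents(conn, expected_children):
    """Return (parent_job_id, child_count) for parents with too few children."""
    return conn.execute(
        """
        SELECT parent_job_id, COUNT(child_job_id) AS child_count
        FROM job_dependencies
        GROUP BY parent_job_id
        HAVING COUNT(child_job_id) < ?
        ORDER BY parent_job_id
        """,
        (expected_children,),
    ).fetchall()


# Toy stand-in for the real table (assumption: only these two columns matter).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE job_dependencies (parent_job_id INTEGER, child_job_id INTEGER)"
)

# Parent ids and counts are taken from this ticket; child ids are invented.
healthy = [(1015418, 2000000 + i) for i in range(25)]
broken = [(1016423, 3000000 + i) for i in range(22)]
conn.executemany("INSERT INTO job_dependencies VALUES (?, ?)", healthy + broken)

incomplete = find_incomplete_parents(conn, 25)
print(incomplete)  # [(1016423, 22)]
```

The expected count (25 here) is specific to this CaaSP scenario and would have to be adjusted for each job group.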

I'm not able to reproduce the issue on demand, but the problem sometimes occurs both on my local openQA instance using SQLite and on o.s.d using PostgreSQL. The broken job dependencies can be fixed by posting the ISO again.

Maybe it has something to do with the scheduler, which might be skipping some DB insert queries.

I'm sorry for being so brief, but I really don't know more.


Related issues 2 (0 open, 2 closed)

Related to openQA Project - action #18684: Jobs with worker class qemu_x86_64 are taken by machines without this class, causing incomplete jobs (Resolved, szarate, 2017-04-20)

Related to openQA Tests - action #20790: [qam] SLE12-SP3 test fails in 1__unknown_ - slenkins-tests-openvpn-control (Rejected, pcervinka, 2017-07-26)
