action #49535: Improve time to schedule a build - openQA Project (public) - openSUSE Project Management Tool

Actions

Copy link

action #49535

closed

Improve time to schedule a build

Added by coolo almost 6 years ago. Updated over 5 years ago.

Status:

Resolved

Priority:

High

Assignee:

mkittler

Category:

Feature requests

Target version:

Current Sprint

Start date:

2019-03-21

Due date:

% Done:

Estimated time:

Description

currently scheduling a SP1@x86_64 build takes > 180s, which was up to now our apache limit.

This really should be reduced a lot. Like all the cancelling/deprio logic could be moved to a minion job - just as the calculcation of blocked_by state I guess.

Related issues 1 (0 open — 1 closed)

Actions

Copy link

Updated by mkittler almost 6 years ago

Assignee set to mkittler
Target version changed from Ready to Current Sprint

I guess this is more important than refactoring the worker code.

Actions

Copy link

Updated by coolo almost 6 years ago

The blocked by calculation is touchy btw - we would need to make it three state. Currently it's blocked_by an id or NULL - which means the scheduler can pick it. Or we introduce yet another job state :)

Actions

Copy link

Updated by mkittler almost 6 years ago

I'm only wondering why we do the blocked_by calculation currently twice. One time directly after creating a job via create_from_settings and then again for each job after dealing with cycles, wrong parents, ...

If the last recalculation is required because it adds blocked-by IDs which couldn't be assigned in the first place, wouldn't that allow the scheduler to pick jobs accidentally (in the small period of time between the job creation and the final blocked-by calculation)? It that what you mean by touchy? With the right isolation level for the transaction that problem shouldn't occur, actually. Even if I remove the blocked-by calculation within create_from_settings and only do it in the end. (Everything is done in one transaction.)

Yet another job state to be sure we don't schedule jobs in an inconsistent state would make sense in general. Maybe SCHEDULING or ADDED? Or we just disable the scheduler (somehow) while job creation via posting an ISO is ongoing?

Actions

Copy link

Updated by mkittler almost 6 years ago

Likely another job state is the best. Then we don't need to care about the transaction isolation level and can do the blocked-by calculation only in a Minion job.

Actions

Copy link

Updated by coolo almost 6 years ago

Marius and me discussed this in detail - and I propose to make 'Scheduled Product' a high level DBIx class. And we create a new API that creates that and schedule a minion task to do the actual scheduling. So you can poll the status of the Scheduled Product - and it can be 'scheduling' or 'scheduled' and if it's scheduled you can query the errors it created.

This would make actually a nice addition as currently the scheduled products are extracted from audit log and the errors just appear in some log file. Plus it solves the timeout problem - clients simply poll if they care (or use the old API :)

Actions

Copy link

Updated by mkittler almost 6 years ago

Status changed from New to In Progress

WIP-PR: https://github.com/os-autoinst/openQA/pull/2036

Actions

Copy link

Updated by mkittler over 5 years ago

The PR is ready to merge. Further UI tweaks like filtering for failed scheduled products or showing all jobs related to one scheduled product can still be done.

Actions

Copy link