coordination #32851
closed[tools][EPIC] Scheduling redesign
100%
Description
Currently we are scheduling using DB with specific crafted query with the ORM - which is a consuming process both in terms of CPU and memory, even refining furthermore the query could be in long-term a dead end, and being problematic as we might have more requirements from it.
This ticket is meant just as a tracker to group refactorization/enhancements, redesign proposals.
Updated by EDiGiacinto over 6 years ago
My 2c With regards to replacing DB, and doing it in memory - if AMQP is not a way to go (so, that means also dispatching jobs over ws would be replaced) - i would explore the possibility to switch to a SAT solving mechanism instead, avoiding to hard-code condition ourselves in the future. As i see it, we can re-formulate our problem as conditions that can be nicely expressed in CNF.
Updated by coolo over 6 years ago
Don't! The problem is way too simple for such a monster solution
Updated by dasantiago over 6 years ago
- Related to action #20812: Jobs will be assigned to workers with wrong arch unless WORKER_CLASS is set somewhere added
Updated by EDiGiacinto over 6 years ago
- Related to action #25970: Profile/Optimize _workers_checker in WebSockets server added
Updated by EDiGiacinto over 6 years ago
- Related to action #28714: [tools] Investigate why sporadically job is set to scalar value of the reference instead of the reference itself. added
Updated by EDiGiacinto over 6 years ago
coolo wrote:
Don't! The problem is way too simple for such a monster solution
Well, it seems growing in complexity now, so maybe a simple solution is not enough anymore - and it might actually help slim the logic, as we could infer CNFs from job settings.
Not saying that is the road to hit - just worth mentioning the possibilities.
Updated by EDiGiacinto over 6 years ago
- Related to action #31069: Job life cycle not always covered by events added
Updated by AdamWill over 6 years ago
Updated by EDiGiacinto over 6 years ago
- Related to action #25124: [tools][sprint 201709.1] Workers disconnects from websocket server and getting stuck: job shows as 'State: assigned' forever added
Updated by EDiGiacinto over 6 years ago
- Related to action #35296: Error messages on worker about "Use of uninitialized value $host in hash element at /usr/share/openqa/script/../lib/OpenQA/Worker/Common.pm line 359, <GEN298662> line 4." added
Updated by szarate over 6 years ago
- Follows action #35914: Changes to Job::duplicate added
Updated by EDiGiacinto over 6 years ago
- Related to action #36727: job_grab does not cope with parallel cycles added
Updated by okurz about 5 years ago
- Category changed from 122 to Feature requests
Updated by okurz over 4 years ago
- Status changed from New to Resolved
- Assignee set to okurz
The one open subtask #12876 is still a valid feature request but I doubt we need this top-level tracker as it does not provide more details.
Updated by szarate almost 4 years ago
- Tracker changed from action to coordination
Updated by szarate almost 4 years ago
See for the reason of tracker change: http://mailman.suse.de/mailman/private/qa-sle/2020-October/002722.html