coordination #155485
open[saga][epic] Efficient openQA worker pool resource handling in datacenters
25%
Description
Motivation¶
Based on a bootstrapping discussions between lvogdt+mgriessmeier+okurz and my general goal to make efficient use of computing resources.
Our openQA instances especially openqa.suse.de but also openqa.opensuse.org hold ready ressources as in physical machines running openQA worker instances for different architectures and worker classes. There are enough resources put in place so that builds for products can finish testing in reasonable time, e.g. some hours for a new Tumbleweed snapshot. But in between builds often there are idle resources, e.g. idle x86_64 openQA workers. Both in on-premise datacenters as well as public cloud resources can be reassigned for other purposes. So we should teach our applications how to only request resources as needed, run necessary workloads from scratch when machines are dynamically switched and used for other purposes and give back resources to a resource pool if the workload schedules allow.