Project

General

Profile

Actions

action #135380

closed

openQA Project - coordination #110833: [saga][epic] Scale up: openQA can handle a schedule of 100k jobs with 1k worker instances

openQA Project - coordination #135122: [epic] OSD openQA refuses to assign jobs, >3k scheduled not being picked up, no alert

A significant number of scheduled jobs with one or two running triggers an alert

Added by livdywan 10 months ago. Updated 10 months ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Target version:
Start date:
2023-09-07
Due date:
% Done:

0%

Estimated time:

Description

Motivation

#135122 was discovered due to user feedback rather than alert handling. We should ensure we either adjust existing alerts accordingly or add one that would be able to discover this issue early.

Acceptance criteria

  • AC1: A significant number of scheduled jobs with one or two running triggers an alert

Suggestions


Files

wakeup-scheduler.ods (16.1 KB) wakeup-scheduler.ods tinita, 2023-09-08 10:42

Related issues 2 (0 open2 closed)

Related to openQA Infrastructure - action #135632: "Mojo::File::spurt is deprecated in favor of Mojo::File::spew" breaking os-autoinst OBS build and osd-deployment size:MResolvedokurz2023-05-08

Actions
Copied to openQA Infrastructure - action #135578: Long job age and jobs not executed for long size:MResolvednicksinger

Actions
Actions

Also available in: Atom PDF