action #168178: Limit connected online workers based on websocket+scheduler load size:M - openQA Project (public) - openSUSE Project Management Tool

open

coordination #110833: [saga][epic] Scale up: openQA can handle a schedule of 100k jobs with 1k worker instances

coordination #157669: websockets+scheduler improvements to support more online worker instances

Limit connected online workers based on websocket+scheduler load size:M

Added by okurz 5 months ago. Updated 9 days ago.

Status: Workable
Priority: Low
Assignee:
Category: Feature requests
Target version: QA (public) - Tools - Next
Start date:
Due date:
% Done:
Estimated time:

Description

Motivation

With #157690 the number of connected online workers is already limited by a configuration variable. We can extend that to also limit based on the actual websocket+scheduler load, that is, keep the number of workers low enough to ensure proper operation of websocket+scheduler and prevent problems like #157666.

Acceptance criteria

AC1: A clear definition of "websocket+scheduler load" exists
AC2: The number of online workers is limited to min(configured_number, configured_load_limit)
AC3: openQA workers rejected for exceeding the mentioned limit(s) explicitly log that situation or fail
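
A minimal sketch of how AC2 and AC3 could fit together, written in Python purely for illustration (this is not openQA code). All names here (`AdmissionResult`, `effective_worker_limit`, `admit_worker`) are hypothetical, and the sketch assumes `configured_load_limit` is already expressed as a worker count; how that count is derived from the load is exactly what AC1 still has to define.

```python
from dataclasses import dataclass


@dataclass
class AdmissionResult:
    """Outcome of a worker connection attempt (hypothetical type)."""
    accepted: bool
    reason: str


def effective_worker_limit(configured_number: int, configured_load_limit: int) -> int:
    # AC2: the stricter of the two limits applies.
    return min(configured_number, configured_load_limit)


def admit_worker(online_workers: int,
                 configured_number: int,
                 configured_load_limit: int) -> AdmissionResult:
    """Decide whether one more worker may connect.

    AC3: a rejection always carries an explicit reason that the worker
    can log or fail on.
    """
    limit = effective_worker_limit(configured_number, configured_load_limit)
    if online_workers >= limit:
        return AdmissionResult(
            accepted=False,
            reason=f"rejected: {online_workers} workers online, limit is {limit}",
        )
    return AdmissionResult(accepted=True, reason="accepted")
```

For example, with a configured number of 1000 and a load-derived limit of 800, a connection attempt while 800 workers are online would be rejected with a message naming the limit of 800.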

Suggestions

Look into the implementation of #157690 to see how the simple limit was implemented so far
Come up with a definition of the critical websocket+scheduler load based on "overload experiments" which can be used as a metric for the problem seen in #157666
Extend the simple limit with a lookup of said metric and also prevent additional worker connections based on the metric
Also consider disconnecting already connected workers if the metric exceeds the configured threshold
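
The last two suggestions could be sketched roughly as follows, again in Python for illustration only. Everything here is an assumption: the metric is modelled as a single number where higher means more load, `load_threshold` stands in for the configured threshold, and the function names, the `batch_size` parameter and the choice to disconnect the most recently connected workers first are hypothetical, not taken from the ticket.

```python
from typing import List, Sequence


def may_accept_connection(online_workers: int,
                          configured_limit: int,
                          load_metric: float,
                          load_threshold: float) -> bool:
    """Accept a new worker only if both the simple limit and the
    load-based limit leave room."""
    return online_workers < configured_limit and load_metric < load_threshold


def select_workers_to_disconnect(connected_ids: Sequence[int],
                                 load_metric: float,
                                 load_threshold: float,
                                 batch_size: int = 10) -> List[int]:
    """Pick already connected workers to drop once the metric exceeds
    the threshold.

    Assumes connected_ids is ordered by connection time, oldest first,
    so the most recently connected workers are dropped first.
    """
    if load_metric <= load_threshold:
        return []
    count = max(0, min(batch_size, len(connected_ids)))
    return list(connected_ids[len(connected_ids) - count:])
```

Disconnecting in small batches and re-checking the metric afterwards is one way to avoid dropping more workers than needed; whether that is appropriate depends on how quickly the chosen metric reacts.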

Related issues 2 (1 open — 1 closed)


Updated by okurz 5 months ago

Updated by okurz 5 months ago

Updated by okurz 5 months ago

Updated by okurz 5 months ago

Updated by okurz 4 months ago

Updated by okurz 3 months ago

Updated by okurz 24 days ago

Updated by mkittler 9 days ago

Updated by okurz 9 days ago

Updated by okurz 9 days ago