Project

General

Profile

Actions

action #97412

open

openQA Project - coordination #103944: [saga][epic] Scale up: More robust handling of diverse infrastructure with varying performance

openQA Project - coordination #98463: [epic] Avoid too slow asset downloads leading to jobs exceeding the timeout with or run into auto_review:"(timeout: setup exceeded MAX_SETUP_TIME|Cache service queue already full)":retry

Reduce I/O load on OSD by using more cache size on workers with using free disk space when available instead of hardcoded space

Added by okurz over 2 years ago. Updated over 2 years ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
Start date:
Due date:
% Done:

0%

Estimated time:

Description

Motivation

I am sure we could spread the load on OSD when many job start a bit if we manage to have less assets that need to be downloaded at the same time. We can increase the cache size, e.g. use all available space instead of the artifical limits of the cache directory.

The worker cache could just ensure a certain percentage of free disk space in the file system.

Acceptance criteria

  • AC1: At least most workers on OSD use all available free disk space except for a configured ratio to keep free

Related issues 1 (0 open1 closed)

Copied from openQA Infrastructure - action #96554: Mitigate on-going disk I/O alerts size:MResolvedmkittler2021-08-04

Actions
Actions #1

Updated by okurz over 2 years ago

  • Copied from action #96554: Mitigate on-going disk I/O alerts size:M added
Actions #2

Updated by okurz over 2 years ago

  • Target version changed from Ready to future
Actions #3

Updated by mkittler over 2 years ago

  • Parent task changed from #96447 to #98463

#96447 hasn't a very meaningful ticket description, so I'm replacing the parent ticket with #98463.

Actions

Also available in: Atom PDF