Project

General

Profile

Actions

action #164988

closed

coordination #92323: [saga][epic] Scale up: Fine-grained control over use and removal of results, assets, test data

coordination #179888: [epic] Creating, tracking, accounting "supporting jobs"

Better accounting for openqa-investigation jobs size:S

Added by okurz 8 months ago. Updated about 4 hours ago.

Status:
Resolved
Priority:
Low
Assignee:
Category:
Feature requests
Target version:
Start date:
2024-08-06
Due date:
% Done:

0%

Estimated time:

Description

Motivation

#164979 alerted us about /results being nearly full. We found that groupless jobs are now the biggest offender linked to heavy jobs failing often triggering also heavy openqa-investigate jobs.

Acceptance criteria

  • AC1: Big investigation jobs will not fill up our disk space; we would instead just keep less of them.

Suggestions

  • Count investigation jobs towards the group of the original job
    • Investigation jobs are groupless to avoid being considered for the result of the according group
    • It is probably also not wanted by users; investigation jobs should not cause normal jobs to be stored less long but still be kept for a short time.
    • The way the cleanup algorithm currently works makes this also hard to implement. It goes though jobs group by group and factoring in groupless jobs here without good relations in the database is not straight forward / efficient.
  • Use a dedicated group for all investigation jobs
    • Sounds most promising - just create a new group and schedule investigation jobs to be part of it.
    • There is a caveat: Having all investigation jobs in one group does not solve the problem that investigation jobs for a particular scenario become very big. If we put everything in one group one scenario might cause other investigation jobs to be stored only very shortly.

Related issues 3 (2 open1 closed)

Related to openQA Project (public) - coordination #179221: [epic] Support keeping only jobs in database, remove all logs, assets, test results soonerNew2025-03-19

Actions
Copied from openQA Infrastructure (public) - action #164979: [alert][grafana] File systems alert for WebUI /results size:SResolvedmkittler2024-08-21

Actions
Copied to openQA Project (public) - action #179894: [spike][timeboxed:10h] Count assets+results of openqa-investigate jobs towards the originating groupNew

Actions
Actions

Also available in: Atom PDF