Project

General

Profile

Actions

action #129244

closed

[alert][grafana] File systems alert for WebUI /results size:M

Added by jbaier_cz over 1 year ago. Updated over 1 year ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
-
Start date:
2023-05-12
Due date:
2023-05-30
% Done:

0%

Estimated time:
Tags:

Description

Observation

Observed at 2023-05-12 17:23:00 +0200 CEST
One of the file systems /results is too full (> 90%)
See http://stats.openqa-monitor.qa.suse.de/d/WebuiDb?orgId=1&viewPanel=74

https://monitor.qa.suse.de/d/WebuiDb/webui-summary?orgId=1&viewPanel=74&from=1683650195801&to=1684141123756 for one recent instance where /results was exceeding the threshold and coming back below the threshold shortly.

Current usage:

Filesystem      Size  Used Avail Use% Mounted on
/dev/vdd        7.0T  6.3T  782G  90% /results

Acceptance criteria

AC1: There is enough space and headroom on the affected file system /results, i.e. considerably more free than 20%

Suggestions

  • Check used space and evolution over time in https://monitor.qa.suse.de/d/nRDab3Jiz/openqa-jobs-test?orgId=1&viewPanel=19 , in particular check "Development Security" which looks too big in too short a time
  • Check job group results retention settings for "not-important" results
  • Crosscheck the use of "archiving": openQA should move "important" results to /results/archive on a separate storage device

Files


Related issues 3 (0 open3 closed)

Related to openQA Project (public) - action #129412: Verify cleanup behavior of groupless job resultsResolvedmkittler2023-05-162023-05-31

Actions
Related to openQA Infrastructure (public) - coordination #68923: [epic] Use external videoencoder in production auto_review:"External encoder not accepting data"Resolvedokurz2020-11-13

Actions
Copied to openQA Infrastructure (public) - action #164979: [alert][grafana] File systems alert for WebUI /results size:SResolvedmkittler2024-08-21

Actions
Actions

Also available in: Atom PDF