action #134018

closed

[alert] Multiple alerts with "used > 80%" size:S

Added by livdywan 9 months ago. Updated 9 months ago.

Status: Resolved
Priority: Normal
Assignee:
Category: -
Target version:
Start date:
Due date:
% Done: 0%
Estimated time:
Tags:

Description

Observation

There are 3 alerts about disk space running low at the time of writing:

Problem name: /assets: Disk space is low (used > 80%)
Host: ariel.dmz-prg2.suse.org (over old-ariel)
Severity: Warning
Operational data: Space used: 4.48 TB of 5.6 TB (80 %)

Problem name: /space/openqa/share: Disk space is low (used > 80%)
Host: ariel.dmz-prg2.suse.org (over old-ariel)
Operational data: Space used: 4.48 TB of 5.6 TB (80 %)

Problem name: /var/lib/openqa/share: Disk space is low (used > 80%)
Host: ariel.dmz-prg2.suse.org (over old-ariel)
Severity: Warning
Operational data: Space used: 4.48 TB of 5.6 TB (80 %)

Acceptance Criteria

  • AC1: Alerts are not seen unless disk usage is > 90%
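
A minimal sketch of what AC1 asks for, written as a plain threshold check. The function name and the default values are illustrative assumptions: 90% comes from AC1, and 94% is the critical value mentioned later in this ticket. This is not how Zabbix evaluates its triggers internally.

```python
def classify_usage(used_tb: float, total_tb: float,
                   warn_pct: float = 90.0, crit_pct: float = 94.0) -> str:
    """Classify disk usage as 'ok', 'warning' or 'critical'.

    warn_pct and crit_pct are assumed defaults (90% per AC1,
    94% per the later comment in this ticket).
    """
    if total_tb <= 0:
        raise ValueError("total_tb must be positive")
    used_pct = used_tb / total_tb * 100
    if used_pct > crit_pct:
        return "critical"
    if used_pct > warn_pct:
        return "warning"
    return "ok"


# The values reported in the alerts above: 4.48 TB used of 5.6 TB.
print(classify_usage(4.48, 5.6))
```

With the thresholds raised as AC1 requests, the reported usage of 4.48 TB of 5.6 TB no longer triggers a warning.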

Suggestions

Actions #1

Updated by okurz 9 months ago

The alerts should use the limits that were defined for the old icinga/nagios instance. I am sure they are still somewhere in the corresponding infra GitLab repo.

Actions #2

Updated by okurz 9 months ago

  • Due date set to 2023-09-01
  • Status changed from New to Feedback
  • Assignee set to okurz
  • Priority changed from High to Normal

I think I was able to update the threshold to 90%. In https://zabbix.nue.suse.com/zabbix.php?action=host.list I clicked on the first entry for new-ariel, then "Macros", then "Inherited and host macros", looked up the setting, clicked "Change", entered the new value and clicked "Update". I did the equivalent for the "critical" setting, raising it from 90% to 94%.
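
The same change could probably be scripted against the Zabbix JSON-RPC API instead of clicking through the web UI. The sketch below only builds the request body for `usermacro.create`, which adds a host-level macro that overrides an inherited one. Several things here are assumptions: the macro names `{$VFS.FS.PUSED.MAX.WARN}` and `{$VFS.FS.PUSED.MAX.CRIT}` are the ones used by the stock Linux templates and are not named in this ticket, the host ID is a placeholder, and the helper `build_macro_override` is hypothetical. Authentication and sending the request are left out because they depend on the Zabbix version.

```python
import json


def build_macro_override(hostid: str, macro: str, value: str,
                         request_id: int = 1) -> dict:
    """Build a JSON-RPC body for the Zabbix `usermacro.create` method.

    Hypothetical helper: it only assembles the payload and does not
    contact any server.
    """
    return {
        "jsonrpc": "2.0",
        "method": "usermacro.create",
        "params": {"hostid": hostid, "macro": macro, "value": value},
        "id": request_id,
    }


# Placeholder host ID; macro names are assumed, not taken from the ticket.
HOST_ID = "12345"
requests_to_send = [
    build_macro_override(HOST_ID, "{$VFS.FS.PUSED.MAX.WARN}", "90", 1),
    build_macro_override(HOST_ID, "{$VFS.FS.PUSED.MAX.CRIT}", "94", 2),
]
for body in requests_to_send:
    print(json.dumps(body))
```

If a host-level macro with that name already exists, `usermacro.update` with the macro's `hostmacroid` would be the matching call instead.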

Actions #3

Updated by livdywan 9 months ago

  • Subject changed from [alert] Multiple alerts with "used > 80%" to [alert] Multiple alerts with "used > 80%" size:S
  • Description updated (diff)

Actions #4

Updated by okurz 9 months ago

  • Due date deleted (2023-09-01)
  • Status changed from Feedback to Resolved

I checked again today at https://zabbix.nue.suse.com/zabbix.php?action=host.edit&hostid=10923: the settings are still in place and there are no disk space alerts, so we should be good.
