Actions
action #134018
closed[alert] Multiple alerts with "used > 80%" size:S
Start date:
Due date:
% Done:
0%
Estimated time:
Tags:
Description
Observation¶
There's 3 alerts about disk space running out at the time of this writing:
Problem name: /assets: Disk space is low (used > 80%)
Host: ariel.dmz-prg2.suse.org (over old-ariel)
Severity: Warning
Operational data: Space used: 4.48 TB of 5.6 TB (80 %)
Problem name: /space/openqa/share: Disk space is low (used > 80%)
Host: ariel.dmz-prg2.suse.org (over old-ariel)
Operational data: Space used: 4.48 TB of 5.6 TB (80 %)
Problem name: /var/lib/openqa/share: Disk space is low (used > 80%)
Host: ariel.dmz-prg2.suse.org (over old-ariel)
Severity: Warning
Operational data: Space used: 4.48 TB of 5.6 TB (80 %)
Acceptance Criteria¶
- AC1: Alerts are not seen unless disk usage is > 90%
Suggestions¶
- ~Group the alerts similar to what was done in #133130~
- Click somewhere in https://zabbix.nue.suse.com/zabbix.php?action=host.list to configure the alert thresholds
- This should be the relevant items for the relevant host: https://zabbix.suse.de/items.php?filter_set=1&filter_hostids%5B0%5D=10923&context=host
- Check the value under "Macros" i.e. {$VFS.FS.PUSED.MAX.WARN} and {$VFS.FS.PUSED.MAX.CRIT}
Updated by okurz about 1 year ago
The alerts should use the limits that were defined for the old icinga/nagios instance. I am sure they are still somewhere in the according infra gitlab repo
Updated by okurz about 1 year ago
- Due date set to 2023-09-01
- Status changed from New to Feedback
- Assignee set to okurz
- Priority changed from High to Normal
I think I could update the threshold to 90%. In https://zabbix.nue.suse.com/zabbix.php?action=host.list I clicked on the first entry for new-ariel, then "Macros", then "Inherited and host macros", look up the setting, click "Change", enter new value and "Update". Did the equivalent for "critical" setting 90->94%
Updated by livdywan about 1 year ago
- Subject changed from [alert] Multiple alerts with "used > 80%" to [alert] Multiple alerts with "used > 80%" size:S
- Description updated (diff)
Updated by okurz about 1 year ago
- Due date deleted (
2023-09-01) - Status changed from Feedback to Resolved
I checked today again on https://zabbix.nue.suse.com/zabbix.php?action=host.edit&hostid=10923 and we still have those settings and have no space used alerts so we should be good.
Actions