Actions
action #181256
opencoordination #161414: [epic] Improved salt based infrastructure management
Easier alert handling by opting out of backup_check for individual hosts
Start date:
Due date:
% Done:
0%
Estimated time:
Tags:
Description
Motivation¶
We have a service backup_check which was introduced in #173674 that calls a script for several hosts. Changing this when one of the hosts is down currently requires changing the script or switching of the whole check.
Acceptance criteria¶
- AC1: Backup checks can be enabled on a per-host basis
Suggestions¶
- Look at the current implementation of the script https://gitlab.suse.de/openqa/salt-states-openqa/-/blob/a529a8769f000ad973cd704951a140fabcb5d1b1/usr/local/bin/backup_check.sh#L5
- Consider using individual timers per host which can be enabled or disabled
- Currently there is one timer that calls the script https://gitlab.suse.de/openqa/salt-states-openqa/-/blob/00903f06197dbb618c09972981983b4e77ab8f26/backup/rsnapshot_generic.sls#L43
- Read a configuration file which lists relevant hosts
- A file in salt could end up being overridden
Updated by livdywan 7 days ago
- Copied from action #173674: qamaster-independent backup size:S added
Updated by nicksinger 7 days ago
created https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/1446 as POC/draft to collect a metric which we could use to alert upon. But it sounded like we don't consider this request useful anyway so I won't implement an alert.
Actions