action #128969
closed[alert][grafana] Failed systemd services alert (except openqa.suse.de) Salt (Uk02cifVkz)
0%
Description
Observation¶
multiple alert emails received 2023-05-09
https://stats.openqa-monitor.qa.suse.de/alerting/grafana/Uk02cifVkz/view?orgId=1
Updated by livdywan 12 months ago
- Status changed from New to In Progress
- Assignee set to livdywan
Well, let's see if I can find out what to do here. Asking in Slack for now.
Btw since I was asked, here's how I double-checked that these are all Leap 15.4 machines: sudo salt -C '*' cmd.run 'grep PRETTY /etc/os-release'
Updated by openqa_review 12 months ago
- Due date set to 2023-05-24
Setting due date based on mean cycle time of SUSE QE Tools
Updated by livdywan 12 months ago
- Status changed from In Progress to Feedback
cdywan wrote:
Well, let's see if I can find out what to do here. Asking in Slack for now.
Btw since I was asked, here's how I double-checked that these are all Leap 15.4 machines:
sudo salt -C '*' cmd.run 'grep PRETTY /etc/os-release'
Apparently a fix for a regression in libbpfgo was applied, and after restarting the service it's looking to run fine again:
sudo salt -C '*' cmd.run 'systemctl restart velociraptor-client'
storage.oqa.suse.de:
[...]
sudo salt -C '*' cmd.run 'systemctl is-active velociraptor-client'
storage.oqa.suse.de:
active
[...]
Updated by okurz 12 months ago
oh, right. https://stats.openqa-monitor.qa.suse.de/d/KToPYLEWz/failed-systemd-services?viewPanel=6&orgId=1 is good again, resolve then?