action #174556
closed
openqa.suse.de cannot ping diesel.qe.nue2.suse.org via IPv4 - source based routing missing
0%
Description
Motivation
Because of https://progress.opensuse.org/issues/174550 I looked into the pending "host up alert" for diesel. Apparently OSD cannot ping the machine. From a quick check it seems the "source based routing" from https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/1316 / https://progress.opensuse.org/issues/170338 is missing.
Why is it missing only on that machine?
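One way to look for source based routing on a host is to list the IPv4 policy routing rules and keep only those that match on a specific source address. The following is a minimal sketch, assuming the routing is implemented with ip rule entries, which this ticket does not confirm. The helper name list_source_rules is hypothetical and not taken from the merge request.

```shell
# Hypothetical helper, not taken from the merge request: prints every IPv4
# policy routing rule that selects on a specific source address.
# Reads "ip -4 rule show" style output on stdin, for example:
#   32765:  from 192.0.2.10 lookup 100
list_source_rules() {
    # Field 2 is "from" and field 3 is the selector. "all" matches the
    # kernel's default rules (local, main, default), which are skipped.
    awk '$2 == "from" && $3 != "all"'
}

# Usage on the affected host (assumes iproute2 is installed):
#   ip -4 rule show | list_source_rules
```

Empty output would suggest that no source specific rule is active, which could then be compared against a machine where the ping works.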
Acceptance criteria
- AC1: openqa.suse.de can ping diesel.qe.nue2.suse.org via IPv4
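AC1 could be verified with a check along these lines, run on openqa.suse.de. This is only a sketch: the helper name check_ipv4_ping is hypothetical, and the probe count and timeout are arbitrary choices, assuming the iputils version of ping.

```shell
# Hypothetical helper: succeeds if the host answers IPv4 echo requests.
check_ipv4_ping() {
    host="$1"
    # -4 forces IPv4, -c 3 sends three probes,
    # -W 2 waits up to 2 seconds for each reply (iputils ping).
    ping -4 -c 3 -W 2 "$host" >/dev/null 2>&1
}

# Usage on openqa.suse.de:
#   check_ipv4_ping diesel.qe.nue2.suse.org && echo "AC1 fulfilled"
```

Forcing IPv4 matters here because a plain ping may pick IPv6 and succeed even while the IPv4 path is still broken.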
Rollback steps
- un-silence https://monitor.qa.suse.de/alerting/silences?alertmanager=grafana "rule_uid=~host_up_alert.*"
Updated by nicksinger about 21 hours ago
- Copied from action #174550: grafana silence linking to #164853 but alert is about diesel? added
Updated by nicksinger about 21 hours ago
- Status changed from In Progress to Feedback
Apparently the system had just never rebooted since my changes and I forgot to restart networking manually. After running systemctl restart network, OSD can ping the machine and Grafana looks good as well. Just to be sure, I checked that our auto-updates and auto-reboots are enabled on diesel, and everything looks fine there as well:
diesel:/etc/sysconfig/network # rebootmgrctl is-active
RebootMgr is active
diesel:/etc/sysconfig/network # rebootmgrctl status
Status: Reboot not requested
diesel:/etc/sysconfig/network # systemctl status auto-update.timer
● auto-update.timer - Nightly trigger of auto-update.
Loaded: loaded (/etc/systemd/system/auto-update.timer; enabled; vendor preset: disabled)
Active: active (waiting) since Tue 2024-11-26 10:14:17 UTC; 3 weeks 1 day ago
Trigger: Thu 2024-12-19 02:02:07 UTC; 12h left
Triggers: ● auto-update.service
The next alert evaluation is in 3 minutes; after that I can remove the silence and resolve the ticket.
Updated by nicksinger about 21 hours ago
- Status changed from Feedback to Resolved
Silence removed, alert not firing so we're golden.