Project

General

Profile

Actions

action #174556

closed

openqa.suse.de cannot ping diesel.qe.nue2.suse.org via IPv4 - source based routing missing

Added by nicksinger about 18 hours ago. Updated about 18 hours ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
Regressions/Crashes
Start date:
2024-12-18
Due date:
% Done:

0%

Estimated time:

Description

Motivation

Because of https://progress.opensuse.org/issues/174550 I looked into the pending "host up alert" for diesel. Apparently OSD cannot ping the machine. Quickly checking it seems the "source based routing" from https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/1316 / https://progress.opensuse.org/issues/170338

why is it missing only on that machine?

Acceptance criteria

  • AC1: openqa.suse.de can ping diesel.qe.nue2.suse.org via IPv4

Rollback steps


Related issues 1 (0 open1 closed)

Copied from openQA Infrastructure (public) - action #174550: grafana silence linking to #164853 but alert is about diesel?Resolvedgpathak2024-12-18

Actions
Actions #1

Updated by nicksinger about 18 hours ago

  • Copied from action #174550: grafana silence linking to #164853 but alert is about diesel? added
Actions #2

Updated by okurz about 18 hours ago

  • Parent task set to #166598
Actions #3

Updated by nicksinger about 18 hours ago

  • Status changed from In Progress to Feedback

apparently the system just never rebooted since my changes and I forgot to restart networking manually. After doing a systemctl restart network OSD can ping the machine. Grafana looks good as well. Just to be sure I checked our auto-updates and auto-reboots are enabled on diesel but everything looks fine as well:

diesel:/etc/sysconfig/network # rebootmgrctl is-active
RebootMgr is active
diesel:/etc/sysconfig/network # rebootmgrctl status
Status: Reboot not requested
diesel:/etc/sysconfig/network # systemctl status auto-update.timer
● auto-update.timer - Nightly trigger of auto-update.
     Loaded: loaded (/etc/systemd/system/auto-update.timer; enabled; vendor preset: disabled)
     Active: active (waiting) since Tue 2024-11-26 10:14:17 UTC; 3 weeks 1 day ago
    Trigger: Thu 2024-12-19 02:02:07 UTC; 12h left
   Triggers: ● auto-update.service

next alert eval is in 3 mins then I can remove the silence and resolve the ticket.

Actions #4

Updated by nicksinger about 18 hours ago

  • Status changed from Feedback to Resolved

Silence removed, alert not firing so we're golden.

Actions

Also available in: Atom PDF