action #170338

open

No monitoring data from OSD since 2024-11-25 1449Z size:M

Added by okurz 7 days ago. Updated about 1 hour ago.

Status: In Progress
Priority: Normal
Assignee:
Category: Regressions/Crashes
Start date: 2024-11-27
Due date: 2024-12-12 (Due in 8 days)
% Done: 0%
Estimated time:
Description

Observation

https://monitor.qa.suse.de/d/WebuiDb/webui-summary?orgId=1&from=2024-11-25T14:15:24.661Z&to=2024-11-25T14:58:43.878Z&var-host_disks=$__all&refresh=15m

Acceptance criteria

  • AC1: There is current monitoring data from OSD itself on monitor.qa.suse.de
  • AC2: There is also monitoring data after reboots of monitor+OSD

Acceptance tests

Suggestions

  • Handle the IPv4+IPv6 double routing problems that appeared after setting up wireguard tunnels and that also disrupt our monitoring
  • Understand what approach to take for routing with VPN in place and consider both source and target hosts for communication
  • Might need changes to multiple hosts
  • Make changes persistent in salt
  • Ensure reboot consistency
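
To narrow down the IPv4+IPv6 double routing suspicion from the suggestions above, it may help to check reachability per address family between the source and target hosts. The following is only a minimal sketch using the Python standard library: the helper names `addresses_by_family` and `probe` are hypothetical, and the host and port in the usage comment are assumptions, not values taken from the actual monitoring setup.

```python
import socket
from collections import defaultdict

# Map socket address families to readable labels.
FAMILY_NAMES = {socket.AF_INET: "IPv4", socket.AF_INET6: "IPv6"}


def addresses_by_family(addrinfos):
    """Group getaddrinfo() results into {"IPv4": [...], "IPv6": [...]}.

    Duplicate addresses are dropped; families other than IPv4/IPv6 are ignored.
    """
    grouped = defaultdict(list)
    for family, _socktype, _proto, _canonname, sockaddr in addrinfos:
        label = FAMILY_NAMES.get(family)
        if label is not None and sockaddr[0] not in grouped[label]:
            grouped[label].append(sockaddr[0])
    return dict(grouped)


def probe(host, port, timeout=3.0):
    """Try a TCP connect to every resolved address of host.

    Returns {(family_label, address): True/False}, so a host that is
    reachable over only one address family becomes visible.
    """
    results = {}
    infos = socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    for family, socktype, proto, _canonname, sockaddr in infos:
        label = FAMILY_NAMES.get(family)
        if label is None:
            continue
        try:
            with socket.socket(family, socktype, proto) as sock:
                sock.settimeout(timeout)
                sock.connect(sockaddr)
            reachable = True
        except OSError:
            reachable = False
        results[(label, sockaddr[0])] = reachable
    return results


# Example call (host and port are assumptions for illustration):
# print(probe("monitor.qa.suse.de", 443))
```

Running `probe` in both directions, from the monitoring host towards OSD and the reverse, would cover the "both source and target hosts" point; a mismatch between the IPv4 and IPv6 results would hint at a routing problem rather than a service outage.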

Rollback actions


Related issues 3 (1 open, 2 closed)

Related to openQA Infrastructure (public) - action #169564: Configure wireguard tunnels on OSD production hosts needed for openQA located in the NUE2 server room size:S (Resolved, mkittler)

Related to openQA Infrastructure (public) - action #170473: k2.qe.suse.de not reachable from mania:2 size:S (Blocked, okurz, 2024-11-28)

Copied to openQA Infrastructure (public) - action #170494: nginx.service on monitor failed because of: "No such file or directory:calling fopen(/etc/dehydrated/certs/loki.qa.suse.de/fullchain.pem" (Resolved, nicksinger)