action #159270: openqaworker-arm-1 is Unreachable size:S - openQA Infrastructure - openSUSE Project Management Tool


closed

QA - coordination #121720: [saga][epic] Migration to QE setup in PRG2+NUE3 while ensuring availability

QA - coordination #129280: [epic] Move from SUSE NUE1 (Maxtorhof) to new NBG Datacenters

openqaworker-arm-1 is Unreachable size:S

Added by ybonatakis 7 months ago. Updated 6 months ago.

Status:

Resolved

Priority:

High

Assignee:

ybonatakis

Category:

Regressions/Crashes

Target version:

openQA Project - Ready

Start date:

2024-04-19


Tags:

alert, infra

Description

Observation¶

❯ ping openqaworker-arm-1.qe.nue2.suse.org
PING openqaworker-arm-1.qe.nue2.suse.org (10.168.192.213) 56(84) bytes of data.
From 81.95.8.245 icmp_seq=1 Destination Host Unreachable
From 81.95.8.245 icmp_seq=2 Destination Host Unreachable
From 81.95.8.245 icmp_seq=3 Destination Host Unreachable

The graph shows that the host went down at 2024-04-18 15:32:00.
I think the most relevant graph is https://stats.openqa-monitor.qa.suse.de/d/WDopenqaworker-arm-1/worker-dashboard-openqaworker-arm-1?orgId=1&from=now-12h&to=now&viewPanel=65113
The "QA network infrastructure packet loss" panel shows 100% packet loss for walter1.qe.nue2.suse.org at 2024-04-18 15:19:00.
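
For anyone who wants to classify ping output like the one above in a script, here is a minimal sketch. The function name `summarize_ping` and the returned keys are hypothetical and not part of any openQA tooling; the sample text is copied from the observation above, and the reply pattern assumes the usual Linux iputils output format.

```python
import re


def summarize_ping(output: str) -> dict:
    """Count echo replies and 'Destination Host Unreachable' errors in ping output."""
    replies = 0
    unreachable = 0
    for line in output.splitlines():
        if "Destination Host Unreachable" in line:
            unreachable += 1
        # A successful reply looks like: "64 bytes from <addr>: icmp_seq=1 ttl=64 time=0.5 ms"
        elif re.search(r"bytes from .*icmp_seq=\d+", line):
            replies += 1
    # The host counts as up only if at least one reply came back.
    return {"replies": replies, "unreachable": unreachable, "host_up": replies > 0}


# Output pasted from the observation in this ticket.
SAMPLE = """PING openqaworker-arm-1.qe.nue2.suse.org (10.168.192.213) 56(84) bytes of data.
From 81.95.8.245 icmp_seq=1 Destination Host Unreachable
From 81.95.8.245 icmp_seq=2 Destination Host Unreachable
From 81.95.8.245 icmp_seq=3 Destination Host Unreachable"""

print(summarize_ping(SAMPLE))
# → {'replies': 0, 'unreachable': 3, 'host_up': False}
```

The "unreachable" lines come from an intermediate hop (81.95.8.245 here), not from the worker itself, which is why they are counted separately from replies.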

Suggestions¶

Just recover the machine and confirm it is up again to mitigate the alert
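
The "ensure it's up again" part can be sketched as a small polling loop. This is only an illustration of the verification step, not the recovery procedure itself, which this ticket does not describe. The names `ping_once` and `wait_until_up`, the attempt count, and the delay are assumptions; the `ping` flags `-c` and `-W` assume Linux iputils.

```python
import subprocess
import time
from typing import Callable


def ping_once(host: str) -> bool:
    """Send one ICMP echo request with the system ping (Linux iputils flags assumed)."""
    result = subprocess.run(
        ["ping", "-c", "1", "-W", "2", host],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    # ping exits with 0 only when a reply was received.
    return result.returncode == 0


def wait_until_up(
    host: str,
    probe: Callable[[str], bool] = ping_once,
    attempts: int = 30,
    delay: float = 10.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll the host until the probe succeeds or the attempts run out."""
    for attempt in range(attempts):
        if probe(host):
            return True
        # Do not sleep after the final failed attempt.
        if attempt < attempts - 1:
            sleep(delay)
    return False
```

A call such as `wait_until_up("openqaworker-arm-1.qe.nue2.suse.org")` would return `True` as soon as the worker answers a ping again. The `probe` and `sleep` parameters are injectable so that the loop can be exercised without touching the network.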

Out of scope¶

Fixing the automation: #157753
Fixing osd-deployment: #159303

Related issues 9 (0 open — 9 closed)

Related to openQA Infrastructure - action #159303: [alert] osd-deployment pre-deploy pipeline failed because openqaworker-arm-1.qe.nue2.suse.org was offline size:S

Resolved

nicksinger

2024-06-25


Related to QA - action #157753: Bring back automatic recovery for openqaworker-arm-1 size:M

Resolved

ybonatakis


Related to openQA Infrastructure - action #159318: openqa-piworker host up alert

Resolved

nicksinger

2023-08-09