action #122983

closed

[alert] openqa/monitor-o3 failing because openqaworker1 is down size:M

Added by livdywan almost 2 years ago. Updated over 1 year ago.

Status: Resolved
Priority: High
Assignee:
Category: -
Start date: 2023-01-02
Due date: 2023-05-12
% Done: 0%
Estimated time:
Tags:

Description

Observation

openqa/monitor-o3 is failing because openqaworker1 is down:

PING openqaworker1.openqanet.opensuse.org (192.168.112.6) 56(84) bytes of data.
--- openqaworker1.openqanet.opensuse.org ping statistics ---
1 packets transmitted, 0 received, +1 errors, 100% packet loss, time 0ms
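
For illustration only, a minimal sketch of how a reachability check could interpret the summary line of the ping output above. This is not the actual openqa/monitor-o3 implementation; the function names `parse_ping_summary` and `host_is_down` are hypothetical, and the regular expression assumes the iputils summary format shown in this ticket.

```python
import re

# Assumption: the summary line follows the iputils format seen in this ticket,
# e.g. "1 packets transmitted, 0 received, +1 errors, 100% packet loss, time 0ms".
SUMMARY_RE = re.compile(
    r"(?P<sent>\d+) packets transmitted, (?P<received>\d+) received"
    r".*?(?P<loss>\d+(?:\.\d+)?)% packet loss"
)


def parse_ping_summary(output: str) -> dict:
    """Extract sent/received counts and the loss percentage from ping output."""
    match = SUMMARY_RE.search(output)
    if match is None:
        raise ValueError("no ping summary line found")
    return {
        "sent": int(match["sent"]),
        "received": int(match["received"]),
        "loss_percent": float(match["loss"]),
    }


def host_is_down(output: str) -> bool:
    """Hypothetical policy: the host counts as down if no reply was received."""
    return parse_ping_summary(output)["received"] == 0
```

Applied to the output quoted in this ticket, `host_is_down` returns `True`, because 0 of 1 packets were received.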

Acceptance criteria

  • AC1: openqaworker1 is up and survives reboots

Rollback steps

  • Disable s390x worker slots on rebel again (to use the setup on openqaworker1 again instead).

Suggestions

  • Try to log in
  • Reboot via IPMI
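
The two suggestions above could be scripted roughly as follows. This is a sketch under stated assumptions, not the procedure actually used for this ticket: the BMC hostname and user are placeholders the caller must supply, the function names are hypothetical, and the IPMI password is assumed to be exported in the `IPMI_PASSWORD` environment variable (which `ipmitool -E` reads). `dry_run` defaults to `True`, so nothing is executed unless explicitly requested.

```python
import subprocess


def ssh_probe_command(host: str) -> list[str]:
    """Build an ssh command that only tests whether a login succeeds."""
    # BatchMode avoids hanging on a password prompt; ConnectTimeout bounds the wait.
    return ["ssh", "-o", "BatchMode=yes", "-o", "ConnectTimeout=5", host, "true"]


def ipmi_command(bmc_host: str, user: str, *action: str) -> list[str]:
    """Build an ipmitool command; -E takes the password from IPMI_PASSWORD."""
    return ["ipmitool", "-I", "lanplus", "-H", bmc_host, "-U", user, "-E", *action]


def power_cycle(bmc_host: str, user: str, dry_run: bool = True) -> list[str]:
    """Return the power-cycle command and run it only when dry_run is False."""
    cmd = ipmi_command(bmc_host, user, "chassis", "power", "cycle")
    if not dry_run:
        subprocess.run(cmd, check=True)
    return cmd
```

A cautious order would be to run the SSH probe first and fall back to `power_cycle` only if the login fails, since a power cycle interrupts any jobs still running on the worker.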

Related issues 3 (0 open, 3 closed)

  • Related to openQA Project (public) - action #126188: [openQA][infra][worker][sut] openQA infra performance fluctuates to the level that that leads to tangible test run failure size:M (Resolved, mkittler, 2023-03-20)
  • Blocked by openQA Infrastructure (public) - action #123028: A/C broken in TAM lab size:M (Resolved, nicksinger, 2023-01-12)
  • Blocked by openQA Infrastructure (public) - action #127256: missing nameservers in dhcp response for baremetal machines in NUE-FC-B 2 size:M (Resolved, mkittler, 2023-04-05)
