Project

General

Profile

Actions

action #157441

closed

osd-deployment | Failed pipeline for master (qesapworker-prg5.qa.suse.cz)

Added by tinita 2 months ago. Updated about 2 months ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
Regressions/Crashes
Target version:
Start date:
2024-03-18
Due date:
% Done:

0%

Estimated time:

Description

Observation

https://gitlab.suse.de/openqa/osd-deployment/-/jobs/2398227

Date: Sun, 17 Mar 2024 05:49:14 +0000
Date: Mon, 18 Mar 2024 05:49:42 +0000

qesapworker-prg5.qa.suse.cz:
2184    Minion did not return. [Not connected]

https://stats.openqa-monitor.qa.suse.de/alerting/grafana/host_up_alert_qesapworker-prg5/view?orgId=1

The worker seemed to have hung up. No login prompt on serial tty.
Rebooted via IPMI.
Worker came up, but a systemd service failed: …

It seems like the NVMe disk is not found anymore. Maybe it died and the system subsequently freezed.

Acceptance criteria

  • AC1: osd-deployment passed again
  • AC2: qesapworker-prg5.qa.suse.cz back in production again

Suggestions

Rollback steps


Related issues 1 (0 open1 closed)

Related to openQA Infrastructure - action #157453: [FIRING:1] host_up (qesapworker-prg5: host up alert openQA qesapworker-prg5 host_up_alert_qesapworker-prg5 worker)Rejectedokurz2024-03-18

Actions
Actions

Also available in: Atom PDF