Project

General

Profile

Actions

action #89815

closed

osd-deployment blocked by openqaworker-arm-3 offline and not recovered automatically

Added by okurz almost 4 years ago. Updated over 3 years ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
-
Start date:
2021-03-10
Due date:
2021-04-22
% Done:

0%

Estimated time:

Description

Observation

https://gitlab.suse.de/openqa/osd-deployment/pipelines shows that deployment on 2021-03-08 and 2021-03-10 have been blocked by openqaworker-arm-3 being offline. https://monitor.qa.suse.de/d/1bNU0StZz/automatic-actions?orgId=1 shows that openqaworker-arm-3 is offline since 2021-03-06 and automatic actions could not recover

Acceptance criteria

  • AC1: openqaworker-arm-3 is online again
  • AC2: openqaworker-arm-3 can be recovered automatically or ticket to EngInfra is created automatically as in before
  • AC3: osd-deployment is recovered

Suggestions

  • Remove openqaworker-arm-3 from salt keys
  • Trigger osd-deployment again manually
  • Check automatic actions for 2021-03-06 and find out if a ticket was created or not, fix if not
  • Bring back openqaworker-arm-3 after the above is fixed (or get rid of this unreliable machine and replace it with a better one)

Related issues 3 (0 open3 closed)

Related to openQA Infrastructure (public) - action #92176: [alert] openqaworker-arm-3 offline and CI pipeline unable to send email but stating "passed"Resolvedmkittler2021-05-052021-05-21

Actions
Related to openQA Infrastructure (public) - action #94456: no data from any arm host on https://stats.openqa-monitor.qa.suse.de/d/1bNU0StZz/automatic-actions?orgId=1Resolvedmkittler2021-06-22

Actions
Related to openQA Infrastructure (public) - action #159270: openqaworker-arm-1 is Unreachable size:SResolvedybonatakis2024-04-19

Actions
Actions

Also available in: Atom PDF