action #88225

closed

osd infrastructure: Many failed systemd services on various machines

Added by okurz almost 4 years ago. Updated almost 4 years ago.

Status: Resolved
Priority: Normal
Assignee:
Category: -
Start date: 2021-01-26
Due date:
% Done: 0%
Estimated time:

Description

Observation

Hi guys, the alert at https://stats.openqa-monitor.qa.suse.de/d/KToPYLEWz/failed-systemd-services?orgId=1&editPanel=6&tab=alert has been disabled for some weeks because we had bigger problems, which we have already handled in various tickets, e.g. the broken worker issues regarding the network. However, the panel currently shows 14 (!) failed systemd services on our hosts. I think the original ticket is still blocked, but now by a new issue. I will create a new urgent issue to handle the plethora of failed services.

Acceptance criteria

  • AC1: The number of failed systemd services is significantly reduced
  • AC2: The alert is enabled again
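
To check progress on AC1 directly on a host, the failed units can be listed with systemctl. The following is only a minimal sketch, not how the monitoring dashboard collects its numbers; the function names parse_failed_units and failed_units_local are hypothetical and chosen for illustration. It assumes the output of `systemctl list-units --state=failed --plain --no-legend`, where the unit name is the first whitespace-separated column.

```python
import subprocess
from typing import List


def parse_failed_units(output: str) -> List[str]:
    """Extract unit names from `systemctl list-units --plain --no-legend` output.

    Each non-empty line is assumed to start with the unit name.
    """
    units = []
    for line in output.splitlines():
        fields = line.split()
        if fields:
            units.append(fields[0])
    return units


def failed_units_local() -> List[str]:
    """List failed systemd units on the local host (requires systemctl)."""
    result = subprocess.run(
        ["systemctl", "list-units", "--state=failed", "--plain", "--no-legend"],
        capture_output=True,
        text=True,
        check=False,
    )
    return parse_failed_units(result.stdout)
```

Calling len(failed_units_local()) on each host and summing the results would give a rough equivalent of the count shown in the panel, assuming the panel counts units in the failed state.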

Related issues 1 (0 open, 1 closed)

Related to openQA Infrastructure (public) - action #88474: All workers on powerqaworker-qam-1 are offline (Resolved, livdywan, 2021-02-08)
