Project

General

Profile

Actions

action #110494

closed

alert: openqaworker5 host up size:M

Added by jbaier_cz over 2 years ago. Updated over 2 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Start date:
2022-05-01
Due date:
2022-05-27
% Done:

0%

Estimated time:

Description

Observation

It seems that openqaworker5 is down since early morning: https://stats.openqa-monitor.qa.suse.de/d/WDopenqaworker5/worker-dashboard-openqaworker5?tab=alert&viewPanel=65105&orgId=1&from=1651359305036&to=1651398742983
The host is reachable via IPMI and it is up.

Suggestions

  • ipmi-openqaworker5-ipmi sol activate allowed login but the network was down. There is one failed service "os-autoinst-openvswitch" failed with can't parse bridge… which could be the cause or is just a symptom of another network issue. okurz triggered systemctl start default.target. Check again if it's reachable, login over SoL, check logs, try multiple reboots, fix any problems in network config or os-autoinst-openvswitch
  • We already have multiple levels of retry, e.g. in os-autoinst-openvswitch. Check if this retry was not enough, different problem, or add more retrying on systemd level

Related issues 1 (0 open1 closed)

Related to openQA Project (public) - action #110497: Minion influxdb data causing unusual download rates size:MResolvedmkittler2022-05-01

Actions
Actions

Also available in: Atom PDF