Project

General

Profile

action #110494

Updated by cdywan 4 months ago

## Observation
It seems that openqaworker5 is down since early morning: https://stats.openqa-monitor.qa.suse.de/d/WDopenqaworker5/worker-dashboard-openqaworker5?tab=alert&viewPanel=65105&orgId=1&from=1651359305036&to=1651398742983
The host is reachable via IPMI and it is up.

## Suggestions

* `ipmi-openqaworker5-ipmi sol activate` allowed login but the network was down. There is one failed service "os-autoinst-openvswitch" failed with `can't parse bridge…` which could be the cause or is just a symptom of another network issue. okurz triggered `systemctl start default.target`. Check again if it's reachable, login over SoL, check logs, try multiple reboots, fix any problems in network config or os-autoinst-openvswitch
* We already have multiple levels of retry, e.g. in os-autoinst-openvswitch. Check if this retry was not enough, different problem, or add more retrying on systemd level

Back