Actions
action #68869
closedautomatic ARM recovery jobs fail due to caasp master running gitlab CI jobs have expired certificates
Start date:
2020-07-12
Due date:
% Done:
0%
Estimated time:
Description
https://stats.openqa-monitor.qa.suse.de/d/1bNU0StZz/automatic-actions?orgId=1 shows two ARM machines being down for two days. The automatic recovery did not work. The good thing is that the long-time alerts have triggered. The problem is visible in https://gitlab.suse.de/openqa/grafana-webhook-actions/-/jobs/232053 on the side of gitlab runner machines.
Updated by okurz over 4 years ago
- Status changed from New to Blocked
Reported https://infra.nue.suse.com/SelfService/Display.html?id=174378
Have triggered reboot of both openqaworker-arm-1 and openqaworker-arm-2 manually now with:
ipmitool -I lanplus -H openqaworker-arm-1-ipmi.suse.de -U ADMIN -P ADMIN power cycle
ipmitool -I lanplus -H openqaworker-arm-2-ipmi.suse.de -U ADMIN -P ADMIN power cycle
Updated by okurz over 4 years ago
- Status changed from Blocked to Resolved
infra ticket was resolved, all working fine again.
Updated by jbaier_cz over 2 years ago
- Related to action #113561: failed pipelines for openQABot and bot-ng because of an expired cert added
Actions