Actions
action #119479
openopenqABot pipeline failed after runner getting stuck for 1h0m0s size:M
Status:
Workable
Priority:
Low
Assignee:
-
Category:
-
Target version:
QA (public, currently private due to #173521) - future
Start date:
2022-10-21
Due date:
% Done:
0%
Estimated time:
Description
Motivation¶
The openqABot pipeline failed. There's no logs and it seems like the bot never returns: https://gitlab.suse.de/qa-maintenance/openQABot/-/jobs/1212440, https://gitlab.suse.de/qa-maintenance/openQABot/-/jobs/1147725 and https://gitlab.suse.de/qa-maintenance/openQABot/-/jobs/1147636 (which aren't in the maintenance window)
++ ./oqabot.py --verbose --only-mr --metadata /etc/openqabot/mr.yml
41++ tee openqabot_0.log
43ERROR: Job failed: execution took longer than 1h0m0s seconds
A suspicion is that the GitLab runner never returned since otherwise we would see an error or log output. This occurred during the maintenance window.
Acceptance criteria¶
- AC1: It is ensured that gitlab CI jobs do not time out with no helpful information
Suggestions¶
- Raise to SUSE-IT, ask about the gitlab instance and runners and ask to review logs at the time of the failure
- Increase the timeout e.g. Settings -> CI / CD -> General pipelines -> Timeout
- Look at job history for other failed jobs to find a pattern https://gitlab.suse.de/qa-maintenance/openQABot/-/jobs?statuses=FAILED
Updated by jbaier_cz about 2 years ago
Also please note, we have
artifacts:
when: always
in the pipeline, so we should have the log uploaded even if the pipeline itself fail (so this is also an indication of a runner failure than a job failure).
Updated by livdywan about 2 years ago
- Subject changed from openqABot pipeline failed after runner getting stuck for 1h0m0s to openqABot pipeline failed after runner getting stuck for 1h0m0s size:M
- Description updated (diff)
- Status changed from New to Workable
Actions