action #116107
closedopenQA-in-OpenQA openqa_from_containers test fails in build size:M
Description
Observation¶
openQA test in scenario openqa-Tumbleweed-dev-x86_64-openqa_from_containers@64bit-2G fails in
build
# Test died: command 'for i in webui worker; do retry -s 30 -- docker build openQA/container/$i -t openqa_$i; done' failed at openqa//tests/containers/build.pm line 8.
Test suite description¶
Maintainer: okurz@suse.de Test for running openQA itself from containers. To be used with "openqa" distri. Introduced retry on the job level due to https://progress.opensuse.org/issues/108665 as there can still be sporadic network issues sometimes.
Reproducible¶
Fails since (at least) Build :TW.12159
Expected result¶
Last good: :TW.12158 (or more recent)
Further details¶
Always latest result in this scenario: latest
Suggestions¶
- Change the docker call to reveal the exit code
- Reveal the return code via the testapi
- Investigate this in a mob session
Updated by tinita over 2 years ago
- Subject changed from openQA-in-OpenQA test fails in to openQA-in-OpenQA test fails in build
Updated by tinita over 2 years ago
- Subject changed from openQA-in-OpenQA test fails in build to openQA-in-OpenQA openqa_from_containers test fails in build
Updated by livdywan over 2 years ago
- Subject changed from openQA-in-OpenQA openqa_from_containers test fails in build to openQA-in-OpenQA openqa_from_containers test fails in build size:M
- Description updated (diff)
- Status changed from New to Workable
Updated by mkittler over 2 years ago
- Category deleted (
Regressions/Crashes) - Target version deleted (
Ready)
The video shows that some install scripts failed during the installation with zypper. So while the container was built/exited successfully the zypper command that ran within may not have returned with a zero return code.
Updated by tinita over 2 years ago
- Category set to Regressions/Crashes
- Target version set to Ready
Updated by tinita over 2 years ago
Apparently openqa_install+publish also fails, and the error is visible:
https://openqa.opensuse.org/tests/2597042#step/openqa_webui/11
It's about an unsigned repomd.xml file.
Maybe that's also what's happening in the container test and we can't see it in the screenshots? Maybe that's what Marius was able to see in the video?
Updated by mkittler over 2 years ago
- Status changed from Workable to In Progress
- Assignee set to mkittler
PR for improving the error handling: https://github.com/os-autoinst/os-autoinst-distri-openQA/pull/99
Not sure whether we can fix the root cause. Maybe it is also a good idea to increase the timeout or number of retries.
Updated by openqa_review over 2 years ago
- Due date set to 2022-09-16
Setting due date based on mean cycle time of SUSE QE Tools
Updated by mkittler over 2 years ago
- Status changed from In Progress to Resolved
There were already over 10 builds since the PR has been merged and all passed. Not sure whether it makes sense to increase the timeout (as mentioned in the previous comment) so I'll just leave it as-is. Note that in the passing jobs one can now see already much better the result of each contain build (as each command now has its own screenshot at the end). This should make future investigations if something goes wrong easier.