action #107062
Updated by okurz over 2 years ago
## Observation I will use this ticket to collect the different errors I observe in our tests (at least for QE-C squad) that fail due to network issues. Normally a restart helps to get the job green again (in case we need it green), but this is not the ideal solution. The idea of this ticket is to collect more potential issues caught by reviewers and propose solutions for some of them, in the code (retry same command several times might help) or in the infra side. There is an example for each error I found, but from my experience reviewing jobs every day, these failures happen multiple times a day and randomly (difficult to predict). 1) **SUSEConnect timeouts** -> https://openqa.suse.de/tests/8189768#step/docker/34 `Test died: command 'SUSEConnect -p sle-module-containers/${VERSION_ID}/${CPU} ' timed out at /usr/lib/os-autoinst/testapi.pm line 1039.` Or https://openqa.suse.de/tests/8193554#step/suseconnect_scc/8 `Test died: command 'SUSEConnect -r $regcode' timed out at /usr/lib/os-autoinst/testapi.pm line 950.` 2) **[updates.suse.com](updates.suse.com) not reachable** -> https://openqa.suse.de/tests/8189697#step/image_docker/1110 ``` Retrieving: kmod-25-6.10.1.aarch64.rpm [.........error] Abort, retry, ignore? [a/r/i/...? shows all options] (a): a Download (curl) error for 'https://updates.suse.com/SUSE/Updates/SLE-Module-Basesystem/15-SP2/aarch64/update/aarch64/kmod-25-6.10.1.aarch64.rpm?nE0jiYdfiOdLYjH0o-llNN2xIDXncon0vYw8z1aBPGx00H9S1eN413vUsfSJnzFrVz-CoZoGtSdsPKIDRAOQy3Xw2Tac3Yx5_1i8TPomSNiqhDJ0Ayxro23n46NHHB-XHq669RlHs17wiUFSJiSMCSh-YzdGdFw': Error code: Connection failed Error message: Could not resolve host: updates.suse.com Problem occurred during or after installation or removal of packages: Installation has been aborted as directed. Please see the above error message for a hint. ``` 3) **SCC timeouts** -> https://openqa.suse.de/tests/8189613#step/image_docker/316 ``` docker run --entrypoint /usr/lib/zypp/plugins/services/container-suseconnect-zypp -i zypper_docker_derived lp ... 2022/02/18 07:16:19 Installed product: SLES-12.3-x86_64 2022/02/18 07:16:19 Registration server set to https://scc.suse.com 2022/02/18 07:16:30 Get https://scc.suse.com/connect/subscriptions/products?arch=x86_64&identifier=SLES&version=12.3: dial tcp: lookup scc.suse.com on 10.0.2.3:53: read udp 172.17.0.2:37151->10.0.2.3:53: i/o timeout ``` 4) **zypper ref timeout or error** -> https://openqa.opensuse.org/tests/2193730#step/image_podman/124 ``` podman run -i --name 'refreshed' --entrypoint '' registry.opensuse.org/opensuse/leap/15.3/images/totest/containers/opensuse/leap:15.3 zypper -nv ref ... Retrieving: cb71cb070e8aac79327e6f1b6edc5317122ca1f72970299c3cb2cf505e18b27f-deltainfo.xml.gz [........................done (82.3 KiB/s)] Retrieving: 832729371fe20bc1a4d27e59d76c10ffe2c0b5a1ff71c4e934e7a11baa24a74b-primary.xml.gz [............................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................error (87.0 KiB/s)] WJdDM-124- ``` ## Acceptance criteria * **AC1:** All existing subtasks are resolved, no additional work needed on top