Project

General

Profile

action #107062

Updated by okurz about 2 years ago

I will use this ticket to collect the different errors I observe in our tests (at least for QE-C squad) that fail due to network issues. 
 Normally a restart helps to get the job green again (in case we need it green), but this is not the ideal solution. 

 The idea of this ticket is to collect more potential issues caught by reviewers and propose solutions for some of them, in the code (retry same command several times might help) or in the infra side.  


 There is an example for each error I found, but from my experience reviewing jobs every day, these failures happen multiple times a day and randomly (difficult to predict). 

 1) **SUSEConnect timeouts** -> https://openqa.suse.de/tests/8189768#step/docker/34 
 `Test died: command 'SUSEConnect -p sle-module-containers/${VERSION_ID}/${CPU} ' timed out at /usr/lib/os-autoinst/testapi.pm line 1039.` 

 Or https://openqa.suse.de/tests/8193554#step/suseconnect_scc/8 
 `Test died: command 'SUSEConnect -r $regcode' INTERNAL-USE-ONLY-b6d61017abdfb29b' timed out at /usr/lib/os-autoinst/testapi.pm line 950.` 

 2) **[updates.suse.com](updates.suse.com) not reachable** -> https://openqa.suse.de/tests/8189697#step/image_docker/1110 
 ``` 
 Retrieving: kmod-25-6.10.1.aarch64.rpm [.........error] 
 Abort, retry, ignore? [a/r/i/...? shows all options] (a): a 
 Download (curl) error for 'https://updates.suse.com/SUSE/Updates/SLE-Module-Basesystem/15-SP2/aarch64/update/aarch64/kmod-25-6.10.1.aarch64.rpm?nE0jiYdfiOdLYjH0o-llNN2xIDXncon0vYw8z1aBPGx00H9S1eN413vUsfSJnzFrVz-CoZoGtSdsPKIDRAOQy3Xw2Tac3Yx5_1i8TPomSNiqhDJ0Ayxro23n46NHHB-XHq669RlHs17wiUFSJiSMCSh-YzdGdFw': 
 Error code: Connection failed 
 Error message: Could not resolve host: updates.suse.com 

 Problem occurred during or after installation or removal of packages: 
 Installation has been aborted as directed. 
 Please see the above error message for a hint. 
 ``` 


 3) **SCC timeouts** -> https://openqa.suse.de/tests/8189613#step/image_docker/316 

 ``` 
 docker run --entrypoint /usr/lib/zypp/plugins/services/container-suseconnect-zypp -i zypper_docker_derived lp 
 ... 
 2022/02/18 07:16:19 Installed product: SLES-12.3-x86_64 
 2022/02/18 07:16:19 Registration server set to https://scc.suse.com 
 2022/02/18 07:16:30 Get https://scc.suse.com/connect/subscriptions/products?arch=x86_64&identifier=SLES&version=12.3: dial tcp: lookup scc.suse.com on 10.0.2.3:53: read udp 172.17.0.2:37151->10.0.2.3:53: i/o timeout 
 ``` 


 4) **zypper ref timeout or error** -> https://openqa.opensuse.org/tests/2193730#step/image_podman/124 
 ``` 
 podman run -i --name 'refreshed' --entrypoint '' registry.opensuse.org/opensuse/leap/15.3/images/totest/containers/opensuse/leap:15.3 zypper -nv ref 
 ... 
 Retrieving: cb71cb070e8aac79327e6f1b6edc5317122ca1f72970299c3cb2cf505e18b27f-deltainfo.xml.gz [........................done (82.3 KiB/s)] 
 Retrieving: 832729371fe20bc1a4d27e59d76c10ffe2c0b5a1ff71c4e934e7a11baa24a74b-primary.xml.gz [............................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................error (87.0 KiB/s)] 
 WJdDM-124- 
 ``` 

Back