openSUSE Project Management Tool: Issueshttps://progress.opensuse.org/https://progress.opensuse.org/themes/openSUSE/favicon/favicon.ico?15829177842019-10-15T06:06:43ZopenSUSE Project Management Tool
Redmine openQA Project - coordination #58166 (Resolved): EPIC: Continue tests after failures on !qemuhttps://progress.opensuse.org/issues/581662019-10-15T06:06:43Zxlaixlai@suse.com
<p>Our jobs run on ipmi workers. When many tests chained, to get high test efficiency, we need the feature that the following tests can continue when earlier tests fail.</p>
<p>We were suggested to set fatal flag to 0 to these tests. However from the tried example, it did not work.</p>
<p>Failure job link: <br>
<a href="http://10.67.18.220/tests/38#" class="external">http://10.67.18.220/tests/38#</a>.</p>
<p>Can any expert on this help to confirm whether we use it the correct way? </p>
<p>Job details:</p>
<pre><code>Test order:
login_console -> fail_moduleA -> fail_moduleB
fail_moduleA main code:
sub run {
type_string("echo start fail_moduleA.pm\n");
die "die on purpose to check if test continue to next module";
}
sub post_fail_hook {
#force_soft_failure("let test continue...");
type_string("post_fail_hook DONE");
save_screenshot;
}
sub test_flags {
return {fatal => 0};
But B was not started after A fail.
</code></pre> openQA Infrastructure - action #55868 (Resolved): USB device is removed from gonzo.qa.suse.de.https://progress.opensuse.org/issues/558682019-08-23T08:20:01Zxlaixlai@suse.com
<p>We equipped on some servers some usb device to test virtualization pvusb feature. However today from a failure test <a href="https://openqa.suse.de/tests/3280807">https://openqa.suse.de/tests/3280807</a>, we noticed that the logitech mouse usb device on it was accidentally removed by someone and resulted in test failure.</p>
<p>SUT_IP: gonzo.qa.suse.de<br>
Missing USB device: <br>
Bus 003 Device 002: ID 046d:c001 Logitech, Inc. N48/M-BB48/M-UK96A [FirstMouse Plus]</p>
<p><a class="user active user-mention" href="https://progress.opensuse.org/users/24624">@nicksinger</a> Would you please help to find out who moved it and ask him/her to recover it? I know it is not easy since so many people have access to lab. If you can not find it, would you please help to remind people to not move our usb devices? Really appreciate if you can help paste some warning msg on these machines.</p>
<p>SUTs with usb device:<br>
21:<br>
WORKER_CLASS: virt-mm-64bit-ipmi,virt-pvusb-64bit-ipmi,64bit-ipmi<br>
IPMI_HOSTNAME: sp.kermit.qa.suse.de<br>
IPMI_PASSWORD: ADMIN<br>
IPMI_USER: ADMIN<br>
MAX_JOB_TIME: 32000<br>
SUT_IP: kermit.qa.suse.de<br>
PVUSB_DEVICE: Keyboard<br>
22: //missing now<br>
WORKER_CLASS: virt-mm-64bit-ipmi,virt-pvusb-64bit-ipmi,64bit-ipmi<br>
IPMI_HOSTNAME: sp.gonzo.qa.suse.de<br>
IPMI_PASSWORD: ADMIN<br>
IPMI_USER: ADMIN<br>
MAX_JOB_TIME: 32000<br>
SUT_IP: gonzo.qa.suse.de<br>
PVUSB_DEVICE: Logitech<br>
23:<br>
WORKER_CLASS: virt-mm-64bit-ipmi,virt-pvusb-64bit-ipmi,64bit-ipmi<br>
IPMI_HOSTNAME: sp.fozzie.qa.suse.de<br>
IPMI_PASSWORD: ADMIN<br>
IPMI_USER: ADMIN<br>
MAX_JOB_TIME: 32000<br>
SUT_IP: fozzie-1.qa.suse.de<br>
PVUSB_DEVICE: Logitech<br>
24:<br>
WORKER_CLASS: virt-mm-64bit-ipmi,virt-pvusb-64bit-ipmi,64bit-ipmi<br>
IPMI_HOSTNAME: sp.scooter.qa.suse.de<br>
IPMI_PASSWORD: ADMIN<br>
IPMI_USER: ADMIN<br>
MAX_JOB_TIME: 32000<br>
SUT_IP: scooter-1.qa.suse.de</p>
<pre><code> 16:
WORKER_CLASS: virt-pvusb-64bit-ipmi,64bit-ipmi
IPMI_HOSTNAME: 10.162.28.200
IPMI_PASSWORD: qatesting
IPMI_USER: admin
MAX_JOB_TIME: 32000
SUT_IP: 10.162.2.75
PVUSB_DEVICE: Winbond
17: # new worker quinn.qa.suse.de
WORKER_CLASS: virt-pvusb-64bit-ipmi,64bit-ipmi
IPMI_HOSTNAME: 10.162.3.87
IPMI_PASSWORD: ADMIN
IPMI_USER: ADMIN
SUT_IP: 10.162.2.87
MAX_JOB_TIME: 32000
PVUSB_DEVICE: Verbatim
</code></pre> openQA Infrastructure - action #55757 (Resolved): OSD ftp service has problem: NFS mount on qanet...https://progress.opensuse.org/issues/557572019-08-20T10:21:33Zxlaixlai@suse.com
<p>From around 4 hours ago, pxe installation on osd always fail at boot_from_pxe with error "Could not find kernel image xxx", <a href="https://openqa.suse.de/tests/3270212#step/boot_from_pxe/6" class="external">https://openqa.suse.de/tests/3270212#step/boot_from_pxe/6</a>.</p>
<p>We tested ftp service, it is not accessible for images, but http works well.</p>
<p>Please help recover. It blocks virtualization test for beta4 candidate.</p>
<a name="Log"></a>
<h1 >Log:<a href="#Log" class="wiki-anchor">¶</a></h1>
<p>FTP STATUS:<br>
'wget <a href="ftp://openqa.suse.de/assets/repo/fixed/SLE-12-SP4-Server-DVD-x86_64-GM-DVD1/boot/x86_64/loader/linux" class="external">ftp://openqa.suse.de/assets/repo/fixed/SLE-12-SP4-Server-DVD-x86_64-GM-DVD1/boot/x86_64/loader/linux</a><br>
--2019-08-20 06:15:32-- <a href="ftp://openqa.suse.de/assets/repo/fixed/SLE-12-SP4-Server-DVD-x86_64-GM-DVD1/boot/x86_64/loader/linux" class="external">ftp://openqa.suse.de/assets/repo/fixed/SLE-12-SP4-Server-DVD-x86_64-GM-DVD1/boot/x86_64/loader/linux</a><br>
=> ‘linux.1’<br>
Resolving openqa.suse.de (openqa.suse.de)... 10.160.0.207, 2620:113:80c0:8080:10:160:0:207 <br>
Connecting to openqa.suse.de (openqa.suse.de)|10.160.0.207|:21... connected. <br>
Logging in as anonymous ... Logged in!<br>
==> SYST ... done. ==> PWD ... done.<br>
==> TYPE I ... done. ==> CWD (1) /assets/repo/fixed/SLE-12-SP4-Server-DVD-x86_64-GM-DVD1/boot/x86_64/loader ...<br>
No such directory ‘assets/repo/fixed/SLE-12-SP4-Server-DVD-x86_64-GM-DVD1/boot/x86_64/loader’. <br>
'<br>
HTTP:<br>
'wget <a href="http://openqa.suse.de/assets/repo/fixed/SLE-12-SP4-Server-DVD-x86_64-GM-DVD1/boot/x86_64/loader/linux" class="external">http://openqa.suse.de/assets/repo/fixed/SLE-12-SP4-Server-DVD-x86_64-GM-DVD1/boot/x86_64/loader/linux</a><br>
--2019-08-20 06:16:35-- <a href="http://openqa.suse.de/assets/repo/fixed/SLE-12-SP4-Server-DVD-x86_64-GM-DVD1/boot/x86_64/loader/linux" class="external">http://openqa.suse.de/assets/repo/fixed/SLE-12-SP4-Server-DVD-x86_64-GM-DVD1/boot/x86_64/loader/linux</a><br>
Resolving openqa.suse.de (openqa.suse.de)... 10.160.0.207, 2620:113:80c0:8080:10:160:0:207 <br>
Connecting to openqa.suse.de (openqa.suse.de)|10.160.0.207|:80... connected. <br>
HTTP request sent, awaiting response... 200 OK<br>
Length: 6856480 (6.5M)<br>
Saving to: ‘linux.1’</p>
<p>100%[===============================================================>] 6,856,480 --.-K/s in 0.08s </p>
<p>2019-08-20 06:16:35 (80.0 MB/s) - ‘linux.1’ saved [6856480/6856480]<br>
'</p>
<p>'# curl <a href="ftp://openqa.suse.de/assets/repo/fixed/SLE-12-SP4-Server-DVD-x86_64-GM-DVD1/boot/x86_64/loader/linux" class="external">ftp://openqa.suse.de/assets/repo/fixed/SLE-12-SP4-Server-DVD-x86_64-GM-DVD1/boot/x86_64/loader/linux</a><br>
curl: (9) Server denied you to change to the given directory<br>
'</p>
openQA Infrastructure - action #54263 (Resolved): Please help to setup ipmi on openqaw5-xen.qa.su...https://progress.opensuse.org/issues/542632019-07-15T10:09:58Zxlaixlai@suse.com
<p>Hello,</p>
<p>This machine is for virtualization QA test in openqa. Please help to setup ipmi connection on it for remote control from beijing. Detailed info in <a href="https://racktables.nue.suse.com/index.php?page=object&tab=default&object_id=3468" class="external">https://racktables.nue.suse.com/index.php?page=object&tab=default&object_id=3468</a>.</p>
<p>Thank you in advance!</p>
openQA Infrastructure - action #51710 (Resolved): [openqa infra] Need 15sp1 repo under repo/fixedhttps://progress.opensuse.org/issues/517102019-05-21T03:21:09Zxlaixlai@suse.com
<p>In 12sp5 test plan, virtualization needs to cover tests on sle15sp1 host. So we need 15sp1 gmc repo under /var/lib/openqa/share/factory/repo/fixed just like other released products SLE12SP3. But I do not have permission to do so. Can anyone help to do it?</p>
<p><a class="user active user-mention" href="https://progress.opensuse.org/users/24624">@nicksinger</a>, please help to reassign if you find other more suitable one. Thanks!</p>
<p>Log:<br>
xlai@openqa:/var/lib/openqa/share/factory/repo/fixed> ln -s ../SLE-15-SP1-Installer-DVD-x86_64-Build227.1-Media1 ./SLE-15-SP1-Installer-DVD-x86_64-GM-DVD1<br>
ln: failed to create symbolic link './SLE-15-SP1-Installer-DVD-x86_64-GM-DVD1': Permission denied<br>
xlai@openqa:/var/lib/openqa/share/factory/repo/fixed> </p>
openQA Infrastructure - action #48029 (Resolved): [network failure] Can not access ipmi worker gr...https://progress.opensuse.org/issues/480292019-02-18T06:35:35Zxlaixlai@suse.com
<p>All recent triggered jobs were incomplete on this worker because not able to establish ipmitool connection. Local ping to that ip also failed.</p>
<p>Worker config:<br>
10:<br>
WORKER_CLASS: 64bit-ipmi<br>
IPMI_HOSTNAME: openqaipmi5-sp.qa.suse.de<br>
IPMI_PASSWORD: qatesting<br>
IPMI_USER: admin<br>
MAX_JOB_TIME: 32000<br>
SUT_IP: openqaipmi5.qa.suse.de</p>
<p>Ping from beijing failure:</p>
<p>linux-gepp:~ # ping openqaipmi5-sp.qa.suse.de<br>
PING openqaipmi5-sp.qa.suse.de (10.162.28.160) 56(84) bytes of data.<br>
<sup>C</sup><br>
--- openqaipmi5-sp.qa.suse.de ping statistics ---<br>
92 packets transmitted, 0 received, 100% packet loss, time 91728ms</p>
<p>linux-gepp:~ # </p>
openQA Infrastructure - action #44795 (Rejected): [tools] setup failure: Cache service not availa...https://progress.opensuse.org/issues/447952018-12-06T02:31:53Zxlaixlai@suse.com
<p>In latest build 108.1, most of virtualization jobs(several tens) fail as incomplete job with similar reason:</p>
<p>[2018-12-05T11:34:06.0806 CET] [info] +++ setup notes +++<br>
[2018-12-05T11:34:06.0806 CET] [info] start time: 2018-12-05 10:34:06<br>
[2018-12-05T11:34:06.0807 CET] [info] running on grenache-1:17 (Linux 4.4.138-94.39-default #1 SMP Mon Jun 18 13:27:26 UTC 2018 (baa07f9) ppc64le)<br>
[2018-12-05T11:34:06.0812 CET] [warn] job is missing files, releasing job<br>
[2018-12-05T11:34:06.0842 CET] [info] +++ worker notes +++<br>
[2018-12-05T11:34:06.0842 CET] [info] end time: 2018-12-05 10:34:06<br>
[2018-12-05T11:34:06.0842 CET] [info] result: setup failure: Cache service not available.<br>
[2018-12-05T11:34:06.0843 CET] [info] uploading autoinst-log.txt</p>
<p>Fail job link:<br>
<a href="https://openqa.suse.de/tests/2303781/file/autoinst-log.txt" class="external">https://openqa.suse.de/tests/2303781/file/autoinst-log.txt</a></p>
openQA Infrastructure - action #44498 (Resolved): [ipmi][grenache-1] Incomplete job due to no spa...https://progress.opensuse.org/issues/444982018-11-29T07:43:06Zxlaixlai@suse.com
<p>Key log:<br>
[2018-11-28T11:43:34.336 CET] [debug] Backend process died, backend errors are reported below in the following lines Can't write to log: No space left on device at /usr/lib/os-autoinst/bmwqemu.pm line 200.</p>
<p>XIO: fatal IO error 11 (Resource temporarily unavailable) on X server ":57369"<br>
after 15738 requests (15581 known processed) with 0 events remaining.<br>
[2018-11-28T11:43:34.660 CET] [debug] backend process exited: 200<br>
xterm: fatal IO error 11 (Resource temporarily unavailable) or KillClient on X server ":57369"<br>
[2018-11-28T11:43:34.740 CET] [debug] Driver backend collected unknown process with pid 477868 and exit status: 1<br>
[2018-11-28T11:43:34.740 CET] [debug] Driver backend collected unknown process with pid 477844 and exit status: 0<br>
[2018-11-28T11:43:34.740 CET] [debug] sysread failed: <br>
[2018-11-28T11:43:34.746 CET] [debug] Driver backend collected unknown process with pid 477870 and exit status: 84<br>
[2018-11-28T11:43:34.746 CET] [debug] Driver backend collected unknown process with pid 477880 and exit status: 0<br>
[2018-11-28T11:43:34.751 CET] [debug] commands process exited: 0<br>
[2018-11-28T11:43:37.0493 CET] [info] +++ worker notes +++<br>
[2018-11-28T11:43:37.0494 CET] [info] end time: 2018-11-28 10:43:37<br>
[2018-11-28T11:43:37.0495 CET] [info] result: died<br>
[2018-11-28T11:43:37.0496 CET] [info] uploading video.ogv<br>
[2018-11-28T11:43:37.0583 CET] [info] uploading vars.json<br>
[2018-11-28T11:43:37.0612 CET] [info] uploading serial0.txt</p>
<p>Failure job:<br>
grenache-1:16<br>
<a href="https://openqa.suse.de/tests/2285123/file/autoinst-log.txt" class="external">https://openqa.suse.de/tests/2285123/file/autoinst-log.txt</a></p>
<p>grenache-1:10<br>
<a href="https://openqa.suse.de/tests/2285129" class="external">https://openqa.suse.de/tests/2285129</a></p>
<p>grenache-1:17<br>
<a href="https://openqa.suse.de/tests/2285036/file/autoinst-log.txt" class="external">https://openqa.suse.de/tests/2285036/file/autoinst-log.txt</a></p>
openQA Infrastructure - action #44351 (Resolved): [ipmi] Workers/jobs stuck https://progress.opensuse.org/issues/443512018-11-26T10:20:30Zxlaixlai@suse.com
<p>Currently at least 4 ipmi workers (openqaworker2:23/24/25/26) are stuck. They can not finish jobs or take new jobs for over a day and some even several days.</p>
<p>Despite the misleading message from the developer mode, this issue has nothing to do with the developer mode. (The misleading message is also already fixed on latest master.)</p>
<p>This issue is likely tied to the latest refactoring of the worker cache. At least the <code>autoinst-log.txt</code> leads to that conclusion:</p>
<pre><code>tail -f /var/lib/openqa/pool/26/autoinst-log.txt
[2018-11-26T11:07:10.0806 CET] [info] +++ setup notes +++
[2018-11-26T11:07:10.0806 CET] [info] start time: 2018-11-26 10:07:10
[2018-11-26T11:07:10.0806 CET] [info] running on openqaworker2:26 (Linux 4.7.5-2.g02c4d35-default #1 SMP PREEMPT Mon Sep 26 08:11:45 UTC 2016 (02c4d35) x86_64)
[2018-11-26T11:07:10.0829 CET] [debug] Downloading SLE-15-SP1-Installer-DVD-x86_64-Build100.4-Media1.iso - request sent to Cache Service.
</code></pre>
<p>Since spvm jobs on grenache show the same symptom, this is likely also not ipmi-specific.</p>
<p>Job links: <a href="https://openqa.suse.de/tests/2272494" class="external">https://openqa.suse.de/tests/2272494</a> (openqaworker2), <a href="https://openqa.suse.de/tests/2278365" class="external">https://openqa.suse.de/tests/2278365</a> (grenache-1)</p>
openQA Infrastructure - action #44228 (Resolved): Need to deploy SLE12SP4 GM x86_64 repo to http:...https://progress.opensuse.org/issues/442282018-11-22T08:17:03Zxlaixlai@suse.com
<p>In SLE15SP1 test plan, virtualization will cover tests on sle12sp4 GM host. So like before, can anyone help to make it under <a href="http://openqa.suse.de/assets/repo/fixed/" class="external">http://openqa.suse.de/assets/repo/fixed/</a>, like other sle12sp3 or sle15 GM setup?</p>
openQA Infrastructure - action #43874 (Resolved): HDD in kermit.qa.suse.de has problems in combin...https://progress.opensuse.org/issues/438742018-11-16T08:12:55Zxlaixlai@suse.com
<p>OpenQA worker openqaworker2:23 relevant SUT, configuration:<br>
"IPMI_HOSTNAME" : "sp.kermit.qa.suse.de",<br>
"IPMI_PASSWORD" : "ADMIN",<br>
"IPMI_USER" : "ADMIN",</p>
<p>mkfs and fdisk operations on this disk fails (on sle12sp3 /15sp1 system), showing error of writing superblocks.</p>
<p>Please help to fix it.</p>
openQA Infrastructure - action #40544 (Resolved): [OpenQA][IPMI backend] IPMI worker can not surv...https://progress.opensuse.org/issues/405442018-09-04T07:29:51Zxlaixlai@suse.com
<p>We have two dell machines, vh003.qa2.suse.asia and vh004.qa2.suse.asia. When they are binded with ipmi worker, the jobs on those two machines can not survive reboot. For example, after host installation when it boots to the new os, the sol console can only get black screen, not reactive at all. So does any other simple reboot.</p>
<p>After debugging by john and jerry, it is found that the reset_console operation leads to this failure because the existing sol console connection is not properly cleaned up and result in failure in the new sol console setup.</p>
<p>John and jerry also have their 2 proposals as solutions which are open for discussions. I will let them describe in more details in later comments.</p>
openQA Project - action #38522 (Resolved): Updating Job Group default priority does not have effe...https://progress.opensuse.org/issues/385222018-07-18T10:11:19Zxlaixlai@suse.com
<p>In virtualization-acceptance job group in sle12sp4, change the default priority from 50 to 30, but the tests still show priority as 50, and also after I retrigger the job via "client isos post", the priority of job is still 50.</p>
<p>I tried to readd the tests after setting default priority of that group to 30 suggested by sergio(this way worked for him), but the added test still shows old priority 50.</p>
<p>BTW, we need to evaluate the run time for this job group during beta1 with priority 30, can anyone help to change the priority if you can by anyway?</p>
openQA Project - action #16602 (Resolved): Comment with unicode characters do not work because of...https://progress.opensuse.org/issues/166022017-02-09T02:24:15Zxlaixlai@suse.com
<a name="observation"></a>
<h2 >observation<a href="#observation" class="wiki-anchor">¶</a></h2>
<p>An error message is reported when I updated result for a build on virtualization group on openqa.suse.de</p>
<p>What was typed on comment:</p>
<pre><code>**Date:** <2017-02-09>
**Build:** 0234
**Arch:** x86_64
**Status: <font color="red">Red</font>
**New Product bugs:**
* No
**Requirement:**
* one more ipmi machine: poo#15376
**Existing Product bugs:**
* sles12sp3 pv guest installation: bsc#1012005
* pvusb bug on sles12sp3 xen: bsc#1023948
**Existing openqa issues:**
* guest migration job fail/multi-machine: poo#15376
* ipmi ikvm unstable: poo#14338
* ipmi serial console unstable: poo#13914
* ipmi keyboard simulator unstable: poo#12982 poo#16088
</code></pre>
<a name="steps-to-reproduce"></a>
<h2 >steps to reproduce<a href="#steps-to-reproduce" class="wiki-anchor">¶</a></h2>
<ul>
<li>on an openQA instance with enabled rabbitmq plugin</li>
<li>type character '/' in any comment</li>
<li>observe reported error</li>
</ul>
<a name="problem"></a>
<h2 >problem<a href="#problem" class="wiki-anchor">¶</a></h2>
<p>H1. Mojo::RabbitMQ::Client does not handle unicode input correctly<br>
<a href="https://github.com/inway/mojo-rabbitmq-client/commit/c043447d4fa53a1456515c4def1c4d1367cd87b0#commitcomment-20829251" class="external">https://github.com/inway/mojo-rabbitmq-client/commit/c043447d4fa53a1456515c4def1c4d1367cd87b0#commitcomment-20829251</a><br>
H2. <code>lib/OpenQA/WebAPI/Plugin/AMQP.pm</code> does not forward unicode correctly to Mojo::RabbitMQ::Client</p>
<a name="workaround"></a>
<h2 >workaround<a href="#workaround" class="wiki-anchor">¶</a></h2>
<p>only type ascii within comments.</p>
openQA Infrastructure - action #16088 (Rejected): [ipmi] Do not respond to send_key.https://progress.opensuse.org/issues/160882017-01-19T06:02:20Zxlaixlai@suse.com
<p>In job <a href="https://openqa.suse.de/tests/716647#step/reboot_and_wait_up_normal2/3" class="external">https://openqa.suse.de/tests/716647#step/reboot_and_wait_up_normal2/3</a>, we use send_key_until_needle_match api to select xen grub menuentry, however after catching a not matching screen, and a send_key is sent, screen does not change.</p>