openSUSE Project Management Tool: Issueshttps://progress.opensuse.org/https://progress.opensuse.org/themes/openSUSE/favicon/favicon.ico?15829177842024-02-27T17:27:27ZopenSUSE Project Management Tool
Redmine openQA Project - action #156169 (New): Automatically validate ay-openqa-worker.xml.erbhttps://progress.opensuse.org/issues/1561692024-02-27T17:27:27Zybonatakisioannis.bonatakis@suse.com
<p><a href="https://github.com/os-autoinst/openQA/blob/master/contrib/ay-openqa-worker.xml.erb" class="external">https://github.com/os-autoinst/openQA/blob/master/contrib/ay-openqa-worker.xml.erb</a> is used for openqa workers. <br>
This is passed as-is to the autoyast and seems to work.<br>
But when generate the xml manual</p>
<ul>
<li>Either TW or Leap</li>
<li>install autoyast2</li>
<li>Run <code>sudo yast2 autoyast check-profile filename=ay-openqa-worker.xml.erb output=result.xml run-scripts=true run-erb=true</code></li>
</ul>
<p>Then complains for:</p>
<ul>
<li>firewall configuration</li>
<li>failure to run the scripts</li>
</ul>
<p>I am not sure about the requirements about the later, but firewall should be easy to address.</p>
openQA Tests - action #135218 (In Progress): test regression in ww4_await_pxe_installhttps://progress.opensuse.org/issues/1352182023-09-06T06:44:43Zybonatakisioannis.bonatakis@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<p>Tries to boot before the server gets ready and process fails.</p>
<p>openQA test in scenario sle-15-SP6-Online-x86_64-hpc_ww4_compute0@64bit fails in<br>
<a href="https://openqa.suse.de/tests/12014546/modules/ww4_await_pxe_install/steps/7" class="external">ww4_await_pxe_install</a></p>
<a name="Test-suite-description"></a>
<h2 >Test suite description<a href="#Test-suite-description" class="wiki-anchor">¶</a></h2>
<p>first compute node for warewulf booting from network (PXE) after the controller is ready for connection.</p>
<a name="Reproducible"></a>
<h2 >Reproducible<a href="#Reproducible" class="wiki-anchor">¶</a></h2>
<p>Fails since (at least) Build <a href="https://openqa.suse.de/tests/11979509" class="external">16.1</a></p>
<a name="Expected-result"></a>
<h2 >Expected result<a href="#Expected-result" class="wiki-anchor">¶</a></h2>
<p>Last good: (unknown) (or more recent)</p>
<a name="Further-details"></a>
<h2 >Further details<a href="#Further-details" class="wiki-anchor">¶</a></h2>
<p>Always latest result in this scenario: <a href="https://openqa.suse.de/tests/latest?arch=x86_64&distri=sle&flavor=Online&machine=64bit&test=hpc_ww4_compute0&version=15-SP6" class="external">latest</a></p>
openQA Project - action #129172 (Resolved): PXEBOOT doesnt set bootindex on aarch64https://progress.opensuse.org/issues/1291722023-05-12T06:30:55Zybonatakisioannis.bonatakis@suse.com
<p>Digging into the qemu backend seems that the bits to boot a machine (aarch4) from network is missing.</p>
<ul>
<li>first <code>BOOTFROM</code> doesnt support n options
<a href="https://github.com/os-autoinst/os-autoinst/blob/master/backend/qemu.pm#L663">https://github.com/os-autoinst/os-autoinst/blob/master/backend/qemu.pm#L663</a></li>
<li><code>PXEBOOT</code> doesnt set the booting order at all
<a href="https://github.com/os-autoinst/os-autoinst/blob/master/backend/qemu.pm#L713">https://github.com/os-autoinst/os-autoinst/blob/master/backend/qemu.pm#L713</a>
<a href="https://github.com/os-autoinst/os-autoinst/blob/master/backend/qemu.pm#L892">https://github.com/os-autoinst/os-autoinst/blob/master/backend/qemu.pm#L892</a></li>
</ul>
<p>the os-autoinst output looks like that</p>
<pre><code>/usr/bin/qemu-system-aarch64 -device virtio-gpu-pci,edid=on,xres=1024,yres=768 -chardev ringbuf,id=serial0,logfile=serial0,logappend=on -serial chardev:serial0 -audiodev none,id=snd0 -device intel-hda -device hda-output,audiodev=snd0 -m 4096 -machine virt,usb=off,gic-version=3,its=off -cpu host -netdev tap,id=qanet0,ifname=tap2,script=no,downscript=no -device virtio-net,netdev=qanet0,mac=52:54:00:12:05:02 -object rng-random,filename=/dev/urandom,id=rng0 -device virtio-rng-pci,rng=rng0 -device nec-usb-xhci -device usb-tablet -device usb-kbd -smp 1 -enable-kvm -no-shutdown -vnc :93,share=force-shared -device virtio-serial -chardev pipe,id=virtio_console,path=virtio_console,logfile=virtio_console.log,logappend=on -device virtconsole,chardev=virtio_console,name=org.openqa.console.virtio_console -chardev pipe,id=virtio_console_user,path=virtio_console_user,logfile=virtio_console_user.log,logappend=on -device virtconsole,chardev=virtio_console_user,name=org.openqa.console.virtio_console_user -chardev socket,path=qmp_socket,server=on,wait=off,id=qmp_socket,logfile=qmp_socket.log,logappend=on -qmp chardev:qmp_socket -S -device virtio-scsi-pci,id=scsi0 -blockdev driver=file,node-name=hd0-file,filename=/var/lib/openqa/pool/3/raid/hd0,cache.no-flush=on -blockdev driver=qcow2,node-name=hd0,file=hd0-file,cache.no-flush=on,discard=unmap -device virtio-blk-device,id=hd0-device,drive=hd0,serial=hd0 -blockdev driver=file,node-name=cd0-overlay0-file,filename=/var/lib/openqa/pool/3/raid/cd0-overlay0,cache.no-flush=on -blockdev driver=qcow2,node-name=cd0-overlay0,file=cd0-overlay0-file,cache.no-flush=on,discard=unmap -device scsi-cd,id=cd0-device,drive=cd0-overlay0,serial=cd0 -drive id=pflash-code-overlay0,if=pflash,file=/var/lib/openqa/pool/3/raid/pflash-code-overlay0,unit=0,readonly=on -drive id=pflash-vars-overlay0,if=pflash,file=/var/lib/openqa/pool/3/raid/pflash-vars-overlay0,unit=1
</code></pre>
<p>where bootindex is missing</p>
openQA Tests - action #128594 (New): test fails in spack_masterhttps://progress.opensuse.org/issues/1285942023-05-03T12:50:53Zybonatakisioannis.bonatakis@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<p>This is related to LD_LIBRARY_PATH again similar to bsc#1208751 [1].</p>
<p>To solve the problem back then, we replace the source file which we compile and run with a simple one which it doesnt have dependency on boost library.<br>
As such we had only to load mpich. SLE15SP5 seems with this. However the SLE15SP3 doesnt accept this change. <br>
The spack versions are quite the same<br>
SLE15SP3 -> 0.19.1-150300.5.16.1<br>
SLE15SP5 -> 0.19.1-150400.12.5.1</p>
<p>The thing is that despite the source file nothing else has change. the test fails in the same version which is used by previous jobs.</p>
<p>[1] <a href="https://bugzilla.suse.com/show_bug.cgi?id=1208751" class="external">https://bugzilla.suse.com/show_bug.cgi?id=1208751</a></p>
<p>openQA test in scenario sle-15-SP3-Server-DVD-HPC-Incidents-x86_64-hpc_BETA_mpich_spack_master@64bit-4gbram fails in<br>
<a href="https://openqa.suse.de/tests/11015548/modules/spack_master/steps/118" class="external">spack_master</a></p>
<a name="Test-suite-description"></a>
<h2 >Test suite description<a href="#Test-suite-description" class="wiki-anchor">¶</a></h2>
<p>Basic tests of mpich with CPU count=2. </p>
<a name="Reproducible"></a>
<h2 >Reproducible<a href="#Reproducible" class="wiki-anchor">¶</a></h2>
<p>Fails since (at least) Build <a href="https://openqa.suse.de/tests/10938144" class="external">:28369:hdf5</a></p>
<a name="Expected-result"></a>
<h2 >Expected result<a href="#Expected-result" class="wiki-anchor">¶</a></h2>
<p>Last good: <a href="https://openqa.suse.de/tests/10882077" class="external">:28369:hdf5</a> (or more recent)</p>
<a name="Further-details"></a>
<h2 >Further details<a href="#Further-details" class="wiki-anchor">¶</a></h2>
<p>Always latest result in this scenario: <a href="https://openqa.suse.de/tests/latest?arch=x86_64&distri=sle&flavor=Server-DVD-HPC-Incidents&machine=64bit-4gbram&test=hpc_BETA_mpich_spack_master&version=15-SP3" class="external">latest</a></p>
openQA Project - action #126032 (Resolved): iso posts do not start all the children chainhttps://progress.opensuse.org/issues/1260322023-03-15T04:34:31Zybonatakisioannis.bonatakis@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<p>Not all the child jobs are scheduled from the parent.</p>
<p>I noticed that in the last two build IIRC. For some reason that occurred in this specific test.<br>
The rest of the multijobs looks to start as expected.<br>
However a few examples i tried out reproduce the problem. </p>
<p><a href="#" onclick="$('#collapse-c15a1223-show, #collapse-c15a1223-hide').toggle(); $('#collapse-c15a1223').fadeToggle(150);; return false;" id="collapse-c15a1223-show" class="icon icon-collapsed collapsible">iso posts example with _SKIP_CHAINED_DEPS</a><a href="#" onclick="$('#collapse-c15a1223-show, #collapse-c15a1223-hide').toggle(); $('#collapse-c15a1223').fadeToggle(150);; return false;" id="collapse-c15a1223-hide" class="icon icon-expanded collapsible" style="display:none;">iso posts example with _SKIP_CHAINED_DEPS</a><div id="collapse-c15a1223" class="collapsed-text" style="display:none;"><p>openqa-cli api --pretty --osd -X POST isos ISO=SLE-15-SP5-Online-aarch64-Build80.1-Media1.iso DISTRI=sle VERSION=15-SP5 FLAVOR=Online ARCH=aarch64 BUILD=80.1 TEST=hpc_BETA_mpich_mpi_supportserver _GRPOUP_ID=130<br>
{<br>
"count" : 2,<br>
"failed" : [],<br>
"ids" : [<br>
10698182,<br>
10698183<br>
],<br>
"scheduled_product_id" : 1768408<br>
}</p>
</div></p>
<p><a href="#" onclick="$('#collapse-85d10afb-show, #collapse-85d10afb-hide').toggle(); $('#collapse-85d10afb').fadeToggle(150);; return false;" id="collapse-85d10afb-show" class="icon icon-collapsed collapsible">iso posts example with _SKIP_CHAINED_DEPS</a><a href="#" onclick="$('#collapse-85d10afb-show, #collapse-85d10afb-hide').toggle(); $('#collapse-85d10afb').fadeToggle(150);; return false;" id="collapse-85d10afb-hide" class="icon icon-expanded collapsible" style="display:none;">iso posts example with _SKIP_CHAINED_DEPS</a><div id="collapse-85d10afb" class="collapsed-text" style="display:none;"><p>openqa-cli api --pretty --osd -X POST isos ISO=SLE-15-SP5-Online-aarch64-Build80.1-Media1.iso DISTRI=sle VERSION=15-SP5 FLAVOR=Online ARCH=aarch64 BUILD=80.1 TEST=hpc_BETA_mvapich2_mpi_supportserver _GRPOUP_ID=130 _SKIP_CHAINED_DEPS=1<br>
{<br>
"count" : 1,<br>
"failed" : [],<br>
"ids" : [<br>
10698186<br>
],<br>
"scheduled_product_id" : 1768410</p>
</div></p>
<p>All children has <code>PARALLEL_WITH=hpc_BETA_mpich_mpi_supportserver</code> on Test Suites</p>
<p>openQA test in scenario sle-15-SP5-Online-x86_64-hpc_BETA_mpich_mpi_supportserver@64bit fails in<br>
<a href="https://openqa.suse.de/tests/10691528/modules/wait_children/steps/7" class="external">wait_children</a></p>
<a name="Test-suite-description"></a>
<h2 >Test suite description<a href="#Test-suite-description" class="wiki-anchor">¶</a></h2>
<p>Basic tests of mpich with CPU count=2. Maintainer: schlad <a href="mailto:schlad@suse.de">schlad@suse.de</a></p>
<a name="Reproducible"></a>
<h2 >Reproducible<a href="#Reproducible" class="wiki-anchor">¶</a></h2>
<p>Fails since (at least) Build <a href="https://openqa.suse.de/tests/10691528" class="external">80.1</a> (current job)</p>
<a name="Expected-result"></a>
<h2 >Expected result<a href="#Expected-result" class="wiki-anchor">¶</a></h2>
<p><a href="#" onclick="$('#collapse-08d69346-show, #collapse-08d69346-hide').toggle(); $('#collapse-08d69346').fadeToggle(150);; return false;" id="collapse-08d69346-show" class="icon icon-collapsed collapsible">should look like</a><a href="#" onclick="$('#collapse-08d69346-show, #collapse-08d69346-hide').toggle(); $('#collapse-08d69346').fadeToggle(150);; return false;" id="collapse-08d69346-hide" class="icon icon-expanded collapsible" style="display:none;">should look like</a><div id="collapse-08d69346" class="collapsed-text" style="display:none;"><p>openqa-cli api --pretty --osd -X POST isos ISO=SLE-15-SP5-Online-x86_64-Build80.1-Media1.iso DISTRI=sle VERSION=15-SP5 FLAVOR=Online ARCH=x86_64 BUILD=80.1 TEST=hpc_BETA_mpich_mpi_supportserver,hpc_BETA_mpich_mpi_slave01,hpc_BETA_mpich_mpi_slave00,hpc_BETA_mpich_mpi_master _GRPOUP_ID=130 _SKIP_CHAINED_DEPS=1<br>
{<br>
"count" : 4,<br>
"failed" : [],<br>
"ids" : [<br>
10698191,<br>
10698192,<br>
10698193,<br>
10698194<br>
],<br>
"scheduled_product_id" : 1768412<br>
}</p>
</div><br>
Last good: <a href="https://openqa.suse.de/tests/10687724" class="external">80.1</a> (or more recent)</p>
<a name="Further-details"></a>
<h2 >Further details<a href="#Further-details" class="wiki-anchor">¶</a></h2>
<p>Always latest result in this scenario: <a href="https://openqa.suse.de/tests/latest?arch=x86_64&distri=sle&flavor=Online&machine=64bit&test=hpc_BETA_mpich_mpi_supportserver&version=15-SP5" class="external">latest</a></p>
openQA Tests - action #120900 (New): Required patterns values are not removed on select_patterns https://progress.opensuse.org/issues/1209002022-11-23T17:38:39Zybonatakisioannis.bonatakis@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<p>I am not sure if the title is accurate. What actually happens can be seen on the attached screen <a href="https://openqa.suse.de/tests/9825401/modules/select_patterns/steps/124" class="external">select_patterns</a><br>
The specific job set <code>PATTERNS": "base,minimal,apparmor"</code> but at the end only minimal and apparmor are selected. The problem is that another pattern is deselected after the base is checked.<br>
Due to the logic of the <code>select_specific_patterns_by_iteration</code> function, which iterates from top to bottom, it can happen a previous pattern to alter as a dependency of another selection/deselection on a later checkbox.</p>
<p>Also i found the code a bit complicated. For starters there is two different approaches to uncheck patterns</p>
<ol>
<li><code>$self->deselect_pattern() if get_var('EXCLUDE_PATTERNS');</code></li>
<li>inside <code>select_specific_patterns_by_iteration</code> using minus notation on PATTERNS.</li>
</ol>
<p>I think this is unnecessarily complexity<br>
Also i believe that the <code>process_patterns</code> should take care and acts of <em>defaults</em> PATTERNS. As now the logic is inside <code>select_specific_patterns_by_iteration</code></p>
<p>This ticket to improve and improve in all the above. I dont think this is high priority -nor even normal- as it doesnt have destructive impact on the most of the test cases but hides <strong>a seriously defect which produces an unexpected installation than the one was set it up for</strong>.</p>
<a name="Reproducible"></a>
<h2 >Reproducible<a href="#Reproducible" class="wiki-anchor">¶</a></h2>
<p>To reproduce you can set <code>PATTERNS": "base,minimal,apparmor"</code> and choose a job which by default has <code>GNOME</code> checked. At the end of the <code>select_specific_patterns_by_iteration</code> review the <em>installation settings</em> window before the installation</p>
<a name="Expected-result"></a>
<h2 >Expected result<a href="#Expected-result" class="wiki-anchor">¶</a></h2>
<p>PATTERNS values are ensured are selected at the end of the <code>process_patterns</code></p>
openQA Tests - action #118663 (New): [qe-core] wait_serial reports fail on serial_terminal when i...https://progress.opensuse.org/issues/1186632022-10-13T13:38:34Zybonatakisioannis.bonatakis@suse.com
<a name="Description"></a>
<h1 >Description<a href="#Description" class="wiki-anchor">¶</a></h1>
<p>The problem appears in a scenario where you run a container and launch its interactive console and the module itself uses serial_terminal</p>
<p><a href="#" onclick="$('#collapse-01d53588-show, #collapse-01d53588-hide').toggle(); $('#collapse-01d53588').fadeToggle(150);; return false;" id="collapse-01d53588-show" class="icon icon-collapsed collapsible">autoinst-log.txt...</a><a href="#" onclick="$('#collapse-01d53588-show, #collapse-01d53588-hide').toggle(); $('#collapse-01d53588').fadeToggle(150);; return false;" id="collapse-01d53588-hide" class="icon icon-expanded collapsible" style="display:none;">autoinst-log.txt...</a><div id="collapse-01d53588" class="collapsed-text" style="display:none;"><pre><code>[2022-10-13T15:17:01.951831+02:00] [debug] tests/containers/apptainer.pm:50 called testapi::validate_script_output
[2022-10-13T15:17:01.952227+02:00] [debug] <<< testapi::wait_serial(timeout=90, buffer_size=undef, no_regex=1, expect_not_found=0, regexp="# ", quiet=1, record_output=undef)
[2022-10-13T15:17:01.953435+02:00] [debug] <<< consoles::serial_screen::read_until(timeout=90, no_regex=1, record_output=undef, buffer_size=undef, pattern=[
"# "
], regexp="# ", expect_not_found=0, quiet=1, json_cmd_token="stLkEWrr", cmd="backend_wait_serial")
[2022-10-13T15:18:32.041417+02:00] [debug] >>> testapi::wait_serial: # : fail
[2022-10-13T15:18:32.041731+02:00] [debug] tests/containers/apptainer.pm:50 called testapi::validate_script_output
[2022-10-13T15:18:32.041945+02:00] [debug] <<< distribution::script_output("Content of /tmp/script_uow4.sh :\n \"cat > /tmp/script_uow4.sh << 'EOT__uow4'; echo _uow4-\$?-\" \n")
[2022-10-13T15:18:32.043257+02:00] [debug] <<< consoles::serial_screen::type_string(text="cat > /tmp/script_uow4.sh << 'EOT__uow4'; echo _uow4-\$?-\n", json_cmd_token="kCaIjsnI", cmd="backend_type_string")
[2022-10-13T15:18:32.044126+02:00] [debug] tests/containers/apptainer.pm:50 called testapi::validate_script_output
[2022-10-13T15:18:32.044536+02:00] [debug] <<< testapi::wait_serial(quiet=1, record_output=undef, timeout=90, buffer_size=undef, expect_not_found=0, regexp="cat > /tmp/script_uow4.sh << 'EOT__uow4'; echo _uow4-\$?-", no_regex=1)
[2022-10-13T15:18:32.045981+02:00] [debug] <<< consoles::serial_screen::read_until(record_output=undef, timeout=90, no_regex=1, quiet=1, json_cmd_token="KTIlmTxF", cmd="backend_wait_serial", pattern=[
"cat > /tmp/script_uow4.sh << 'EOT__uow4'; echo _uow4-\$?-"
], buffer_size=undef, regexp="cat > /tmp/script_uow4.sh << 'EOT__uow4'; echo _uow4-\$?-", expect_not_found=0)
[2022-10-13T15:18:32.046286+02:00] [info] ::: consoles::serial_screen::read_until: Matched output from SUT in 2 loops & 0.000667732208967209 seconds: cat > /tmp/script_uow4.sh << 'EOT__uow4'; echo _uow4-$?-
[2022-10-13T15:18:32.047033+02:00] [debug] >>> testapi::wait_serial: cat > /tmp/script_uow4.sh << 'EOT__uow4'; echo _uow4-$?-: ok
[2022-10-13T15:18:32.047315+02:00] [debug] tests/containers/apptainer.pm:50 called testapi::validate_script_output
[2022-10-13T15:18:32.047697+02:00] [debug] <<< testapi::wait_serial(record_output=undef, quiet=1, no_regex=1, expect_not_found=0, regexp="> ", buffer_size=undef, timeout=90)
[2022-10-13T15:18:32.049043+02:00] [debug] <<< consoles::serial_screen::read_until(no_regex=1, timeout=90, record_output=undef, expect_not_found=0, regexp="> ", buffer_size=undef, pattern=[
"> "
], cmd="backend_wait_serial", json_cmd_token="TLLLRDQk", quiet=1)
[2022-10-13T15:18:32.049266+02:00] [info] ::: consoles::serial_screen::read_until: Matched output from SUT in 1 loops & 0.000562381930649281 seconds: >
[2022-10-13T15:18:32.049970+02:00] [debug] >>> testapi::wait_serial: > : ok
[2022-10-13T15:18:32.050915+02:00] [debug] <<< consoles::serial_screen::type_string(text="cat /etc/os-release && echo testdone\nEOT__uow4\n", cmd="backend_type_string", json_cmd_token="KTMInCwi")
[2022-10-13T15:18:32.051701+02:00] [debug] tests/containers/apptainer.pm:50 called testapi::validate_script_output
[2022-10-13T15:18:32.052216+02:00] [debug] <<< testapi::wait_serial(regexp="> EOT__uow4", expect_not_found=0, no_regex=1, timeout=90, buffer_size=undef, record_output=undef, quiet=1)
[2022-10-13T15:18:32.053458+02:00] [debug] <<< consoles::serial_screen::read_until(json_cmd_token="azrXZuje", cmd="backend_wait_serial", quiet=1, expect_not_found=0, regexp="> EOT__uow4", pattern=[
"> EOT__uow4"
], buffer_size=undef, record_output=undef, no_regex=1, timeout=90)
[2022-10-13T15:18:32.053728+02:00] [info] ::: consoles::serial_screen::read_until: Matched output from SUT in 2 loops & 0.000609030947089195 seconds: > EOT__uow4
[2022-10-13T15:18:32.054428+02:00] [debug] >>> testapi::wait_serial: > EOT__uow4: ok
[2022-10-13T15:18:32.054704+02:00] [debug] tests/containers/apptainer.pm:50 called testapi::validate_script_output
[2022-10-13T15:18:32.055060+02:00] [debug] <<< testapi::wait_serial(record_output=undef, quiet=1, no_regex=0, regexp="_uow4-0-", expect_not_found=0, buffer_size=undef, timeout=90)
[2022-10-13T15:18:32.056388+02:00] [debug] <<< consoles::serial_screen::read_until(timeout=90, no_regex=0, record_output=undef, pattern="_uow4-0-", buffer_size=undef, regexp="_uow4-0-", expect_not_found=0, quiet=1, cmd="backend_wait_serial", json_cmd_token="aCMENdVw")
[2022-10-13T15:18:32.058856+02:00] [info] ::: consoles::serial_screen::read_until: Matched output from SUT in 2 loops & 0.00273504760116339 seconds: _uow4-0-
[2022-10-13T15:18:32.059562+02:00] [debug] >>> testapi::wait_serial: _uow4-0-: ok
[2022-10-13T15:18:32.059848+02:00] [debug] tests/containers/apptainer.pm:50 called testapi::validate_script_output
[2022-10-13T15:18:32.060210+02:00] [debug] <<< testapi::wait_serial(expect_not_found=0, regexp="# ", no_regex=1, buffer_size=undef, timeout=90, record_output=undef, quiet=1)
[2022-10-13T15:18:32.061421+02:00] [debug] <<< consoles::serial_screen::read_until(buffer_size=undef, pattern=[
"# "
], regexp="# ", expect_not_found=0, quiet=1, cmd="backend_wait_serial", json_cmd_token="HkXhwxdk", timeout=90, no_regex=1, record_output=undef)
[2022-10-13T15:20:02.113308+02:00] [debug] >>> testapi::wait_serial: # : fail
[2022-10-13T15:20:02.114530+02:00] [debug] <<< consoles::serial_screen::type_string(json_cmd_token="ZmelFHja", cmd="backend_type_string", text="echo _uow4; bash -oe pipefail /tmp/script_uow4.sh ; echo SCRIPT_FINISHED_uow4-\$?-\n")
[2022-10-13T15:20:02.115298+02:00] [debug] tests/containers/apptainer.pm:50 called testapi::validate_script_output
[2022-10-13T15:20:02.115764+02:00] [debug] <<< testapi::wait_serial(record_output=undef, quiet=1, no_regex=1, expect_not_found=0, regexp="echo _uow4; bash -oe pipefail /tmp/script_uow4.sh ; echo SCRIPT_FINISHED_uow4-\$?-", buffer_size=undef, timeout=90)
[2022-10-13T15:20:02.117013+02:00] [debug] <<< consoles::serial_screen::read_until(buffer_size=undef, pattern=[
"echo _uow4; bash -oe pipefail /tmp/script_uow4.sh ; echo SCRIPT_FINISHED_uow4-\$?-"
], expect_not_found=0, regexp="echo _uow4; bash -oe pipefail /tmp/script_uow4.sh ; echo SCRIPT_FINISHED_uow4-\$?-", quiet=1, cmd="backend_wait_serial", json_cmd_token="HGtqBdRI", timeout=90, no_regex=1, record_output=undef)
[2022-10-13T15:20:02.117351+02:00] [info] ::: consoles::serial_screen::read_until: Matched output from SUT in 2 loops & 0.00069341529160738 seconds: echo _uow4; bash -oe pipefail /tmp/script_uow4.sh ; echo SCRIPT_FINISHED_uow4-$?-
[2022-10-13T15:20:02.118108+02:00] [debug] >>> testapi::wait_serial: echo _uow4; bash -oe pipefail /tmp/script_uow4.sh ; echo SCRIPT_FINISHED_uow4-$?-: ok
[2022-10-13T15:20:02.118391+02:00] [debug] tests/containers/apptainer.pm:50 called testapi::validate_script_output
[2022-10-13T15:20:02.118743+02:00] [debug] <<< testapi::wait_serial(expect_not_found=0, regexp="SCRIPT_FINISHED_uow4-\\d+-", no_regex=0, buffer_size=undef, timeout=90, record_output=1, quiet=1)
[2022-10-13T15:20:02.120085+02:00] [debug] <<< consoles::serial_screen::read_until(no_regex=0, timeout=90, record_output=1, expect_not_found=0, regexp="SCRIPT_FINISHED_uow4-\\d+-", buffer_size=undef, pattern="SCRIPT_FINISHED_uow4-\\d+-", cmd="backend_wait_serial", json_cmd_token="uwWUVyYC", quiet=1)
[2022-10-13T15:20:02.125432+02:00] [info] ::: consoles::serial_screen::read_until: Matched output from SUT in 8 loops & 0.00561576243489981 seconds: SCRIPT_FINISHED_uow4-0-
[2022-10-13T15:20:02.126191+02:00] [debug] >>> testapi::wait_serial: SCRIPT_FINISHED_uow4-\d+-: ok
</code></pre></div></p>
<p>I tried to play with <code>set_serial_prompt</code> and <code>set_standard_prompt</code> functions without actual solve the problem.<br>
This isnt shown up when <code>select_console</code> is used</p>
<p>The failure comes from (script_output of distribution.pm)</p>
<pre><code>testapi::wait_serial($self->{serial_term_prompt}, no_regex => 1, quiet => $args{quiet});
</code></pre>
<p>Expected:<br>
wait_serial returns <em>ok</em> or ignore <code>testapi::wait_serial</code> if is an interactive terminal</p>
<p>Additional:<br>
I think that this is low priority because doesnt make test fail or anything, but it would be nice to fix it and improve the code as it causes some additional time of the test execution without any good reason</p>
qe-yam - action #113725 (Rejected): Make autoyast tests exit when the autoyast profile encounter ...https://progress.opensuse.org/issues/1137252022-07-18T13:35:03Zybonatakisioannis.bonatakis@suse.com
<a name="observation"></a>
<h1 >observation<a href="#observation" class="wiki-anchor">¶</a></h1>
<p>What happens is that when the autoyast profile is wrong or not found, the tests keep trying to match needles and continue.</p>
<p>One such case is [0]. The problem appears something like 5 minutes after it starts but keeps running for 2h where the job time limits are reached.<br>
Because this cause a slow feedback loop of the tests, in combination of the resources which are occupied without any good reason i would like to propose a solution.</p>
<p>the easier would be to match a needle with the popup and abort. <br>
I dont know if there is already a module for it but we could run a check on the xml profile before installation module. I am not sure but i think there was something implemented for this specific reason. </p>
<p>To reproduce:<br>
you can use <a href="https://gist.github.com/b10n1k/bca22757c5e4eb11473ceda30820f6d2" class="external">https://gist.github.com/b10n1k/bca22757c5e4eb11473ceda30820f6d2</a> as <u>profile.xml</u></p>
<p>Actual results:<br>
<a href="http://aquarius.suse.cz/tests/11324#step/installation/2" class="external">http://aquarius.suse.cz/tests/11324#step/installation/2</a></p>
<p>Expected:<br>
i am not sure about that. should that be one of the following:</p>
<ul>
<li>Exit after some rational time when the error exists.</li>
<li>terminate needle checking after some expected time??
the while loop have specific expectations during installation. When those expectation do not ever match, the loop run for even.</li>
<li>Validate xml before installation and exit then if problem appears</li>
</ul>
<p>[0] <a href="http://aquarius.suse.cz/tests/11324" class="external">http://aquarius.suse.cz/tests/11324</a></p>
openQA auto review - openqa-force-result #112442 (Resolved): auto_review:"'systemctl --no-pager s...https://progress.opensuse.org/issues/1124422022-06-14T19:17:25Zybonatakisioannis.bonatakis@suse.com
<p>Enable auto-review for slurm jobs reporting on <a href="https://progress.opensuse.org/issues/112436" class="external">https://progress.opensuse.org/issues/112436</a> #112436</p>
openQA Project - action #109292 (Resolved): OSD is missing x86_64 jobs duplicate key value violat...https://progress.opensuse.org/issues/1092922022-03-31T07:57:01Zybonatakisioannis.bonatakis@suse.com
<p>With the last two (117.1,118.3) or three builds x86_64 jobs are missing.</p>
<p>The very first time there was a dependency circle issue with one of the job group yaml. That found to prevent the scheduling. However the jobs are keep missing even after the correction and the scheduling looks to work without problem after manual intervention.</p>
openQA Tests - action #108581 (Resolved): False softfails in scc_registrationhttps://progress.opensuse.org/issues/1085812022-03-19T08:35:17Zybonatakisioannis.bonatakis@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<p>There are softfails in the installation and particular on ssc_registration module. However i havent reproduce the softfail in most of the cases. </p>
<p>From the bug[0] context, it is not clear whether this is an actual product issue. <br>
I havent tested it outside of the openqa env yet but many of the tests i run do not show the problem except some particular jobs(sles+sdk+proxy_SCC_via_YaST) which YaST team have some peculiarities, and i think this might be a test issue or env setup. </p>
<p>Anyhow the softfail should be ignored in the cases which it just work as it reports misleading results for the product in the whole. </p>
<a name="Reproducible"></a>
<h2 >Reproducible<a href="#Reproducible" class="wiki-anchor">¶</a></h2>
<p>Every build after <a href="https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/14342" class="external">https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/14342</a></p>
<a name="Expected-result"></a>
<h2 >Expected result<a href="#Expected-result" class="wiki-anchor">¶</a></h2>
<p>Softfail raised after checking of the problem<br>
or restrict the softfail only for the jobs which seems to affect the most</p>
<a name="Further-details"></a>
<h2 >Further details<a href="#Further-details" class="wiki-anchor">¶</a></h2>
<p>[0] <a href="https://bugzilla.suse.com/show_bug.cgi?id=1191112" class="external">https://bugzilla.suse.com/show_bug.cgi?id=1191112</a></p>
openQA Project - action #107878 (Resolved): number of failed job provides wrong value on the buil...https://progress.opensuse.org/issues/1078782022-03-04T11:13:23Zybonatakisioannis.bonatakis@suse.com
<p><a href="https://openqa.suse.de/group_overview/130" class="external">https://openqa.suse.de/group_overview/130</a> display 2 failed jobs for build101.1.</p>
<p>if you press on the failed area of the bar, the <code>Test result overview</code> shows only one failed job. Cleaning all the filters i see that the only other job which was not successful is one failed as <code>timeout_exceeded</code>. </p>
<p>So either the <u>failed area</u> should enabled the <code>Timeout exceeded</code>(or any other flag) or it bar should point out only the actual failed jobs.</p>
openQA Tests - action #107395 (Resolved): [HPC] zypper returns error code 107 in slurm_masterhttps://progress.opensuse.org/issues/1073952022-02-23T19:04:58Zybonatakisioannis.bonatakis@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<pre><code>(4/6) Installing: twopence-0.4.2-3.d_t.22.x86_64 [.
warning: /var/cache/zypp/packages/devel_tools/x86_64/twopence-0.4.2-3.d_t.22.x86_64.rpm: Header V3 RSA/SHA256 Signature, key ID 498d5a23: NOKEY
.....
/usr/lib/tmpfiles.d/net-snmp.conf:1: Line references path below legacy directory /var/run/, updating /var/run/net-snmp → /run/net-snmp; please update the tmpfiles.d/ drop-in file accordingly.
/usr/lib/tmpfiles.d/systemd.conf:19: Failed to resolve user 'systemd-network': No such process
/usr/lib/tmpfiles.d/systemd.conf:20: Failed to resolve user 'systemd-network': No such process
/usr/lib/tmpfiles.d/systemd.conf:21: Failed to resolve user 'systemd-network': No such process
/usr/lib/tmpfiles.d/systemd.conf:22: Failed to resolve user 'systemd-network': No such process
warning: %post(twopence-0.4.2-3.d_t.22.x86_64) scriptlet failed, exit status 65
done]
(5/6) Installing: libtwopence0-0.4.2-3.d_t.22.x86_64 [.
warning: /var/cache/zypp/packages/devel_tools/x86_64/libtwopence0-0.4.2-3.d_t.22.x86_64.rpm: Header V3 RSA/SHA256 Signature, key ID 498d5a23: NOKEY
......done]
(6/6) Installing: twopence-shell-client-0.4.2-3.d_t.22.x86_64 [.
warning: /var/cache/zypp/packages/devel_tools/x86_64/twopence-shell-client-0.4.2-3.d_t.22.x86_64.rpm: Header V3 RSA/SHA256 Signature, key ID 498d5a23: NOKEY
.......done]
n3BEE-107-
</code></pre>
<p>107 - ZYPPER_EXIT_INF_RPM_SCRIPT_FAILED means "Installation basically succeeded, but some of the packages %post install scripts returned an error"</p>
<p>openQA test in scenario sle-15-SP4-Online-x86_64-hpc_EPSILON_slurm_master@64bit fails in<br>
<a href="https://openqa.suse.de/tests/8218862/modules/slurm_master/steps/162" class="external">slurm_master</a></p>
<a name="Test-suite-description"></a>
<h2 >Test suite description<a href="#Test-suite-description" class="wiki-anchor">¶</a></h2>
<p>HPC cluster for experimental and fast-moving external tests. maintainer: <a href="mailto:schlad@suse.de">schlad@suse.de</a></p>
<a name="Reproducible"></a>
<h2 >Reproducible<a href="#Reproducible" class="wiki-anchor">¶</a></h2>
<p>Fails since (at least) Build <a href="https://openqa.suse.de/tests/8205633" class="external">99.1</a></p>
<a name="Expected-result"></a>
<h2 >Expected result<a href="#Expected-result" class="wiki-anchor">¶</a></h2>
<p>Last good: <a href="https://openqa.suse.de/tests/8181333" class="external">98.1</a> (or more recent)</p>
<a name="Further-details"></a>
<h2 >Further details<a href="#Further-details" class="wiki-anchor">¶</a></h2>
<p>Always latest result in this scenario: <a href="https://openqa.suse.de/tests/latest?arch=x86_64&distri=sle&flavor=Online&machine=64bit&test=hpc_EPSILON_slurm_master&version=15-SP4" class="external">latest</a></p>
openQA Project - action #98577 (Resolved): Unknown ARRAY( variables matching HDD_1 or ISO in job ...https://progress.opensuse.org/issues/985772021-09-14T07:08:55Zybonatakisioannis.bonatakis@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<ul>
<li>For instance <a href="https://openqa.suse.de/tests/7092033/file/vars.json" class="external">https://openqa.suse.de/tests/7092033/file/vars.json</a> has <code>"ARRAY(0x55bb116ac3a0)" : "SLE-15-SP3-Full-x86_64-GM-Media1.iso"</code> which is repeated and assigned correctly on <code>ISO</code>.</li>
<li>Another one on publiccloud <a href="https://openqa.suse.de/tests/7096497/file/vars.json" class="external">https://openqa.suse.de/tests/7096497/file/vars.json</a> has <code>"ARRAY(0x55b5694714b8)" : "publiccloud_15sp3_Azure_BYOS_Updates.qcow2"</code> which is what <code>HDD_1</code> represents.</li>
</ul>
<p>i havent noticed any destruction or impact on the test so far but i havent also found where this comes from.</p>
<a name="Acceptance-criteria"></a>
<h2 >Acceptance criteria<a href="#Acceptance-criteria" class="wiki-anchor">¶</a></h2>
<ul>
<li><strong>AC1</strong>: No variables based on stringified array types present in job settings</li>
</ul>
<a name="Suggestions"></a>
<h2 >Suggestions<a href="#Suggestions" class="wiki-anchor">¶</a></h2>
<ul>
<li>Add unit test validating variable names (assuming this is a bug in openQA or os-autoinst)</li>
<li>Mark jobs with variable names containing <code>(</code> as incomplete (assuming this is a bug in another tool)</li>
</ul>
openQA Project - action #98388 (Resolved): Non-existing asset "uefi-vars" is still shown up on #d...https://progress.opensuse.org/issues/983882021-09-09T09:38:23Zybonatakisioannis.bonatakis@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<p>An example is <a href="https://openqa.suse.de/tests/6994628#downloads" class="external">https://openqa.suse.de/tests/6994628#downloads</a></p>
<p>When i clone the job it fails with</p>
<pre><code>downloading
http://openqa.suse.de/tests/6994628/asset/other/SLE-15-SP4-Online-x86_64-Build31.2-Media1.iso.sha256
to
/var/lib/openqa/factory/other/SLE-15-SP4-Online-x86_64-Build31.2-Media1.iso.sha256
downloading
http://openqa.suse.de/tests/6994628/asset/iso/SLE-15-SP4-Online-x86_64-Build31.2-Media1.iso
to
/var/lib/openqa/factory/iso/SLE-15-SP4-Online-x86_64-Build31.2-Media1.iso
downloading
http://openqa.suse.de/tests/6994628/asset/hdd/SLES-15-SP4-x86_64-Build31.2-containers.qcow2
to
/var/lib/openqa/factory/hdd/SLES-15-SP4-x86_64-Build31.2-containers.qcow2
downloading
http://openqa.suse.de/tests/6994628/asset/hdd/SLES-15-SP4-x86_64-Build31.2-containers-uefi-vars.qcow2
to
/var/lib/openqa/factory/hdd/SLES-15-SP4-x86_64-Build31.2-containers-uefi-vars.qcow2
6994628 failed: 404 Not Found
</code></pre>
<p><a href="https://openqa.suse.de/tests/6994628/asset/hdd/SLES-15-SP4-x86_64-Build31.2-containers-uefi-vars.qcow2" class="external">uefi-vars qcow2</a> seems that it is not available any more and if you try to get this file you get 404.</p>
<p>Expected:</p>
<ul>
<li>ui should not show non-available assets</li>
</ul>
<a name="Steps-to-reproduce"></a>
<h2 >Steps to reproduce<a href="#Steps-to-reproduce" class="wiki-anchor">¶</a></h2>
<p>Clone the job in the description, it returns a 404 error for an asset still shown in the UI page</p>
<a name="Suggestions"></a>
<h2 >Suggestions<a href="#Suggestions" class="wiki-anchor">¶</a></h2>
<ul>
<li>Investigate why is it still shown in the UI page if the cleanup removes it</li>
<li>Investigate why this is removed (if it is from the cleanup script) but not the rest</li>
<li>Fix UI so that it doesn't show non-available assets</li>
<li>(optional) i wonder if the cleanup can be more clever and keep the relevant assets</li>
</ul>