openSUSE Project Management Tool: Issueshttps://progress.opensuse.org/https://progress.opensuse.org/themes/openSUSE/favicon/favicon.ico?15829177842022-03-29T14:11:54ZopenSUSE Project Management Tool
Redmine openQA Project - action #109190 (New): Invalid reusage of VLAN-Tag in multi-machine scenario, whe...https://progress.opensuse.org/issues/1091902022-03-29T14:11:54Zcfconradcfamullaconrad@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<ul>
<li>The job <a href="http://openqa-3.wicked.suse.de/tests/80283#step/before_test/23" class="external">http://openqa-3.wicked.suse.de/tests/80283#step/before_test/23</a> show a
<code>eth0: IPv4 duplicate address 10.0.2.11 detected (in use by 52:54:00:12:00:56)!</code> message</li>
<li>The mac 52:54:00:12:00:56 belongs to <a href="http://openqa-3.wicked.suse.de/admin/workers/86" class="external">http://openqa-3.wicked.suse.de/admin/workers/86</a> which runs
the job <a href="http://openqa-3.wicked.suse.de/tests/80207" class="external">http://openqa-3.wicked.suse.de/tests/80207</a> during that time.</li>
</ul>
<p>The job 80207 is a multi machine job and the parent is <a href="http://openqa-3.wicked.suse.de/tests/80206" class="external">http://openqa-3.wicked.suse.de/tests/80206</a>, which start at <code>2022-03-29T09:27:13.195681+02:00</code> and end at <code>2022-03-29T10:04:05.167885+02:00</code>, while job 80207 ends at <code>[2022-03-29T10:06:21.469947+02:00]</code>.</p>
<p>The failing job show the qemu command at: <code>[2022-03-29T10:04:26.913030+02:00] [debug] starting: /usr/bin/qemu-system-x86_64 -vga cirrus -only-migratable...</code>, thus<br>
it start a qemu instance with the same VLAN which is still used by job 80207.</p>
<p>Simple reproducer, create two parallel boot jobs</p>
<pre><code>id=$(openqa-cli api --host http://openqa-3.wicked.suse.de -X POST jobs 'ARCH=x86_64' 'DISTRI=opensuse' 'FLAVOR=CI' 'MACHINE=x86_64' 'VERSION=Tumbleweed' '_GROUP_ID=0' \
'BOOT_HDD_IMAGE=1' 'DESKTOP=textmode' 'HDD_1=tumbleweed.qcow2' 'KEEP_GRUB_TIMEOUT=1' \
'BACKEND=qemu' 'NICTYPE=tap' 'WORKER_CLASS=tap,qemu_x86_64' \
'SCHEDULE=tests/boot/boot_to_desktop' 'TEST=check_vlan_on_mm_job_parent' | jq -r '.id')
echo "PARENT_ID:$id"
openqa-cli api --host http://openqa-3.wicked.suse.de -X POST jobs 'ARCH=x86_64' 'DISTRI=opensuse' 'FLAVOR=CI' 'MACHINE=x86_64' 'VERSION=Tumbleweed' '_GROUP_ID=0' \
'BOOT_HDD_IMAGE=1' 'DESKTOP=textmode' 'HDD_1=tumbleweed.qcow2' 'KEEP_GRUB_TIMEOUT=1' \
'BACKEND=qemu' 'NICTYPE=tap' 'WORKER_CLASS=tap,qemu_x86_64' \
'SCHEDULE=tests/boot/boot_to_desktop' 'TEST=check_vlan_on_mm_job_child' \
"_PARALLEL_JOBS=$id"
</code></pre> openQA Tests - action #99816 (Rejected): EC2-BYOS-Updates LTP syscalls fails in fsconfig01 -- mkf...https://progress.opensuse.org/issues/998162021-10-06T07:50:42Zcfconradcfamullaconrad@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<p>openQA test in scenario sle-12-SP4-EC2-BYOS-Updates-x86_64-publiccloud_ltp_syscalls@64bit fails in<br>
<a href="https://openqa.suse.de/tests/7323209/modules/fsconfig01/steps/1" class="external">fsconfig01</a></p>
<a name="Test-suite-description"></a>
<h2 >Test suite description<a href="#Test-suite-description" class="wiki-anchor">¶</a></h2>
<a name="Reproducible"></a>
<h2 >Reproducible<a href="#Reproducible" class="wiki-anchor">¶</a></h2>
<p>Fails since (at least) Build <a href="https://openqa.suse.de/tests/7323209" class="external">20211006-1</a> (current job)</p>
<a name="Expected-result"></a>
<h2 >Expected result<a href="#Expected-result" class="wiki-anchor">¶</a></h2>
<p>Last good: <a href="https://openqa.suse.de/tests/7315414" class="external">20211005-1</a> (or more recent)</p>
<a name="Further-details"></a>
<h2 >Further details<a href="#Further-details" class="wiki-anchor">¶</a></h2>
<p>Always latest result in this scenario: <a href="https://openqa.suse.de/tests/latest?arch=x86_64&distri=sle&flavor=EC2-BYOS-Updates&machine=64bit&test=publiccloud_ltp_syscalls&version=12-SP4" class="external">latest</a></p>
openQA Tests - action #99765 (Rejected): test fails in rootless_podman - timeout duo check image ...https://progress.opensuse.org/issues/997652021-10-04T20:29:06Zcfconradcfamullaconrad@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<p>openQA test in scenario sle-15-SP2-Server-DVD-Updates-s390x-podman_tests@s390x-kvm-sle12 fails in<br>
<a href="https://openqa.suse.de/tests/7311624/modules/rootless_podman/steps/116" class="external">rootless_podman</a></p>
<p>The error is not really visible, as we use <code>script_run()</code> and this doesn't take care of timeout handling.</p>
<pre><code>Container executes:
options --rm -i --name refreshed-image registry.suse.com/suse/sle15:15.2 zypper -nv ref
</code></pre>
<a name="Test-suite-description"></a>
<h2 >Test suite description<a href="#Test-suite-description" class="wiki-anchor">¶</a></h2>
<p>The base test suite is used for job templates defined in YAML documents. It has no settings of its own.</p>
<a name="Reproducible"></a>
<h2 >Reproducible<a href="#Reproducible" class="wiki-anchor">¶</a></h2>
<p>Fails since (at least) Build <a href="https://openqa.suse.de/tests/7311624" class="external">20211004-2</a> (current job)</p>
<a name="Expected-result"></a>
<h2 >Expected result<a href="#Expected-result" class="wiki-anchor">¶</a></h2>
<p>Last good: <a href="https://openqa.suse.de/tests/7307748" class="external">20211004-1</a> (or more recent)</p>
<a name="Further-details"></a>
<h2 >Further details<a href="#Further-details" class="wiki-anchor">¶</a></h2>
<p>Always latest result in this scenario: <a href="https://openqa.suse.de/tests/latest?arch=s390x&distri=sle&flavor=Server-DVD-Updates&machine=s390x-kvm-sle12&test=podman_tests&version=15-SP2" class="external">latest</a></p>
openQA Tests - action #99735 (Rejected): test fails in rootless_podman -- podman commit refreshed...https://progress.opensuse.org/issues/997352021-10-04T11:38:27Zcfconradcfamullaconrad@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<p>openQA test in scenario sle-15-SP2-Server-DVD-Updates-aarch64-podman_tests@aarch64-virtio fails in<br>
<a href="https://openqa.suse.de/tests/7307763/modules/rootless_podman/steps/118" class="external">rootless_podman</a></p>
<a name="Test-suite-description"></a>
<h2 >Test suite description<a href="#Test-suite-description" class="wiki-anchor">¶</a></h2>
<p>The base test suite is used for job templates defined in YAML documents. It has no settings of its own.</p>
<a name="Reproducible"></a>
<h2 >Reproducible<a href="#Reproducible" class="wiki-anchor">¶</a></h2>
<p>Fails since (at least) Build <a href="https://openqa.suse.de/tests/7279452" class="external">20210930-2</a></p>
<a name="Expected-result"></a>
<h2 >Expected result<a href="#Expected-result" class="wiki-anchor">¶</a></h2>
<p>Last good: <a href="https://openqa.suse.de/tests/7273445" class="external">20210930-1</a> (or more recent)</p>
<a name="Further-details"></a>
<h2 >Further details<a href="#Further-details" class="wiki-anchor">¶</a></h2>
<p>Always latest result in this scenario: <a href="https://openqa.suse.de/tests/latest?arch=aarch64&distri=sle&flavor=Server-DVD-Updates&machine=aarch64-virtio&test=podman_tests&version=15-SP2" class="external">latest</a></p>
openQA Tests - action #97106 (Resolved): test fails in enable_selinuxhttps://progress.opensuse.org/issues/971062021-08-18T09:38:51Zcfconradcfamullaconrad@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<p>Command: <code>aureport -a</code></p>
<pre><code># Test died: script failed with : 3tByJ
AVC Report
===============================================================
# date time comm subj syscall class permission obj result event
===============================================================
<no events of interest were found>
SCRIPT_FINISHED3tByJ-1-
at /usr/lib/os-autoinst/testapi.pm line 1153.
</code></pre>
<p>From my POV this isn't a failure, we just didn't hit anything, not sure why <code>aureport -a</code> return 1 in such a case.<br>
<a href="https://github.com/linux-audit/audit-userspace/blob/master/src/aureport.c#L154" class="external">https://github.com/linux-audit/audit-userspace/blob/master/src/aureport.c#L154</a></p>
<a name="Test-suite-description"></a>
<h2 >Test suite description<a href="#Test-suite-description" class="wiki-anchor">¶</a></h2>
<p>Same test as microos but enabling SELinux after boot.</p>
<a name="Reproducible"></a>
<h2 >Reproducible<a href="#Reproducible" class="wiki-anchor">¶</a></h2>
<p>Fails since (at least) Build <a href="https://openqa.opensuse.org/tests/1808622" class="external">20210626</a></p>
<a name="Expected-result"></a>
<h2 >Expected result<a href="#Expected-result" class="wiki-anchor">¶</a></h2>
<p>Last good: <a href="https://openqa.opensuse.org/tests/1750717" class="external">20210519</a> (or more recent)</p>
<a name="Further-details"></a>
<h2 >Further details<a href="#Further-details" class="wiki-anchor">¶</a></h2>
<p>Always latest result in this scenario: <a href="https://openqa.opensuse.org/tests/latest?arch=x86_64&distri=microos&flavor=MicroOS-Image&machine=64bit&test=microos_selinux&version=Tumbleweed" class="external">latest</a></p>
openQA Tests - action #66973 (Rejected): [qac][wicked][dante] Enable gitlab-cihttps://progress.opensuse.org/issues/669732020-05-18T08:37:30Zcfconradcfamullaconrad@suse.com
<ul>
<li>introduce .gitlab-ci.yml in <a href="https://gitlab.suse.de/wicked-maintainers/wicked" class="external">https://gitlab.suse.de/wicked-maintainers/wicked</a></li>
<li>provide scripts to run and wait for results</li>
</ul>
openQA Tests - action #63208 (Rejected): [kernel][public cloud] Increase standardDSv3Family Cores...https://progress.opensuse.org/issues/632082020-02-06T11:21:14Zcfconradcfamullaconrad@suse.com
<p><a href="https://openqa.suse.de/tests/3870290#step/run_ltp/85" class="external">https://openqa.suse.de/tests/3870290#step/run_ltp/85</a> </p>
<p>terraform apply fail with:</p>
<pre><code>Error: compute.VirtualMachinesClient#CreateOrUpdate: Failure sending request: StatusCode=0 -- Original Error: autorest/azure: Service returned an error. Status=<nil> Code="OperationNotAllowed" Message="Operation could not be completed as it results in exceeding approved standardDSv3Family Cores quota. Additional details - Deployment Model: Resource Manager, Location: westeurope, Current Limit: 16, Current Usage: 16, Additional Required: 8, (Minimum) New Limit Required: 24. Submit a request for Quota increase at https://aka.ms/ProdportalCRP/?#create/Microsoft.Support/Parameters/%7B%22subId%22:%22c011786b-59d7-4817-880c-7cd8a6ca4b19%22,%22pesId%22:%2206bfd9d3-516b-d5c6-5802-169c800dec89%22,%22supportTopicId%22:%22e12e3d1d-7fa0-af33-c6d0-3c50df9658a3%22%7D by specifying parameters listed in the ‘Details’ section for deployment to succeed. Please read more about quota limits at https://docs.microsoft.com/en-us/azure/azure-supportability/per-vm-quota-requests."
on plan.tf line 129, in resource "azurerm_virtual_machine" "openqa-vm":
129: resource "azurerm_virtual_machine" "openqa-vm" {
</code></pre> openQA Project - action #58826 (Resolved): Result not rendered in detail view on short (e.g. <10s...https://progress.opensuse.org/issues/588262019-10-29T13:52:57Zcfconradcfamullaconrad@suse.com
<p>This was discovered during investigation of poo#39845.</p>
<p>The problem is, if a test-module run very short time, the result isn't rendered in detail view and only "None" gets displayed.<br>
Once the job finished the correct result gets displayed.</p>
<p>This is how it looks in "not expected" state:<br>
<img src="http://imagebin.suse.de/2480/img" alt="http://imagebin.suse.de/2480/img" /></p>
<p>Increase the duration of once test-module to >10s (DO_NOT_FAIL=1) it looks like:<br>
<img src="http://imagebin.suse.de/2484/img" alt="http://imagebin.suse.de/2484/img" /></p>
<a name="Reproduce"></a>
<h1 >Reproduce<a href="#Reproduce" class="wiki-anchor">¶</a></h1>
<p><a href="https://github.com/cfconrad/os-autoinst-distri-opensuse/commit/b2204b65b15459654d531b1dfd6221aab296a3f7" class="external">https://github.com/cfconrad/os-autoinst-distri-opensuse/commit/b2204b65b15459654d531b1dfd6221aab296a3f7</a></p>
<p>run with:<br>
<code>CLEMIX_EXCLUDE='^(?!no_res)' CLEMIX_NO_BOOT=1</code></p>
openQA Tests - action #58739 (Closed): [qac][jdp][public cloud] Group publiccloud images by FLAVO...https://progress.opensuse.org/issues/587392019-10-28T12:32:56Zcfconradcfamullaconrad@suse.com
<p>Problem:<br>
Publiccloud-team produce image with KIWI and BUILD number. <br>
The result is, that we can get different KIWI-BUILD tuple within each FLAVOR (@see <a href="https://build.suse.de/package/view_file/SUSE:SLE-12-SP5:GA/SLES12-SP5-Azure/config.kiwi?expand=1" class="external">https://build.suse.de/package/view_file/SUSE:SLE-12-SP5:GA/SLES12-SP5-Azure/config.kiwi?expand=1</a>)</p>
<p>TODO:<br>
We need to adopt JDP to build groups based on VERSION-FLAVOR-ARCH-MACHINE. (@see <a href="https://gitlab.suse.de/rpalethorpe/jdp/blob/master/notebooks/Report-Status-Diff.ipynb" class="external">https://gitlab.suse.de/rpalethorpe/jdp/blob/master/notebooks/Report-Status-Diff.ipynb</a>)</p>
<p>Example list of PC images:</p>
<pre><code>SLES12-SP5-Azure.x86_64-0.9.0-Standard-On-Demand-Build3.2
SLES12-SP5-Azure.x86_64-0.9.0-Basic-On-Demand-Build3.2
SLES12-SP5-Azure.x86_64-0.9.1-BYOS-Build3.2
SLES12-SP5-EC2.x86_64-0.9.1-BYOS-Build1.55
SLES12-SP5-EC2.x86_64-0.9.1-On-Demand-Build1.55
SLES12-SP5-GCE.x86_64-0.9.1-On-Demand-Build1.55
SLES12-SP5-GCE.x86_64-0.9.1-BYOS-Build1.55
</code></pre>
<p>The FLAVOR variable from openQA contains both (CPS name and Image flavor), e.g. <code>GCE-On-Demand</code> or <code>Azure-Standard-On-Demand</code>.<br>
The BUILD variable form openQA contains KIWI and Build number, separated by <code>-</code>, e.g. <code>0.9.1-1.55</code>.</p>
openQA Project - action #58100 (Workable): HashKeyQuotes: force no quotes for names containing "_"https://progress.opensuse.org/issues/581002019-10-14T07:45:19Zcfconradcfamullaconrad@suse.com
<a name="Motivation"></a>
<h2 >Motivation<a href="#Motivation" class="wiki-anchor">¶</a></h2>
<p>Currently we allow quotes for names containing "<u>". From perl perspective a name containing '</u>' is still a simple identifier and can be used without quotes for hashkey.</p>
<p>From <a href="https://perldoc.perl.org/perldata.html" class="external">https://perldoc.perl.org/perldata.html</a> :</p>
<pre><code>The => operator is mostly just a more visually distinctive synonym for a comma, but it also arranges
for its left-hand operand to be interpreted as a string if it's a bareword that would be a legal simple
identifier.
</code></pre>
<p>So we will end up with a regex like this:</p>
<pre><code>/^[a-zA-Z][0-9a-zA-Z_]*$/
</code></pre>
<p>Changing it, produce perlcritic violations, so a cleanup is needed as well.</p>
<a name="Acceptance-criteria"></a>
<h2 >Acceptance criteria<a href="#Acceptance-criteria" class="wiki-anchor">¶</a></h2>
<ul>
<li><strong>AC1:</strong> hash keys containing <code>_</code> are accepted without surrounding quotes</li>
<li><strong>AC2:</strong> Adopted tidy rules have been applied to os-autoinst and downstream os-autoinst-distri-opensuse</li>
</ul>
<a name="Suggestions"></a>
<h2 >Suggestions<a href="#Suggestions" class="wiki-anchor">¶</a></h2>
<ul>
<li>Change existing tidy checks within os-autoinst</li>
<li>Ensure os-autoinst code adheres to the new rules</li>
<li>Apply the same for os-autoinst-distri-opensuse</li>
</ul>
openQA Tests - action #55427 (Rejected): [kernel][public cloud] Investigate ec2 image upload erro...https://progress.opensuse.org/issues/554272019-08-13T09:07:23Zcfconradcfamullaconrad@suse.com
<p>Image upload is ok, but during cleanup we get following error message:</p>
<pre><code>Created image: ami-074c17922d96d6876
An error occurred (DependencyViolation) when calling the DeleteSecurityGroup operation: resource sg-0cd4d10d680677a98 has a dependent object
</code></pre>
<p>Just re-trigger works for now, as the image is uploaded and the next run just found it.</p>
openQA Tests - action #45758 (Rejected): [qac][public cloud] Get hard boot time limits for specif...https://progress.opensuse.org/issues/457582019-01-07T10:27:52Zcfconradcfamullaconrad@suse.com
<p>For the test modules, we should have fixed boot time limits for specific CSPs.</p>
<p>Q: The boot time is VM type related, do we need to take this into account?</p>
openQA Project - action #44654 (Resolved): [tool] Summarize results in test details tabhttps://progress.opensuse.org/issues/446542018-12-03T13:35:29Zcfconradcfamullaconrad@suse.com
<a name="Motivation"></a>
<h2 >Motivation<a href="#Motivation" class="wiki-anchor">¶</a></h2>
<p>To get a quick overview in test details the idea is to show grouped results in tab on details page, somehow similar to what we have in tests list: <img src="https://progress.opensuse.org/attachments/download/7238/overview.png" alt="short summary" loading="lazy" />.</p>
<a name="Acceptance-criteria"></a>
<h2 >Acceptance criteria<a href="#Acceptance-criteria" class="wiki-anchor">¶</a></h2>
<ul>
<li><strong>AC1:</strong> test details tabs shows test module results summary</li>
</ul>
<a name="Suggestions"></a>
<h2 >Suggestions<a href="#Suggestions" class="wiki-anchor">¶</a></h2>
<p>The initial idea was to have it for "external results", but it could also apply to details, which need to be discussed. We also show the number of comments in the comment tab.</p>
openQA Project - action #40913 (Resolved): script_output sometimes fail on virtio consolehttps://progress.opensuse.org/issues/409132018-09-12T09:34:38Zcfconradcfamullaconrad@suse.com
<a name="Observation"></a>
<h2 >Observation<a href="#Observation" class="wiki-anchor">¶</a></h2>
<p>It seams randomly happen in openQA, that tests fail, cause "cat -" never finish.</p>
<a name="Steps-to-reproduce"></a>
<h2 >Steps to reproduce<a href="#Steps-to-reproduce" class="wiki-anchor">¶</a></h2>
<p>Observations on openQA</p>
<ul>
<li><a href="https://progress.opensuse.org/issues/30613#note-23" class="external">https://progress.opensuse.org/issues/30613#note-23</a> </li>
<li><a href="https://openqa.suse.de/tests/2031979#step/boot_ltp/78" class="external">https://openqa.suse.de/tests/2031979#step/boot_ltp/78</a></li>
</ul>
<p>I was able to bring my openQA instance in such a state. I'm actually not sure, if this<br>
is the same problem as we have in osd, but it looks similar. The big different is, that if it happen once,<br>
it happen always for that worker.</p>
<p>What I did so far:</p>
<ul>
<li>Start a test which is using virtio console</li>
<li>restart openQA while the test is running</li>
<li>run tests again</li>
</ul>
<a name="Problem"></a>
<h2 >Problem<a href="#Problem" class="wiki-anchor">¶</a></h2>
<p>A call like this:</p>
<pre><code>cat - > /tmp/script8RI3l.sh; echo 8RI3l-$?-
</code></pre>
<p>Doesn't get the EOT and so we never reach the prompt again.</p>
<a name="Suggestion"></a>
<h2 >Suggestion<a href="#Suggestion" class="wiki-anchor">¶</a></h2>
<p>We need deeper investigations.</p>
openQA Project - action #39845 (Resolved): Results of tests with very short duration (~<10s) are ...https://progress.opensuse.org/issues/398452018-08-16T10:13:28Zcfconradcfamullaconrad@suse.com
<p>If the execution of the job takes approximately less then 10s the results are not displayed in the openqa web ui.<br>
When enlarge the execution time with "script_run('sleep 8');" results are displayed.</p>
<p>I noticed this only with the ssh backend (<a href="https://github.com/os-autoinst/os-autoinst/pull/1012" class="external">https://github.com/os-autoinst/os-autoinst/pull/1012</a>), which is in development.</p>
<p>Failed job: <a href="http://10.86.1.52/tests/36" class="external">http://10.86.1.52/tests/36</a></p>