Project

General

Profile

Actions

action #158826

closed

openqa-in-openqa - test fails in start_test due to selecting the wrong base qcow image size:S

Added by tinita 19 days ago. Updated 13 days ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
Regressions/Crashes
Target version:
Start date:
2024-04-11
Due date:
% Done:

0%

Estimated time:

Description

Observation

openQA test in scenario openqa-Tumbleweed-dev-x86_64-openqa_install_multimachine@64bit-4G fails in
start_test

Reproducible

Fails since (at least) Build :TW.27793 (current job)

Expected result

Last good: :TW.27792 (or more recent)

Further details

Always latest result in this scenario: latest

This happens because we have a build that is sorted as the latest:
https://openqa.opensuse.org/tests/3969403#settings BUILD Martchus/os-autoinst-distri-opensuse.git#mm-sync

Probably the code selecting the snapshot to use it broken, hence the clone script doesn't find the assets.


Related issues 1 (1 open0 closed)

Related to openQA Project - action #157273: Run os-autoinst-distri-openQA directly from git without anything related in /var/lib/openqa/share/tests - Follow-up with ideas not tried out in the spike solution #157270 size:MBlockeddheidler2024-04-17

Actions
Actions #1

Updated by okurz 19 days ago

  • Tags set to reactive work
  • Priority changed from Normal to High
Actions #2

Updated by okurz 18 days ago

  • Subject changed from openqa-in-openqa - test fails in start_test to openqa-in-openqa - test fails in start_test due to selecting the wrong base qcow image size:S
  • Description updated (diff)
  • Status changed from New to Workable
Actions #3

Updated by tinita 18 days ago

  • Status changed from Workable to In Progress
  • Assignee set to tinita
Actions #4

Updated by tinita 18 days ago

The problem is that the failing tests are checking out an older version:
https://openqa.opensuse.org/tests/4077075/file/vars.json

"CASEDIR" : "/var/lib/openqa/pool/7/os-autoinst-distri-openQA",
"TEST_GIT_HASH" : "b33598f6310549853fe78dd6697e42adf58f2341",

compared to a passed one:
https://openqa.opensuse.org/tests/4077873/file/vars.json

"CASEDIR" : "/var/lib/openqa/pool/7/os-autoinst-distri-openQA",
"TEST_GIT_HASH" : "9e2d1166307196db8ffc37064cc09a97b1cd8e33",

We only added the fix with adding groupid to the api call after b33598f6310549853fe78dd6697e42adf58f2341, so that's why it retrieves a job fro m a different group https://openqa.opensuse.org/tests/3969403#settings as the latest build, which is too old and the asset is already gone.

Looking at when the failures started, this seems to be related to #157273

Actions #5

Updated by tinita 18 days ago

  • Related to action #157273: Run os-autoinst-distri-openQA directly from git without anything related in /var/lib/openqa/share/tests - Follow-up with ideas not tried out in the spike solution #157270 size:M added
Actions #6

Updated by tinita 18 days ago

  • Status changed from In Progress to Workable

This is being worked on in #157273

Actions #7

Updated by okurz 16 days ago

  • Priority changed from High to Urgent

repeated production test failures -> urgent

Actions #8

Updated by tinita 15 days ago

  • Assignee deleted (tinita)
Actions #9

Updated by dheidler 14 days ago

  • Status changed from Workable to In Progress
  • Assignee set to dheidler
Actions #10

Updated by dheidler 14 days ago

Deleted the cache on openqaworker21 and 22.
That should serve as a workaround until https://github.com/os-autoinst/os-autoinst/pull/2486 is merged.

openqaworker21:~ # rm /var/lib/openqa/cache/git/os-autoinst/os-autoinst-distri-openQA* -r
openqaworker22:~ # rm /var/lib/openqa/cache/git/os-autoinst/os-autoinst-distri-openQA* -r

Actions #11

Updated by dheidler 14 days ago

  • Priority changed from Urgent to High
Actions #12

Updated by dheidler 14 days ago ยท Edited

  • Status changed from In Progress to Blocked

Waiting for #157273 to finally resolve this one.

Actions #13

Updated by dheidler 13 days ago

  • Status changed from Blocked to Resolved
Actions

Also available in: Atom PDF