Project

General

Profile

action #106898

Protection against asset clobbering

Added by MDoucha 4 months ago. Updated 2 months ago.

Status:
Resolved
Priority:
Low
Assignee:
Category:
Feature requests
Target version:
Start date:
2022-02-16
Due date:
% Done:

0%

Estimated time:
Difficulty:

Description

QCOW images in OpenQA occasionally get corrupted because multiple jobs try to publish the same file at the same time, either due to PUBLISH_* setting misconfiguration or duplicate install jobs scheduled in parallel. For example, this job failed to start:
https://openqa.suse.de/tests/8162749

because these three install jobs finished 20 minutes apart and tried to upload the same QCOW image:
https://openqa.suse.de/tests/8162347
https://openqa.suse.de/tests/8161501
https://openqa.suse.de/tests/8160547

Please add some sort of protection against asset clobbering via PUBLISH_* variables:

  • two jobs must not publish the same file in parallel
  • jobs must not publish a file while another job may be downloading the previous version
  • PUBLISH_* misconfiguration (e.g. copy-paste mistakes among multiple testsuites) should be detected and reported in the WebUI, for example as the reason why install job was terminated

Related issues

Related to openQA Project - action #109319: [qe-core] aarch64 tests failing in qemu-img due to broken image (was: "with cache error") size:SResolved2022-03-31

History

#1 Updated by okurz 4 months ago

  • Target version set to future

#2 Updated by okurz 3 months ago

  • Related to action #109319: [qe-core] aarch64 tests failing in qemu-img due to broken image (was: "with cache error") size:S added

#3 Updated by okurz 2 months ago

  • Status changed from New to Resolved
  • Assignee set to okurz
  • Target version changed from future to Ready

AFAICS https://github.com/os-autoinst/openQA/pull/4597 solves this together with pointing out the limitations and implications in https://open.qa/docs/#_specifying_assets_created_by_a_job

Also available in: Atom PDF