Project

General

Profile

Actions

action #12344

closed

sporadic "corrupt images" in svirt based test on zkvm

Added by okurz almost 8 years ago. Updated over 6 years ago.

Status:
Resolved
Priority:
Low
Assignee:
-
Category:
Feature requests
Target version:
-
Start date:
2016-06-15
Due date:
% Done:

0%

Estimated time:

Description

observation

svirt based tests can die when booting before installation because the image is unusable, e.g. see
http://lord.arch/tests/1048/file/autoinst-log.txt

Domain openQA-SUT-12 defined from /var/lib/libvirt/images/openQA-SUT-12.xml


20:13:09.4589 Command's stderr:

20:13:10.7595 Command's stdout:


20:13:10.7596 Command's stderr:
error: Failed to start domain openQA-SUT-12
error: internal error: process exited while connecting to monitor: 2016-06-13T20:17:10.483081Z qemu-system-s390x: -drive file=/var/lib/libvirt/images/openQA-SUT-12.img,if=none,id=drive-virtio-disk0,format=qcow2: qcow2: Image is corrupt; cannot be opened read/write


20:13:10.8867 # Test died:
{
  'args' => [],
  'console' => 'svirt',
  'function' => 'define_and_start'
}
virsh start failed at /local/os-autoinst/consoles/sshVirtsh.pm line 392.

also seen in
http://lord.arch/tests/1055
http://lord.arch/tests/1087

steps to reproduce

Run zkvm tests often, happens in about 3/20 runs, at least in my setup.

problem

H1. specific to my openQA+reserved_s390x@zkvm worker (VIRSH_HOSTNAME=s390pb.suse.de, VIRSH_GUEST=10.161.145.7, VIRSH_INSTANCE=12)
H2. the s390x guest is "reused" or the image is cleanup up by someone else
H3. out of disk space and return value of qemu-img command is not properly checked?

suggestion

We could improve the check for the image being present before we try to access it again after reboot.

workaround

sporadic, so retrigger


Related issues 1 (0 open1 closed)

Copied to openQA Project - action #12838: sporadic "corrupt images" in various tests or fails uploading, e.g. with "Premature connection close"Resolvedoholecek2016-06-15

Actions
Actions

Also available in: Atom PDF