Project

General

Profile

Actions

action #95833

closed

[qe-sap][ha] test fails in ha_cluster_init - iscsid: Kernel reported iSCSI connection 1:0 error

Added by acarvajal almost 3 years ago. Updated over 2 years ago.

Status:
Rejected
Priority:
Normal
Assignee:
-
Category:
Bugs in existing tests
Target version:
-
Start date:
2021-07-22
Due date:
% Done:

100%

Estimated time:
Difficulty:

Description

Observation

openQA test in scenario sle-15-SP1-Server-DVD-HA-Incidents-x86_64-qam_ha_priority_fencing_node01@64bit fails in
ha_cluster_init

Test suite description

The base test suite is used for job templates defined in YAML documents. It has no settings of its own.

Reproducible

Fails since (at least) Build MR:246274:spice-vdagent

Expected result

Last good: :19994:ffmpeg (or more recent)

Further details

Always latest result in this scenario: latest

journal attached to the failing test includes:

Jul 21 12:01:29.114843 priorityfencing-node01 sbd[3248]: /dev/disk/by-path/ip-10.0.2.1:3260-iscsi-iqn.2016-02.de.openqa:132-lun-0:    error: servant_md: No slot allocated, and automatic allocation failed for disk /dev/disk/by-path/ip-10.0.2.1:3260-iscsi-iqn.2016-02.de.openqa:132-lun-0.
Jul 21 12:01:29.826675 priorityfencing-node01 sbd[3246]:    error: inquisitor_child: SBD: Not enough votes to proceed. Aborting start-up.

and

Jul 21 12:01:34.439488 priorityfencing-node01 iscsid[2737]: iscsid: Kernel reported iSCSI connection 1:0 error (1022 - ISCSI_ERR_NOP_TIMEDOUT: A NOP has timed out) state (3)
Jul 21 12:02:22.499997 priorityfencing-node01 iscsid[2737]: iscsid: connection1:0 is operational after recovery (4 attempts)

This looks like a networking issue between the cluster node and the support server job which provides iSCSI.

Actions

Also available in: Atom PDF