Project

General

Profile

Actions

action #134906

closed

osd-deployment failed due to openqaworker1 showing "No response" in salt size:M

Added by okurz over 1 year ago. Updated about 1 year ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
-
Start date:
2023-08-31
Due date:
2023-09-23
% Done:

0%

Estimated time:

Description

Observation

https://gitlab.suse.de/openqa/osd-deployment/-/jobs/1794346#L9197 shows

Minions returned with non-zero exit code
openqaworker1.qe.nue2.suse.org:
    Minion did not return. [No response]

Acceptance criteria

Suggestions

  • Research how to backport + package lock in salt recipes, e.g. start with https://docs.saltproject.io/en/latest/ref/modules/all/salt.modules.zypperpkg.html or ask experts in chat (but be careful not be drawn into a "just install SUSE Manager" discussion)
  • Add instructions to salt to ensure the salt-minion package is backported and package locked
  • As alternative consider another separate repo that has the backported/fixed version and is applied to all salt controlled machines (not devel:openQA as this is a salt problem, not openQA machine specific)

Related issues 4 (0 open4 closed)

Related to openQA Infrastructure (public) - action #134132: Bare-metal control openQA worker in NUE2 size:MResolvedokurz

Actions
Related to openQA Infrastructure (public) - action #131249: [alert][ci][deployment] OSD deployment failed, grenache-1, worker5, worker2 salt-minion does not return, error message "No response" size:MResolvedokurz2023-06-22

Actions
Related to openQA Infrastructure (public) - action #135404: openqaworker-arm-2.suse.de minion not returningResolvednicksinger2023-09-082023-09-23

Actions
Related to openQA Infrastructure (public) - action #136325: salt deploy fails due to multiple offline workers in qe.nue2.suse.org+prg2.suse.orgResolvedokurz2023-09-22

Actions
Actions

Also available in: Atom PDF