Project

General

Profile

Actions

action #97658

closed

many (maybe all) jobs on rebel within o3 run into timeout_exceeded "setup exceeded MAX_SETUP_TIME" size:M

Added by okurz over 2 years ago. Updated over 2 years ago.

Status:
Resolved
Priority:
Low
Assignee:
Category:
-
Target version:
Start date:
2021-08-30
Due date:
% Done:

0%

Estimated time:

Description

Observation

From https://matrix.to/#/!ilXMcHXPOjTZeauZcg:libera.chat/$VfpltxmcDxTLSzTVKh7fa5gFHn6TR3hdcX1cnNnBNlg?via=libera.chat&via=matrix.org&via=opensuse.org AdaLovelace asked

Does anybody know about network issues or anything in this direction between openQA and the mainframe? There are timeouts at the start of openQA tests. https://openqa.opensuse.org/tests/overview?distri=opensuse&version=Tumbleweed&build=20210829&groupid=34

Suggestions

  • Try to recover the existing machine from a rescue system
  • If the hardware needs a replacement, file an infra ticket asking to replace hardware and boot a rescue system to install the machine
  • Ensure that o3 s390x openQA jobs are processed correctly

Out of scope

  • Reconsider backup strategy or replacement using containers

Related issues 4 (0 open4 closed)

Related to openSUSE admin - tickets #97691: Corrupted file system on rebel prevents openQA tests for s390xClosed2021-08-30

Actions
Related to openQA Infrastructure - action #98307: Many jobs in o3 fail with timeout_exceeded on openqaworker1 auto_review:"timeout: setup exceeded MAX_SETUP_TIME":retry size:MResolvedmkittler2021-09-08

Actions
Blocks openQA Infrastructure - action #93381: [O3]request to add an IPMI SUT to O3 size:MResolvednicksinger2021-06-02

Actions
Copied to openQA Infrastructure - action #97751: replacement setup for o3 s390x openQA workers size:MResolveddheidler2021-09-17

Actions
Actions

Also available in: Atom PDF