Project

General

Profile

Actions

action #133748

open

coordination #121720: [saga][epic] QE setup in PRG2+NUE3

coordination #129280: [epic] Move from SUSE NUE1 (Maxtorhof) to new NBG Datacenters

Move of openqaworker-arm-1 to FC Basement size:M

Added by okurz 4 months ago. Updated 2 months ago.

Status:
Workable
Priority:
Normal
Assignee:
-
Target version:
Start date:
Due date:
% Done:

0%

Estimated time:

Description

Motivation

In #132614 openqaworker-arm-1 was moved to FC Basement so that we have one hot-redundant aarch64 OSD machine outside of PRG2. For that to be setup we need to also accomodate the automatic recovery feature.

Acceptance criteria

  • AC1: openqaworker-arm-1 runs OSD production jobs again
  • AC2: The automatic recovery of openqaworker-arm-1 on crashes works

Suggestions

Rollback steps

Actions #2

Updated by okurz 4 months ago

  • Description updated (diff)

I now tried to mute the "notification" which triggers the webhook of gitlab for openqaworker-arm-1, added to rollback steps

Actions #3

Updated by mkittler 4 months ago

  • Subject changed from Move of openqaworker-arm-1 to FC Basement to Move of openqaworker-arm-1 to FC Basement size:M
Actions #4

Updated by mkittler 4 months ago

  • Status changed from New to Workable
Actions #5

Updated by okurz 4 months ago

  • Description updated (diff)
Actions #6

Updated by okurz 4 months ago

I crosschecked the machine connections and updated racktables. The fibre channel connection is correct as documented in racktables, eth0 in OS is for the physical port SFP+-1. On the DHCP server no request shows up for this network interface. I assume the switch is not configured correctly yet for the SFP port.

Actions #7

Updated by okurz 4 months ago

  • Priority changed from High to Normal
Actions #8

Updated by okurz 3 months ago

  • Priority changed from Normal to Urgent
Actions #9

Updated by okurz 3 months ago

  • Priority changed from Urgent to High

We don't have capacity to work on that many infra tasks with urgent prio, reducing to "High"

Actions #10

Updated by okurz 3 months ago

  • Parent task changed from #130955 to #129280
Actions #11

Updated by okurz 2 months ago

  • Priority changed from High to Normal
  • Target version changed from Ready to future

We will just have to trust the prg2 workers for now

Actions

Also available in: Atom PDF