Project

General

Profile

Actions

action #157528

open

QA - coordination #153655: [saga][epic] Future datacenter and network setup at SUSE

QA - coordination #160520: [epic] Streamlined PowerPC openQA resource usage

Remove redundant ASM connections for powerPC machines size:S

Added by nicksinger 3 months ago. Updated about 1 month ago.

Status:
Workable
Priority:
Low
Assignee:
-
Category:
Regressions/Crashes
Target version:
Start date:
2024-03-19
Due date:
% Done:

0%

Estimated time:
Tags:

Description

Motivation

Our current hypothesis is that the PPC HMC struggles with two simultaneous connections to the ASM at the same time. It causes the managed system to "flicker" in the webui and constantly abort any operation you execute. We should explore if these connection issues can be resolved by only having one, single connection between ASM<->HMC.

Machines where this happens:

  • soapberry
  • blackcurrant

Acceptance criteria

Suggestions

  • Research upstream what IBM suggests. We assume it's not foreseen that one connects more than one physical network connection to the same HMC
  • Create an infra ticket according to https://progress.opensuse.org/projects/qa/wiki/Tools#SUSE-IT-ticket-handling asking to remove the secondary, redundant network connection. At best physically remove and update racktables, not in switch config so that not somebody else some months later tries to "fix" a disabled switch port
  • Ensure that machines are still controllable over HMC after cable removal
  • Ensure that racktables is up-to-date with the remaining connection
Actions #1

Updated by okurz 3 months ago

  • Tags set to infra, ppc, hmc, prg2
  • Category set to Regressions/Crashes
  • Target version set to Ready
  • Parent task set to #123800

For both soapberry and blackcurrant in the HMC I now went to the ASM, removed older temporarily disconnected HMC connection entries and in the HMC remembered the IP, removed the system connection and re-added that one but only that one. If that keeps the HMC connections stable we should ask IT to disconnect the secondary physical ethernet connection.

Actions #2

Updated by okurz 3 months ago

Haven't seen any flickering anymore so the software workaround seems to have helped.

Actions #3

Updated by okurz 3 months ago

  • Subject changed from Remove redundant ASM connections for powerPC machines to Remove redundant ASM connections for powerPC machines size:S
  • Description updated (diff)
  • Status changed from New to Workable
Actions #4

Updated by nicksinger 3 months ago

  • Status changed from Workable to In Progress
  • Assignee set to nicksinger
Actions #5

Updated by openqa_review 3 months ago

  • Due date set to 2024-04-10

Setting due date based on mean cycle time of SUSE QE Tools

Actions #6

Updated by nicksinger 3 months ago

  • Status changed from In Progress to Workable
  • Assignee deleted (nicksinger)

Unassign due to sick leave. Connections in software already removed, SD-Ticket missing which requests to physically unplug the cables.
Also check https://bugzilla.suse.com/show_bug.cgi?id=1221485

Actions #7

Updated by okurz 3 months ago

  • Due date deleted (2024-04-10)
  • Status changed from Workable to Blocked
  • Assignee set to okurz
  • Priority changed from Normal to Low
  • Target version changed from Ready to Tools - Next

Good point. I also followed https://bugzilla.suse.com/show_bug.cgi?id=1221485 and I think we can wait for progress in there.

Actions #8

Updated by okurz 3 months ago

  • Status changed from Blocked to Workable
  • Assignee deleted (okurz)
  • Priority changed from Low to Normal
  • Target version changed from Tools - Next to Ready

https://bugzilla.suse.com/show_bug.cgi?id=1221485#c14 shows that they actually suffered from the same problem, same network connected to both physical HMC ethernet ports which should be avoided. So back to before.

Actions #9

Updated by okurz about 2 months ago

  • Priority changed from Normal to Low
Actions #10

Updated by okurz about 1 month ago

  • Target version changed from Ready to future
  • Parent task changed from #123800 to #160520
Actions

Also available in: Atom PDF