Project

General

Profile

Actions

action #128420

closed

[alert][grafana] 100% packet loss from qa-power8-4-kvm, grenache-1 and powerqaworker-qam-1 to s390zp{11,15,17}.suse.de size:M

Added by nicksinger about 1 year ago. Updated 11 months ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Target version:
Start date:
Due date:
% Done:

0%

Estimated time:
Tags:

Description

Observation

Starting 2023-04-27 15:15:00 the mentioned machines in the title failed to access/ping s390 LPARs. Something between these hosts has changed or broke and needs to be fixed.
We had similar issues in the past, see the following SD tickets:

Suggestions

  • Check what these machines have in common. A quick look of mine showed that they are in the "old" qa network close by: https://racktables.suse.de/index.php?page=rack&rack_id=516
  • Check if other machines in that location, network, room, switch have the same problems
  • Create a new SD ticket referencing the old ones. Robert mentioned in one of them that we might need to get rid of a second uplink

Rollback steps

  1. Remove silence for rule_uid=2Z025iB4km
Actions

Also available in: Atom PDF