Project

General

Profile

Actions

action #113746

closed

monitoring: The grafana "ping time" panel does not list all hosts size:S

Added by okurz almost 2 years ago. Updated almost 2 years ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
-
Target version:
Start date:
2022-07-18
Due date:
2022-08-09
% Done:

0%

Estimated time:

Description

Observation

For example currently https://monitor.qa.suse.de/d/WDopenqaworker10/worker-dashboard-openqaworker10?orgId=1&refresh=1m&viewPanel=65099&from=now-90d&to=now currently shows a list of hosts, e.g. dist.suse.de, download.opensuse.org, etc., but not scc.suse.com from
the list https://gitlab.suse.de/openqa/salt-pillars-openqa/-/blob/master/openqa/workerconf.sls#L18:

 required_external_networks:
  - dist.suse.de
  - s390zp11.suse.de
  - s390zp14.suse.de
  - s390zp15.suse.de
  - s390zp17.suse.de
  - s390zp18.suse.de
  - s390zp19.suse.de
  - download.opensuse.org
  - proxy.scc.suse.de
  - qanet.qa.suse.de
  - scc.suse.com

Acceptance criteria

  • AC1: All hosts from "required_external_networks" are shown in the monitoring panel
  • AC2: Alerts are triggered for unavailable hosts

Suggestions

  • Drop scc.suse.com since it can't be pinged
  • Add another boolean panel for hosts being unreachable. Since otherwise we get no alerts for no data i.e. no ping at all

Related issues 2 (0 open2 closed)

Related to openQA Infrastructure - action #113716: [qe-core] proxy-scc is downResolvedszarate2022-07-182022-07-19

Actions
Related to openQA Infrastructure - action #114802: Handle "QA network infrastructure Package loss alert" introduced by #113746 size:MResolvedmkittler2022-07-282022-10-12

Actions
Actions

Also available in: Atom PDF