Project

General

Profile

Actions

action #118891

open

Make alerts depend on each other

Added by nicksinger about 2 years ago. Updated about 2 years ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
Start date:
2022-10-14
Due date:
% Done:

0%

Estimated time:

Description

Observation

Our alerts operate on different levels of a system. Starting from general checks like a host is "up" and network is reachable up to checking output of services (e.g. worker minions). If machines are down this means of course that its services are down too resulting in a lot of mails/alerts. We should introduce a way to disable more sophisticated checks if basic ones already fail.

Acceptance criteria

  • AC1: Offline machines create just a single alert/e-mail
    • AC1.1: We get reminded or have an overview about the current status

Suggestions


Related issues 1 (1 open0 closed)

Related to openQA Infrastructure - action #118375: Do not alert about "packet loss" if hosts are downNew

Actions
Actions #1

Updated by okurz about 2 years ago

  • Related to action #118375: Do not alert about "packet loss" if hosts are down added
Actions #2

Updated by okurz about 2 years ago

  • Target version set to future
Actions

Also available in: Atom PDF