Project

General

Profile

Actions

action #72136

closed

[osd-admins] [Alerting] Workers alert during osd deployment, then "ok" after 1 minute, should not alert

Added by okurz about 4 years ago. Updated about 4 years ago.

Status:
Resolved
Priority:
Low
Assignee:
Category:
-
Start date:
2020-09-30
Due date:
% Done:

0%

Estimated time:

Description

Observation

[Alerting] Workers alert
Minion workers down. Check systemd services on the openQA host

See https://stats.openqa-monitor.qa.suse.de/d/WebuiDb/webui-summary?fullscreen&edit&tab=alert&panelId=17&orgId=1&refresh=30s for the corresponding panel

Expected result

There should not be any alert during deployment as the service recovered itself after 1m.

Problem

I think we already bumped the time duration for which no alert should not be raised to 8m, maybe need to increase or find better solution.

Actions

Also available in: Atom PDF