Project

General

Profile

Actions

action #132788

closed

[alert][flaky] QA-Power8-5-kvm: QA network infrastructure Ping time alert

Added by okurz 10 months ago. Updated 10 months ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
-
Target version:
Start date:
2023-07-15
Due date:
% Done:

0%

Estimated time:

Description

Observation

Over the last days I have seen multiple emails of a flaky alert "QA-Power8-5-kvm: QA network infrastructure Ping time alert". Each alert resolves again after 5m. I feel a for-period of just 5m is not high enough as that means any reboot of a target machine would cause an alert.

Rollback steps

  • Remove silence

Related issues 1 (0 open1 closed)

Related to openQA Infrastructure - action #133130: Lots of alerts for a single cause. Can we group and de-duplicate?Resolvednicksinger2023-07-20

Actions
Actions #1

Updated by okurz 10 months ago

  • Status changed from New to Feedback
  • Assignee set to okurz
Actions #2

Updated by livdywan 10 months ago

With the MR merged, is it time to remove the silence?

Actions #3

Updated by okurz 10 months ago

  • Related to action #133130: Lots of alerts for a single cause. Can we group and de-duplicate? added
Actions #4

Updated by okurz 10 months ago

  • Status changed from Feedback to Resolved

Hm, https://monitor.qa.suse.de/d/WDgrenache-1/worker-dashboard-grenache-1?viewPanel=65099&orgId=1&editPanel=65099&tab=alert# shows that the "For 1h" is effective but the past days had been a horrible alert frenzy. I will keep the silence but added an accordingl "rollback step" to #133130

Actions

Also available in: Atom PDF