action #117172
closed
Flaky alert about infrastructure packet loss
0%
Description
See http://monitor.qa.suse.de/d/EML0bpuGk/monitoring?tab=alert&viewPanel=4&orgId=1, in particular the alert on 25 September 2022 03:40:44 CEST, which turned back to green just 1m later. We should prevent such flaky alert reports.
Updated by okurz about 2 years ago
- Status changed from New to In Progress
- Assignee set to okurz
mkittler prepared https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/745 to bump the alerting time period.
Updated by okurz about 2 years ago
- Due date set to 2022-10-11
- Status changed from In Progress to Feedback
I was wondering whether we should increase the alerting time threshold even more, but maybe 15m is ok for now.
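To illustrate why a longer alerting time period suppresses blips like the one from the description, here is a minimal sketch. It assumes the alert only fires once the condition has been breached continuously for the whole period (as with a "for" duration in Grafana alert rules). The function `fires` and the sample data are hypothetical and not taken from salt-states-openqa.

```python
from datetime import datetime, timedelta


def fires(samples, pending):
    """Return True if the condition stays breached for at least `pending`.

    samples: list of (timestamp, breached) tuples sorted by time.
    This is a hypothetical model, not the actual Grafana implementation.
    """
    breach_start = None
    for ts, breached in samples:
        if not breached:
            # Any healthy sample resets the pending timer.
            breach_start = None
            continue
        if breach_start is None:
            breach_start = ts
        if ts - breach_start >= pending:
            return True
    return False


start = datetime(2022, 9, 25, 3, 40)
# Packet loss breached for one evaluation only, then back to green.
blip = [(start + timedelta(minutes=i), i == 0) for i in range(20)]
# Packet loss breached for 20 consecutive minutes.
outage = [(start + timedelta(minutes=i), True) for i in range(20)]

print(fires(blip, timedelta(minutes=15)))    # False
print(fires(outage, timedelta(minutes=15)))  # True
```

With a period of 15m the one-minute blip never fires, while a sustained outage still does. The trade-off of a longer period is that real outages are reported later.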
Updated by mkittler about 2 years ago
- Assignee changed from okurz to mkittler
I've seen the additional alert from today, but it occurred before my SR was merged. I'd wait a few days to see whether it helped.
Updated by mkittler about 2 years ago
- Status changed from Feedback to Resolved
It hasn't happened again so I'm resolving the ticket for now.
Updated by livdywan about 2 years ago
- Status changed from Resolved to Feedback
The alert was live for just 3 minutes. Did we actually increase it to 15 minutes as per #117172#note-2, or is there another alert involved here? Something doesn't seem to work correctly.
Updated by okurz about 2 years ago
https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/759 to bump from 15m to 4h.
Updated by okurz about 2 years ago
- Copied to action #118375: Do not alert about "packet loss" if hosts are down added
Updated by okurz about 2 years ago
- Due date deleted (2022-10-11)
- Status changed from Feedback to Resolved
MR merged, rolled out and effective. Further improvements put into #118375.