action #133397
openQA Project - coordination #110833 (closed): [saga][epic] Scale up: openQA can handle a schedule of 100k jobs with 1k worker instances
openQA Project - coordination #108209: [epic] Reduce load on OSD
HTTP Response alert Salt alerting and autoresolving shortly size:M
Start date: 2023-07-26
% Done: 0%
Description
Observation
From Grafana/ osd-admins@suse.de
Values
B0=19.585438379
Labels
alertname HTTP Response alert
grafana_folder Salt
rule_uid tm0h5mf4k
Acceptance criteria
- AC1: Overly strict alerts for HTTP responses are no longer observed
Steps to reproduce
- Bump the sensitivity of the alert
- Investigate what underlying problem exists, if any
Suggestions
- Do not conclude that OSD is sometimes overloaded. We already know that! That is what our alerts need to account for.
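One way to account for short load spikes is to fire only when the value stays above the threshold for several consecutive samples, so a single slow response does not alert and then autoresolve right away. The following is a minimal Python sketch of that idea, not the actual Grafana rule. The function name, the threshold of 10.0 and all sample values except the observed B0 reading are hypothetical.

```python
from typing import Iterable, List


def sustained_breaches(
    samples: Iterable[float], threshold: float, min_consecutive: int
) -> List[int]:
    """Return the sample indices at which an alert would fire.

    An alert fires once a run of values above `threshold` reaches
    `min_consecutive` samples. A value at or below the threshold
    resets the run.
    """
    fired = []
    run = 0
    for index, value in enumerate(samples):
        if value > threshold:
            run += 1
            if run == min_consecutive:
                fired.append(index)
        else:
            run = 0
    return fired


# Hypothetical response times in seconds. 19.585438379 is the B0 value
# from the alert mail, the other values are made up for illustration.
response_times = [1.0, 19.585438379, 1.2, 18.0, 19.0, 20.0, 1.0]

# With min_consecutive=1 every spike fires, including the isolated one.
print(sustained_breaches(response_times, threshold=10.0, min_consecutive=1))

# With min_consecutive=3 only the sustained slowdown fires.
print(sustained_breaches(response_times, threshold=10.0, min_consecutive=3))
```

In Grafana the comparable knob would be the evaluation or pending period of the alert rule. Which value suits OSD is an open question for this ticket.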
Updated by okurz 4 months ago
- Related to action #133325: osd http response alerts - bump threshold further up added
Updated by okurz 4 months ago
Likely related:
https://suse.slack.com/archives/C02CGKBCGT1/p1690468821341979?thread_ts=1690468821.341979&cid=C02CGKBCGT1
openQA is slow as molasses today :snail: