Project

General

Profile

Actions

action #159639

open

[alert] "web UI: Too many 5xx HTTP responses alert" size:S

Added by okurz 23 days ago. Updated 2 days ago.

Status:
Blocked
Priority:
Normal
Assignee:
Category:
Regressions/Crashes
Target version:
Start date:
2024-04-26
Due date:
% Done:

0%

Estimated time:

Description

Observation

https://monitor.qa.suse.de/d/WebuiDb/webui-summary?viewPanel=80&orgId=1&from=1714042970812&to=1714056541493
shows an alert condition (dashed red line)

https://mailman.suse.de/mlarch/SuSE/osd-admins/2024/osd-admins.2024.04/msg00148.html
is the corresponding alert which bundles two alerts and only the less significant one was commented on. We should still look into the 5xx HTTP response alert problem

Acceptance criteria

  • AC1: The error source is addressed and handled

Suggestions

  • Check the actual error about 5xx response and apply according mitigations and fixes or plan them in separate specific tickets if they are not trivial to fix
  • Maybe the alert is misconfigured but we should still look into the logs what the actual 5xx responses were about

Related issues 4 (2 open2 closed)

Related to openQA Infrastructure - action #122848: Configure grouped alerts in Grafana correctly size:MResolvedokurz2023-01-09

Actions
Related to openQA Infrastructure - action #138044: Grouped seemingly unrelated alert emails are confusing size:MRejectedokurz2023-10-09

Actions
Related to openQA Infrastructure - action #159396: Repeated HTTP Response alert for /tests and unresponsiveness due to potential detrimental impact of pg_dump (was: HTTP Response alert for /tests briefly going up to 15.7s) size:MFeedbackokurz2024-06-09

Actions
Blocked by openQA Project - action #159792: Add better logging for 500 errors on websocket routes size:MBlockedokurz2024-04-26

Actions
Actions

Also available in: Atom PDF