Actions
action #154426
closedHTTP Response alert Salt alerting and autoresolving shortly size:M
Status:
Resolved
Priority:
High
Assignee:
Category:
-
Target version:
Start date:
Due date:
% Done:
0%
Estimated time:
Tags:
Description
Observation¶
From Grafana/ osd-admins@suse.de
Values
B0=13.681051591
Labels
alertname HTTP Response alert
grafana_folder Salt
rule_uid tm0h5mf4k
Suggestions¶
- Take a look what we did (or did not do) in #133397
- Start from https://stats.openqa-monitor.qa.suse.de/d/WebuiDb/webui-summary?viewPanel=78&orgId=1&from=1706482095115&to=1706485674645 and compare to other panels on the webUI dashboard https://stats.openqa-monitor.qa.suse.de/d/WebuiDb/webui-summary?orgId=1&from=1706482095115&to=1706485674645 for the same time
- Look into system journal and other logs on OSD from that timeframe to find out what happened
- Fix the actual problem or look into preventing false alerts
Updated by jbaier_cz 11 months ago
- Copied from action #133397: HTTP Response alert Salt alerting and autoresolving shortly size:M added
Updated by okurz 11 months ago
- Tags changed from alert, osd, grafana, http response, infra to alert, osd, grafana, http response, infra, reactive work
- Subject changed from HTTP Response alert Salt alerting and autoresolving shortly size:M to HTTP Response alert Salt alerting and autoresolving shortly
Updated by jbaier_cz 11 months ago
- Status changed from Workable to In Progress
- Assignee set to jbaier_cz
Relevant logs from osd shows some network issue during the period:
Jan 29 00:14:20 openqa telegraf[1327]: 2024-01-28T23:14:20Z E! [inputs.http] Error in plugin: [url=https://openqa.suse.de/admin/influxdb/minion]: Get "https://openqa.suse.de/admin/influxdb/minion": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
Jan 29 00:14:20 openqa telegraf[1327]: 2024-01-28T23:14:20Z E! [inputs.http] Error in plugin: [url=https://openqa.suse.de/admin/influxdb/jobs]: Get "https://openqa.suse.de/admin/influxdb/jobs": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
Not related, but maybe interesting line:
Jan 29 00:15:05 openqa openqa[6000]: [debug] Rejecting authentication for user "openqaworker4" with ip "10.168.192.181", valid key "160AA95F68C410D5", secret "E38C9451DB07468D", timestamp mismatch - check whether clocks on the local host and the web UI host are in sync
Also worth noticing, there is almost no CPU load and almost zero networking traffic during that period
Actions