action #174586
Incomplete jobs (not restarted) of last 24h alert Salt
Status: In Progress
Priority: Urgent
Assignee:
Category: -
Target version:
Start date: 2024-12-19
Due date:
% Done: 0%
Estimated time:
Tags:
Description
Observation
Values
B0=365
Labels
alertname Incomplete jobs (not restarted) of last 24h alert
grafana_folder Salt
rule_uid cXo2cmBVk
The spike could be due to the CDN and SCC problems; see https://suse.slack.com/archives/C02AYV7UJSD/p1734451708127589 for context. The alert is OK right now, but there are still a number of incomplete jobs in the panel, which might be worth investigating.
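For anyone triaging the remaining incompletes, the quantity the alert name describes can be sketched as follows. This is only an illustration and not the actual Grafana query behind the rule: the field names `result`, `clone_id`, and `t_finished` are assumed to mirror openQA job records, the function name is hypothetical, and the sample jobs are made up.

```python
from datetime import datetime, timedelta, timezone


def count_incomplete_not_restarted(jobs, now, window=timedelta(hours=24)):
    """Count jobs that finished as incomplete inside the window and were never restarted.

    Assumption: a job without a clone (clone_id is None) has not been restarted.
    """
    cutoff = now - window
    return sum(
        1
        for job in jobs
        if job["result"] == "incomplete"
        and job.get("clone_id") is None
        and job["t_finished"] >= cutoff
    )


# Hypothetical sample data, not taken from openqa.suse.de.
now = datetime(2024, 12, 19, 12, 0, tzinfo=timezone.utc)
sample_jobs = [
    {"id": 1, "result": "incomplete", "clone_id": None, "t_finished": now - timedelta(hours=2)},
    {"id": 2, "result": "incomplete", "clone_id": 5, "t_finished": now - timedelta(hours=3)},
    {"id": 3, "result": "passed", "clone_id": None, "t_finished": now - timedelta(hours=1)},
    {"id": 4, "result": "incomplete", "clone_id": None, "t_finished": now - timedelta(hours=30)},
]

print(count_incomplete_not_restarted(sample_jobs, now))
```

Only the first sample job is counted: the second was restarted, the third passed, and the fourth finished outside the 24h window.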
Updated by jbaier_cz about 6 hours ago
- Copied from action #154345: Incomplete jobs (not restarted) of last 24h alert Salt added
Updated by okurz about 5 hours ago
- Tags changed from reactive work, alert to reactive work, alert, infra
- Priority changed from High to Urgent
Updated by gpathak about 1 hour ago · Edited
- Status changed from New to In Progress
Seems like the issue related to SCC CDN is resolved.
The tests are passing:
- https://openqa.suse.de/tests/16250335#step/zypper_ref/18
- https://openqa.suse.de/tests/16252220#step/zypper_extend/92
- https://openqa.suse.de/tests/16236266#step/suseconnect_scc/22
- https://openqa.suse.de/tests/16252253#step/scc_registration/6
The dashboard now shows a fairly low number of incomplete jobs: https://monitor.qa.suse.de/d/nRDab3Jiz/openqa-jobs-test?orgId=1&viewPanel=panel-17&from=now-24h&to=now
In particular, this job always fails: https://openqa.suse.de/tests/16248796#line-66. I am not sure why the YAML schedule is missing or how we can provide the schedule file.
Tests related to the baremetal worker are also failing with the error "Could not retrieve required variable SUT_IP": https://openqa.suse.de/tests/16251337#line-117