action #137300

closed

[FIRING:1] (Incomplete jobs (not restarted) of last 24h alert Salt size:M

Added by tinita about 1 year ago. Updated about 1 year ago.

Status: Resolved
Priority: High
Assignee:
Category: Regressions/Crashes
Target version:
Start date: 2023-10-02
Due date:
% Done: 0%
Estimated time:

Description

Observation

Firing [stats.openqa-monitor.qa.suse.de]
Incomplete jobs (not restarted) of last 24h alert
View alert [stats.openqa-monitor.qa.suse.de]
Values
B0=314 
Labels
alertname
Incomplete jobs (not restarted) of last 24h alert

http://stats.openqa-monitor.qa.suse.de/alerting/grafana/cXo2cmBVk/view

Acceptance criteria

  • AC1: Alert is not triggered anymore
  • AC2: It is known what triggered the alert originally

Suggestions

  • Investigate what happened on September 28th
  • Run select id,test,reason from jobs where result='incomplete' and t_created >= '2023-09-27' and t_created <= '2023-09-29' limit 30; to spot any obvious cause in the reason column. Grouping by a shortened reason would probably make patterns easier to see.
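
The "group by shortened reason" idea could look roughly like the sketch below. It keeps the table and column names from the query above (jobs, result, reason, t_created) and the same date range. Everything else is an assumption made for illustration: the prefix length, the helper names grouped_reasons and demo_connection, and the placeholder rows, which are invented so the query can be tried locally on an in-memory SQLite database. They are not data from the openQA instance. The SQL uses only substr, count and GROUP BY, so it should also run unchanged on PostgreSQL apart from the parameter placeholder style.

```python
import sqlite3

# Count incomplete jobs per shortened reason for the window named in the ticket.
# "GROUP BY 1" groups by the first output column (the reason prefix).
GROUPED_REASONS_SQL = """
SELECT substr(reason, 1, ?) AS short_reason, count(*) AS job_count
FROM jobs
WHERE result = 'incomplete'
  AND t_created >= '2023-09-27'
  AND t_created <= '2023-09-29'
GROUP BY 1
ORDER BY job_count DESC, short_reason
"""


def grouped_reasons(conn, prefix_len=40):
    """Return (short_reason, job_count) rows, most frequent reason first.

    prefix_len is a hypothetical tuning knob: how many leading characters
    of the reason are kept before grouping.
    """
    return conn.execute(GROUPED_REASONS_SQL, (prefix_len,)).fetchall()


def demo_connection():
    """Build an in-memory table with placeholder rows (not real openQA data)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE jobs (id INTEGER PRIMARY KEY, test TEXT, "
        "result TEXT, reason TEXT, t_created TEXT)"
    )
    rows = [
        (1, "test_a", "incomplete", "hypothetical failure A, detail 1", "2023-09-28 10:00:00"),
        (2, "test_b", "incomplete", "hypothetical failure A, detail 2", "2023-09-28 11:00:00"),
        (3, "test_c", "incomplete", "hypothetical failure B, detail 1", "2023-09-28 12:00:00"),
        # Ignored by the query: not incomplete.
        (4, "test_d", "passed", None, "2023-09-28 12:30:00"),
        # Ignored by the query: outside the date range.
        (5, "test_e", "incomplete", "hypothetical failure A, detail 3", "2023-10-01 09:00:00"),
    ]
    conn.executemany("INSERT INTO jobs VALUES (?, ?, ?, ?, ?)", rows)
    return conn


if __name__ == "__main__":
    for short_reason, job_count in grouped_reasons(demo_connection(), prefix_len=22):
        print(f"{job_count:5d}  {short_reason}")
```

A short prefix merges reasons that differ only in trailing details such as worker or asset names; a longer one keeps them apart. Trying a few lengths is probably the quickest way to find a useful grouping.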

Related issues 1 (0 open, 1 closed)

Related to openQA Project - action #96684: Abort asset download via the cache service when related job runs into a timeout (or is otherwise cancelled) size:M (Rejected, mkittler, 2021-08-09)
