Project

General

Profile

Actions

action #119161

closed

Approval step of qem-bot says incident has failed job in incidents but it looks empty on the dashboard size:M

Added by mkittler over 1 year ago. Updated about 1 year ago.

Status:
Resolved
Priority:
Low
Assignee:
Target version:
Start date:
2022-10-21
Due date:
% Done:

0%

Estimated time:

Description

Observation

INFO: Job 1935219 not found 
INFO: Job 1935211 not found 
INFO: Inc 25951 has failed job in incidents
INFO: Inc 25991 does not have any job_settings
INFO: Inc 26161 does not have any aggregates settings

(from https://gitlab.suse.de/qa-maintenance/bot-ng/-/jobs/1202686)

On the dashboard I couldn't see any jobs for that incident (https://dashboard.qam.suse.de/incident/25951) and when checking the openQA database I also haven't gotten any jobs:

openqa=# select id, BUILD from jobs where BUILD like '%25951%';
 id | build 
----+-------
(0 Zeilen)

This looks rather weird. It would be good to have an explanation what's going on and if we'd find a way to make it more obvious.

Acceptance criteria

  • AC1: "has failed job" messages are more specific

Suggestion

  • Distinguish between non-existing and (potentially) ignored jobs. The dashboard does not know if there are only ignored jobs so if possible (jobs for products in development groups are not submitted to the dashboard by the bot). Consider transmitting the amount of all/failed/ignored jobs and remembering that so that we can distinguish
  • Alternative: We could just point to the according gitlab CI step that potentially mentions ignored jobs
  • Make the message as specific as possible, depending on how much the bot actually known in the accept step

Related issues 2 (0 open2 closed)

Related to QA - action #103701: Resubmited incident (ID) with new release request (RR) inherits incident test results from previous RRResolvedosukup2021-12-08

Actions
Related to QA - action #107923: qem-bot: Ignore not-ok openQA jobs for specific incident based on openQA job comment size:MResolvedjbaier_cz

Actions
Actions #1

Updated by jbaier_cz over 1 year ago

Afaik the wording is just bad; the condition in https://github.com/openSUSE/qem-bot/blob/master/openqabot/approver.py#L67 just means that all jobs are not successful (because there is none at all). We can maybe distinguish between "there are failed jobs" and "there are no jobs" in https://github.com/openSUSE/qem-bot/blob/master/openqabot/approver.py#L124

Actions #2

Updated by okurz over 1 year ago

  • Priority changed from Normal to Low
  • Target version set to Ready
Actions #3

Updated by mkittler over 1 year ago

  • Subject changed from Approval step of qem-bot says incident has failed job in incidents but it looks empty on the dashboard to Approval step of qem-bot says incident has failed job in incidents but it looks empty on the dashboard size:M
  • Description updated (diff)
  • Status changed from New to Workable
Actions #4

Updated by kraih over 1 year ago

  • Assignee set to kraih

Maybe i'll practice some Python.

Actions #5

Updated by okurz over 1 year ago

  • Related to action #103701: Resubmited incident (ID) with new release request (RR) inherits incident test results from previous RR added
Actions #6

Updated by okurz over 1 year ago

Please see #107923#note-37 about what looks like a related issue. Or is it #103701 all over again?

Actions #7

Updated by kraih over 1 year ago

  • Assignee deleted (kraih)

Putting the ticket back into the queue for now. Will pick it up again later if nobody else wants to work on it.

Actions #8

Updated by jbaier_cz over 1 year ago

  • Related to action #107923: qem-bot: Ignore not-ok openQA jobs for specific incident based on openQA job comment size:M added
Actions #9

Updated by jbaier_cz over 1 year ago

Some of that was also targeted by https://github.com/openSUSE/qem-bot/pull/84, so this might be actually already solved.

Actions #10

Updated by okurz over 1 year ago

  • Target version changed from Ready to future
Actions #11

Updated by okurz about 1 year ago

  • Tags changed from reactive work to reactive work, dashboard, qem-bot, log
  • Status changed from Workable to Resolved
  • Assignee set to okurz
  • Target version changed from future to Ready
Actions

Also available in: Atom PDF