Project

General

Profile

Actions

action #119161

closed

Approval step of qem-bot says incident has failed job in incidents but it looks empty on the dashboard size:M

Added by mkittler about 2 years ago. Updated almost 2 years ago.

Status:
Resolved
Priority:
Low
Assignee:
Start date:
2022-10-21
Due date:
% Done:

0%

Estimated time:

Description

Observation

INFO: Job 1935219 not found 
INFO: Job 1935211 not found 
INFO: Inc 25951 has failed job in incidents
INFO: Inc 25991 does not have any job_settings
INFO: Inc 26161 does not have any aggregates settings

(from https://gitlab.suse.de/qa-maintenance/bot-ng/-/jobs/1202686)

On the dashboard I couldn't see any jobs for that incident (https://dashboard.qam.suse.de/incident/25951) and when checking the openQA database I also haven't gotten any jobs:

openqa=# select id, BUILD from jobs where BUILD like '%25951%';
 id | build 
----+-------
(0 Zeilen)

This looks rather weird. It would be good to have an explanation what's going on and if we'd find a way to make it more obvious.

Acceptance criteria

  • AC1: "has failed job" messages are more specific

Suggestion

  • Distinguish between non-existing and (potentially) ignored jobs. The dashboard does not know if there are only ignored jobs so if possible (jobs for products in development groups are not submitted to the dashboard by the bot). Consider transmitting the amount of all/failed/ignored jobs and remembering that so that we can distinguish
  • Alternative: We could just point to the according gitlab CI step that potentially mentions ignored jobs
  • Make the message as specific as possible, depending on how much the bot actually known in the accept step

Related issues 2 (0 open2 closed)

Related to QA (public) - action #103701: Resubmited incident (ID) with new release request (RR) inherits incident test results from previous RRResolvedosukup2021-12-08

Actions
Related to QA (public) - action #107923: qem-bot: Ignore not-ok openQA jobs for specific incident based on openQA job comment size:MResolvedjbaier_cz

Actions
Actions #1

Updated by jbaier_cz about 2 years ago

Afaik the wording is just bad; the condition in https://github.com/openSUSE/qem-bot/blob/master/openqabot/approver.py#L67 just means that all jobs are not successful (because there is none at all). We can maybe distinguish between "there are failed jobs" and "there are no jobs" in https://github.com/openSUSE/qem-bot/blob/master/openqabot/approver.py#L124

Actions #2

Updated by okurz about 2 years ago

  • Priority changed from Normal to Low
  • Target version set to Ready
Actions #3

Updated by mkittler about 2 years ago

  • Subject changed from Approval step of qem-bot says incident has failed job in incidents but it looks empty on the dashboard to Approval step of qem-bot says incident has failed job in incidents but it looks empty on the dashboard size:M
  • Description updated (diff)
  • Status changed from New to Workable
Actions #4

Updated by kraih about 2 years ago

  • Assignee set to kraih

Maybe i'll practice some Python.

Actions #5

Updated by okurz about 2 years ago

  • Related to action #103701: Resubmited incident (ID) with new release request (RR) inherits incident test results from previous RR added
Actions #6

Updated by okurz about 2 years ago

Please see #107923#note-37 about what looks like a related issue. Or is it #103701 all over again?

Actions #7

Updated by kraih about 2 years ago

  • Assignee deleted (kraih)

Putting the ticket back into the queue for now. Will pick it up again later if nobody else wants to work on it.

Actions #8

Updated by jbaier_cz about 2 years ago

  • Related to action #107923: qem-bot: Ignore not-ok openQA jobs for specific incident based on openQA job comment size:M added
Actions #9

Updated by jbaier_cz about 2 years ago

Some of that was also targeted by https://github.com/openSUSE/qem-bot/pull/84, so this might be actually already solved.

Actions #10

Updated by okurz about 2 years ago

  • Target version changed from Ready to future
Actions #11

Updated by okurz almost 2 years ago

  • Tags changed from reactive work to reactive work, dashboard, qem-bot, log
  • Status changed from Workable to Resolved
  • Assignee set to okurz
  • Target version changed from future to Ready
Actions

Also available in: Atom PDF