Project

General

Profile

Actions

action #98562

open

Cancel jobs with invalid WORKER_CLASS after a timeout

Added by livdywan about 3 years ago. Updated about 3 years ago.

Status:
New
Priority:
Low
Assignee:
-
Category:
Feature requests
Target version:
Start date:
2021-09-13
Due date:
% Done:

0%

Estimated time:

Description

Motivation

@tinita was investigating job age alerts and found a job with a WORKER_CLASS that doesn't match any workers. This was traced by to @asmorodskyi who then identified the change which was incorrect use of +WORKER_CLASS (+WORKER_CLASS is combined rather than overridden).

Regardless of what caused this, instead of a developer monitoring jobs and figuring out what happened, we should have openQA cancel unmatch.

Acceptance criteria

  • AC1: Cancel unmatched jobs after a timeout
  • AC2: File a ticket

Workaround

Have a person monitor alerts and investigate jobs that never run, cancel the job and file a new ticket.


Related issues 2 (1 open1 closed)

Copied to openQA Project - action #100973: Cancel any scheduled jobs after a configurable timeout, e.g. days size:MResolvedosukup2021-09-13

Actions
Copied to openQA Project - coordination #102864: [epic] Inform openQA webUI users about potential worker class mismatch or long delaysNew2021-09-13

Actions
Actions

Also available in: Atom PDF