Project

General

Profile

action #100973

Cancel any scheduled jobs after a configurable timeout, e.g. days size:M

Added by okurz about 2 months ago. Updated 27 days ago.

Status:
Resolved
Priority:
Low
Assignee:
Category:
Feature requests
Target version:
Start date:
2021-09-13
Due date:
% Done:

0%

Estimated time:
Difficulty:

Description

Motivation

tinita was investigating job age alerts and found a job with a WORKER_CLASS that doesn't match any workers. This was traced by to asmorodskyi who then identified the change which was incorrect use of +WORKER_CLASS (+WORKER_CLASS is combined rather than overridden).

Regardless of what caused this, instead of a developer monitoring jobs and figuring out what happened, we should have openQA cancel unmatch.

Acceptance criteria

  • AC1: Cancel any scheduled jobs after a timeout

Suggestions

  • Cancel any job that is scheduled for multiple days, good default is 7 days
  • Do the cancellation in the scheduler, use an additional timer if performance is impacted

Workaround

Have a person monitor alerts and investigate jobs that never run, cancel the job and file a new ticket.


Related issues

Copied from openQA Project - action #98562: Cancel jobs with invalid WORKER_CLASS after a timeoutNew2021-09-13

History

#1 Updated by okurz about 2 months ago

  • Copied from action #98562: Cancel jobs with invalid WORKER_CLASS after a timeout added

#2 Updated by mkittler about 2 months ago

  • Subject changed from Cancel any scheduled jobs after a configurable timeout, e.g. days to Cancel any scheduled jobs after a configurable timeout, e.g. days size:M
  • Description updated (diff)
  • Status changed from New to Workable

#3 Updated by osukup about 1 month ago

  • Assignee set to osukup

i,ll try

#4 Updated by osukup about 1 month ago

  • Status changed from Workable to In Progress

#6 Updated by osukup 27 days ago

  • Status changed from In Progress to Resolved

merged and deployed

Also available in: Atom PDF