Project

General

Profile

Actions

action #89419

closed

Incomplete jobs after OSD deployment

Added by livdywan over 3 years ago. Updated over 3 years ago.

Status:
Rejected
Priority:
High
Assignee:
Category:
-
Target version:
Start date:
2021-03-03
Due date:
% Done:

0%

Estimated time:

Description

Observation

Possibly interesting excerpts from the settings:

CASEDIR https://github.com/hjluo/os-autoinst-distri-opensuse.git#unlock
NAME    05581139-sle-15-SP3-Regression-on-Migration-from-SLE12-SPx-s390x-Buildhjluo_os-autoinst-distri-opensuse_unlock-offline_sles12sp4_ltss_pscc_sdk-asmm-contm-lgm-tcm-wsm_all_full@hjluo_os-autoinst-distri-opensuse_unlock@s390x-kvm-sle12

Note: No alerts were observed.

Actions #1

Updated by mkittler over 3 years ago

  • Status changed from New to Rejected
  • Assignee set to mkittler

Incompletes with the reason quit: worker has been stopped or restarted are expected when a job has been cancelled because the worker was restarted. The jobs you've found have also been automatically cloned which is also expected. Hence the alerts are also not triggered by this behavior.

The only odd thing is of course that I've changed the deployment to avoid this kind of interruption so these kind of jobs shouldn't appear anymore unless one really stops a worker manually. The reason why the jobs you've found have been stopped is that openqa-worker.target was still active at the time on grenache-1 because it hasn't been restarted recently:

martchus@grenache-1:~> systemctl status openqa-worker.target
● openqa-worker.target - openQA Worker
   Loaded: loaded (/usr/lib/systemd/system/openqa-worker.target; disabled; vendor preset: disabled)
   Active: inactive (dead) since Wed 2021-03-03 16:18:21 CET; 1 weeks 1 days ago

Warning: Journal has been rotated since unit was started. Log output is incomplete or unavailable.
martchus@grenache-1:~> uptime
 14:56:04  40 Tage 11:21 an,  1 Benutzer,  Durchschnittslast: 1,43, 1,39, 1,50

But it is dead now and I've also took care about other workers in the meantime. So there's really nothing to be improved here at this point.

Actions

Also available in: Atom PDF