action #28325

closed

[tools][sprint 201712.2][aarch64][bonus] worker incompletes job with timeout after ~9h on seattle14 - no action in log for more than 8h

Added by okurz over 6 years ago. Updated about 6 years ago.

Status: Resolved
Priority: Urgent
Assignee:
Category: Feature requests
Target version:
Start date: 2017-11-24
Due date:
% Done: 0%
Estimated time:

Description

Observation

https://openqa.suse.de/tests/1268572/file/autoinst-log.txt

shows:

16:24:03.5060 1116 <<< testapi::assert_screen(mustmatch=[
  'bootloader-shim-import-prompt',
  'bootloader-grub2'
], timeout=30)
01:16:43.0065 1109 signalhandler got TERM - loop 1

so there is a gap of almost nine hours between the last logged action and the TERM signal.
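
To spot such silent periods without reading the whole log, a small script can scan for large jumps between consecutive timestamps. The following is only a minimal sketch and not part of openQA: the function name `find_gaps` and the one-hour threshold are hypothetical, and it assumes each log entry starts with an `HH:MM:SS.ffff` timestamp without a date, so a timestamp that goes backwards is treated as a single midnight rollover.

```python
from datetime import datetime, timedelta


def find_gaps(lines, threshold=timedelta(hours=1)):
    """Yield (previous_line, next_line, gap) for timestamped lines
    that are at least `threshold` apart. Hypothetical helper."""
    prev_time = None
    prev_line = None
    for line in lines:
        stamp = line.split(" ", 1)[0]
        try:
            current = datetime.strptime(stamp, "%H:%M:%S.%f")
        except ValueError:
            # Continuation lines (e.g. the needle list) carry no timestamp.
            continue
        if prev_time is not None:
            gap = current - prev_time
            if gap < timedelta(0):
                # Assumption: the log has no date, so a negative
                # difference means the clock passed midnight once.
                gap += timedelta(days=1)
            if gap >= threshold:
                yield prev_line, line, gap
        prev_time, prev_line = current, line


# The excerpt from autoinst-log.txt quoted above.
sample = [
    "16:24:03.5060 1116 <<< testapi::assert_screen(mustmatch=[",
    "  'bootloader-shim-import-prompt',",
    "  'bootloader-grub2'",
    "], timeout=30)",
    "01:16:43.0065 1109 signalhandler got TERM - loop 1",
]

for before, after, gap in find_gaps(sample):
    print(f"{gap} without log output after: {before}")
```

Run against the excerpt above, this reports the single gap between the `assert_screen` call and the TERM signal. A job that stays silent for more than 24 hours would be under-reported because of the midnight assumption.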

The journal on the worker does not tell much either:

Nov 23 17:23:22 seattle14 worker[18759]: [Thu Nov 23 17:23:22 2017] [worker:info] 1109: WORKING 1268572
Nov 24 02:16:42 seattle14 worker[18759]: [Fri Nov 24 02:16:42 2017] [worker:warn] max job time exceeded, aborting 01268572-sle-15-Installer-DVD-aarch64-Build3
Nov 24 02:16:43 seattle14 worker[18759]: killed 1109
Nov 24 02:17:24 seattle14 worker[18759]: [Fri Nov 24 02:17:24 2017] [worker:warn] job 1268572 spent more time than MAX_JOB_TIME

Reproducible

First occurrence in some time

Problem

  • Can we find out what the job was doing during the gap? Do we need more logging here?