Project

General

Profile

Actions

action #13952

closed

too many warnings about dead workers in log

Added by okurz about 8 years ago. Updated about 8 years ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
-
Target version:
-
Start date:
2016-09-28
Due date:
% Done:

0%

Estimated time:

Description

observation

log /var/log/openqa show many warnings like the following

[Tue Sep 27 20:31:46 2016] [28789:warn] 592014 got a status update but has no worker. huh?
[Tue Sep 27 20:31:46 2016] [28789:warn] 592015 got a status update but has no worker. huh?
[Tue Sep 27 20:31:47 2016] [28783:warn] 592017 got an artefact but has no worker. huh?
[Tue Sep 27 20:31:47 2016] [28812:warn] 592014 got an artefact but has no worker. huh?
[Tue Sep 27 20:31:47 2016] [28812:warn] 592015 got an artefact but has no worker. huh?
[Tue Sep 27 20:31:47 2016] [28812:warn] 592017 got an artefact but has no worker. huh?
…

problem

The warnings reported in the log files are a symptom of the dead job detection (and mitigation). Problem is that the worker should react on the first 404 by the webui but isn't. It's retrying when it should rather abort the job processing at once.


Related issues 2 (0 open2 closed)

Related to openQA Project (public) - action #6564: (re-)add Job::worker_idResolvedmkittler2015-03-06

Actions
Related to openQA Project (public) - action #15386: Continue job already considered deadRejectedmkittler2016-12-08

Actions
Actions #1

Updated by okurz about 8 years ago

  • Private changed from Yes to No
Actions #2

Updated by mkittler about 8 years ago

  • Assignee set to mkittler
Actions #3

Updated by mkittler about 8 years ago

Didn't find out much:

Actions #4

Updated by okurz about 8 years ago

Actions #5

Updated by mkittler about 8 years ago

  • Status changed from New to In Progress
Actions #6

Updated by mkittler about 8 years ago

  • Status changed from In Progress to Resolved

PR has been merged. I can not test whether this works in production. Since the problem only occurs occasionally I close the ticket. We can reopen it when it turns out the fix was not sufficient.

Actions #7

Updated by mkittler about 8 years ago

  • Related to action #15386: Continue job already considered dead added
Actions

Also available in: Atom PDF