Project

General

Profile

Actions

action #80408

closed

openQA Project (public) - coordination #39719: [saga][epic] Detection of "known failures" for stable tests, easy test results review and easy tracking of known issues

openQA Project (public) - coordination #62420: [epic] Distinguish all types of incompletes

revert longer timeout override for openQA services as we could not see less problems with corrupted worker cache

Added by okurz about 4 years ago. Updated about 4 years ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
-
Start date:
2020-11-26
Due date:
% Done:

0%

Estimated time:

Description

Motivation

To find out if worker cache services corrupt the sqlite database due to being killed on systemd service termination we enlarged the timeout on o3 and osd of all relevant worker systemd services temporarily in #80106 . As mkittler reported (confirm!) that neither helped with getting rid of corrupted cache nor did it prevent the killing of services but now the shutdown of systems can take much longer as we still have #62441

Acceptance criteria

  • AC1: openQA worker hosts shut down within less than 2m again

Suggestions

Revert all actions from #80106


Related issues 1 (0 open1 closed)

Copied from openQA Infrastructure (public) - action #80106: corrupted worker cache sqlite: Enlarge systemd service kill timeout temporarilyResolvednicksinger

Actions
Actions

Also available in: Atom PDF