Project

General

Profile

Actions

action #90974

closed

coordination #39719: [saga][epic] Detection of "known failures" for stable tests, easy test results review and easy tracking of known issues

coordination #62420: [epic] Distinguish all types of incompletes

Make it obvious if qemu gets terminated unexpectedly due to out-of-memory

Added by okurz about 3 years ago. Updated over 2 years ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
Feature requests
Target version:
Start date:
Due date:
% Done:

0%

Estimated time:
40.00 h

Description

Motivation

qemu can need a lot of memory and is influenced by how openQA users configure the test jobs. This can lead to "out-of-memory" conditions and we should feedback this situation to the test reviewers. #90161 is a recent example where jobs failed on malbec.arch due to OOM but the feedback was suboptimal as the corresponding openQA test is https://openqa.suse.de/tests/5674784 which was incomplete with reason "Reason: backend died: QEMU exited unexpectedly, see log for details" and auto-review labeled with #71188 but not specifically pointing to an OOM condition

Acceptance criteria

  • AC1: if qemu dies due to being killed due to OOM this should be obvious from the incomplete reason

Suggestions

  • So far what okurz could find out the best way to detect OOM is to check the system logs, e.g. with dmesg | grep 'Out of memory: Killed process' which would also reveal the PID of the killed process. Then one could check that PID against the PID of the qemu process that the qemu backend monitors and feed that information back as incomplete reason.
  • Ensure that these conditions are not linked anymore to #71188
  • Crosscheck what other reasons could explain #71188 or close that as well if it's very likely only OOM that would explain such problems

further references:


Related issues 1 (0 open1 closed)

Copied from openQA Infrastructure - action #90161: [Alerting] malbec: Memory usage alert triggered briefly and turned OK within the next minuteResolvedokurz2021-03-16

Actions
Actions

Also available in: Atom PDF