action #180116

open

coordination #102906: [saga][epic] Increased stability of tests with less "known failures", known incompletes handled automatically within openQA

coordination #175515: [epic] incomplete jobs with "Failed to find an available port: Address already in use"

Do not run openQA jobs if there are leftover QEMU processes

Added by okurz 18 days ago.

Status: New
Priority: Normal
Assignee: -
Category: Feature requests
Start date: 2025-04-07
Due date:
% Done: 0%
Estimated time:

Description

Motivation

Copied from #180110. A worker host can end up with leftover QEMU processes. As long as any such process exists, the worker should not even try to run openQA jobs.

Acceptance criteria

  • AC1: The worker does not run further openQA jobs while leftover QEMU processes exist, so that we don't end up with incomplete jobs if a process is stuck for good. Instead, an alert fires for the broken/unavailable worker so that we can handle the situation manually.
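
The check that AC1 asks for could look roughly like the following. This is a minimal Python sketch for illustration only, not openQA's actual (Perl) implementation. It assumes that the QEMU PID is recorded in a file named `qemu.pid` inside the worker's pool directory, and all function names are hypothetical. It is POSIX-only, because it probes the process with signal 0.

```python
import os
from pathlib import Path
from typing import Optional

# Assumption: the PID of the QEMU process is stored under this name in the
# pool directory. The ticket itself does not specify the file name.
PID_FILE_NAME = "qemu.pid"


def process_is_alive(pid: int) -> bool:
    """Return True if a process with this PID currently exists (POSIX only)."""
    try:
        os.kill(pid, 0)  # signal 0 performs the existence check without sending a signal
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    except OverflowError:
        return False  # the number is too large to be a valid PID
    return True


def leftover_qemu_pid(pool_dir: str) -> Optional[int]:
    """Return the recorded PID if that process is still running, else None."""
    pid_file = Path(pool_dir) / PID_FILE_NAME
    try:
        pid = int(pid_file.read_text().strip())
    except (FileNotFoundError, ValueError):
        return None  # no PID file, or unparsable content
    # PIDs <= 0 have a special meaning for kill(), so never probe them.
    return pid if pid > 0 and process_is_alive(pid) else None


def may_accept_job(pool_dir: str) -> bool:
    """Hypothetical pre-flight check: refuse jobs while a leftover process exists."""
    return leftover_qemu_pid(pool_dir) is None
```

A worker loop would call `may_accept_job(pool_dir)` before taking a job and report itself as broken otherwise. Note that a bare liveness check cannot tell whether the PID was reused by an unrelated process, so a real implementation would probably also verify that the process is actually QEMU.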

Suggestions

  • There must be something wrong with the self-check. Implementing a fullstack test for that feature might help figure out what. Spawning multiple worker instances locally that use the same pool directory (and hence conflict with each other) might also help reproduce this issue.
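
For orientation, the "Address already in use" symptom from the parent epic can be reproduced in isolation without openQA or QEMU. The following Python sketch is only a stand-in for the suggested reproduction: the first listener plays the role of a leftover process that still holds a port, the second one plays the role of a new job trying to use the same port. The function names are hypothetical, and the ticket does not state which port is affected.

```python
import errno
import socket


def bind_listener(port: int) -> socket.socket:
    """Bind a TCP listener on localhost; port 0 lets the OS pick a free port."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        sock.bind(("127.0.0.1", port))
        sock.listen(1)
    except OSError:
        sock.close()  # do not leak the socket if binding fails
        raise
    return sock


def reproduce_port_conflict() -> int:
    """Return the errno raised by the second bind, or 0 if it succeeded."""
    first = bind_listener(0)  # stand-in for the leftover process
    port = first.getsockname()[1]
    try:
        second = bind_listener(port)  # stand-in for the new job
    except OSError as err:
        return err.errno
    else:
        second.close()
        return 0
    finally:
        first.close()


if __name__ == "__main__":
    code = reproduce_port_conflict()
    print("conflict reproduced" if code == errno.EADDRINUSE else f"errno {code}")
```

A fullstack test as suggested above would replace both stand-ins with real worker instances sharing one pool directory. This sketch only shows the underlying socket error.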

Related issues 1 (1 open, 0 closed)

Copied from openQA Project (public) - action #180110: [sporadic] auto_review:"Failed to find an available port: Address already in use":retry, produces incomplete jobs on OSD, multiple machines (New)

#1

Updated by okurz 18 days ago

  • Copied from action #180110: [sporadic] auto_review:"Failed to find an available port: Address already in use":retry, produces incomplete jobs on OSD, multiple machines added