action #25982

closed

[sle][functional][sporadic][hard][u][s390x] test fails in bootloader - "Please make sure your installation medium is available."

Added by okurz over 6 years ago. Updated about 6 years ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
Bugs in existing tests
Start date:
2017-10-11
Due date:
2018-04-10
% Done:

100%

Estimated time:
Difficulty:
hard

Description

Observation

openQA test in scenario sle-15-Installer-DVD-s390x-skip_registration+workaround_modules@zkvm fails in
bootloader_zkvm

Message in autoinst-log.txt:

no/incomplete answer.
Please make sure your installation medium is available.
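
When gathering statistics on how often this failure occurs, it can help to classify logs automatically. The following is a minimal sketch that only assumes the two message lines quoted above appear verbatim in autoinst-log.txt; the function name `has_medium_error` is hypothetical and not part of openQA.

```python
# Marker lines copied from the autoinst-log.txt excerpt in this ticket.
MARKERS = (
    "no/incomplete answer.",
    "Please make sure your installation medium is available.",
)


def has_medium_error(log_text: str) -> bool:
    """Return True if every marker line occurs somewhere in the log text."""
    return all(marker in log_text for marker in MARKERS)
```

Calling it on the contents of each downloaded log and counting the `True` results would give a rough failure count for this specific error.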

Reproducible

Fails since (at least) Build 300.1 (current job)

Acceptance criteria

AC1: DONE: get an idea about the root cause
AC2: DONE: gather statistics - it's pretty sporadic
AC3: DONE: Come up with a fix or a workaround to mitigate the issue -> better debug options with this PR; the real workaround is to restart the job
AC4: DONE: Implement post_fail_hook to bootloader_s390 and bootloader_zkvm to gather e.g. linuxrc logs (/var/log/linuxrc) -> see Pull Request

Expected result

Last good: 299.1 (or more recent)

Problem

Happens in about 5/100 cases (see comments) and most likely for the first tests in a new build, shortly after a new build hits openQA. Retriggering helps here.

  • REJECTED H1: Jobs are scheduled before the build is completely synced to osd -> E1-1 Check osd:/var/log/openqa_rsync.log and the job's log to see whether the job was triggered before syncing completed
  • REJECTED H2: Something about caching over network?
  • H3: Many jobs starting at the same time cause an overload, so the installer gets a negative reply when trying to access the FTP server ->

    • E3-1 Trigger many jobs in parallel for an existing build, e.g. the previous build, when the test queue for s390x is empty
    • E3-2 Check whether osd has any logs for the FTP server (or Apache logs) that show a denied request
  • ACCEPTED H4: Network adapter cannot be set up correctly and therefore linuxrc falls back to DHCP, which is not set up for the s390 subnet
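
The check described in E1-1 comes down to comparing two timestamps: when the sync finished and when the job started. A minimal sketch of that comparison follows. It assumes both timestamps have already been extracted by hand and written in ISO 8601 form; the actual format of openqa_rsync.log is not shown in this ticket, and the function name `started_before_sync` is hypothetical.

```python
from datetime import datetime


def started_before_sync(job_start: str, sync_done: str) -> bool:
    """Return True if the job started before the sync finished.

    Both arguments are assumed to be ISO 8601 timestamps,
    e.g. "2017-10-11T08:00:00".
    """
    return datetime.fromisoformat(job_start) < datetime.fromisoformat(sync_done)
```

A `True` result for a failed job would have supported H1; the values passed in are whatever the two logs report for that job.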

Further details

Always latest result in this scenario: latest

Workaround

Retrigger jobs. They should be able to find the repo after retriggering.
