action #39497

closed

[sle][functional][u] send magic-sysrq-w to find out what is blocking the system

Added by mloviska over 5 years ago. Updated over 5 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
Bugs in existing tests
Target version:
SUSE QA - Milestone 19
Start date:
2018-08-09
Due date:
2018-10-09
% Done:

0%

Estimated time:
Difficulty:

Description

Observation

openQA test in scenario sle-12-SP4-Server-DVD-ppc64le-cryptlvm_minimal_x@ppc64le fails in yast2_lan

Reproducible

Fails since (at least) Build 0328 (current job)

Expected result

Last good: 0327 (or more recent)

Acceptance criteria

  • AC1: We can see whether any task is blocking the system even when we cannot log in to the system in post_fail_hook

Suggestions

  • See what we already did in https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/4843/files#diff-141f4b5a48eaecb0c631a0de23e41a51R1135 to collect the "blocked tasks" from the system when we cannot get a logged-in console (anymore)
  • Send the magic SysRq sequence to the system if the post_fail_hook fails to log in as reported above. That probably means we need to call select_console('log-console', await_console => 0) in the post_fail_hook and check manually whether we reach the expected login or logged-in prompt, or whether we are stuck and need to send magic-sysrq-w. Alternatively, we might need a "post_fail_hook for the post_fail_hook"
  • Make sure the output of magic SysRq is available in text form, not just in a screenshot, so that everybody can read it and we can also forward the text to external references, e.g. bug reports
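
The fallback described in the suggestions above could look roughly like the following sketch. It is written in Python only to illustrate the logic; the real test code is Perl, and the names extract_blocked_state, post_fail_report, send_sysrq_w and read_serial_log are hypothetical helpers, not existing os-autoinst API. It assumes the kernel writes a "Show Blocked State" header to the serial log when it handles sysrq-w, and that the serial log is available as plain text.

```python
import re

# Header the kernel prints when handling sysrq-w. The prefix spelling differs
# between kernel versions ("SysRq : ..." vs "sysrq: ..."), so match loosely.
SYSRQ_W_MARKER = re.compile(r"sysrq\s*:\s*show\s+blocked\s+state", re.IGNORECASE)


def extract_blocked_state(serial_log: str) -> str:
    """Return the serial log text from the last sysrq-w header onwards.

    Returns an empty string if no header is found.
    """
    lines = serial_log.splitlines()
    start = None
    for index, line in enumerate(lines):
        if SYSRQ_W_MARKER.search(line):
            start = index  # keep the most recent occurrence
    if start is None:
        return ""
    return "\n".join(lines[start:])


def post_fail_report(login_succeeded, send_sysrq_w, read_serial_log):
    """Hypothetical fallback: only send sysrq-w when the login did not work.

    send_sysrq_w and read_serial_log are callables supplied by the caller;
    they stand in for whatever the test framework provides (assumption).
    """
    if login_succeeded:
        return None  # the normal log collection can run instead
    send_sysrq_w()
    return extract_blocked_state(read_serial_log())
```

The returned text can be uploaded as a plain-text log or pasted into a bug report, which covers the third suggestion. Taking everything after the header is a deliberate simplification: the end of the task dump is not marked reliably, so this sketch accepts some trailing noise rather than risk cutting off task entries.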

Further details

Always latest result in this scenario: latest


Related issues 2 (0 open, 2 closed)

Related to QA - coordination #32734: [functional][epic][u][new test] OOM handling (Rejected, 2018-03-03)
Has duplicate openQA Tests - action #39779: [sle][functional][y] test fails in yast2_lan - timeout and yast2 lan died (Rejected, okurz, 2018-08-15)