action #14068

closed

[tools] Gather more system information and logs when boot/reboot times out

Added by okurz over 7 years ago. Updated about 4 years ago.

Status: Resolved
Priority: Normal
Assignee:
Category: Enhancement to existing tests
Target version: -
Start date: 2016-09-23
Due date:
% Done: 100%
Estimated time: (Total: 0.00 h)
Difficulty:

Description

observation

For example, https://openqa.suse.de/tests/600788#step/zypper_migration/9 fails after migration. "first_boot" already has some error handling that presses "esc", but we need the same here, in reboot_gnome, and when linuxrc does not boot and gets stuck at the progress bar, see bsc#999231.

problem

Online migration currently fails often, so we want this urgently.
Gathering logs is not easy because the system can stop at very different steps and in many cases does not allow logging into an existing shell (e.g. when stuck during boot).

suggestion

  • Press "esc" when boot/reboot times out to gather some console information, as we already do in "first_boot"
  • Instruct qemu backends to do a memory dump and save it as we do for logs
    • Add qemu backend support to save a memory dump
    • In the post_fail_hook of the corresponding tests (start with first_boot), trigger the memdump
    • Save the memdump so that it is accessible
    • Make sure the memdump is not too big (e.g. < 2MB), as we have about 1000 failing tests each day and not infinite disk space
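
The suggested flow could be sketched roughly as follows. This is only an illustration in Python, not openQA's actual test API: `on_boot_timeout`, `compress_if_small_enough`, `MAX_DUMP_BYTES` and the three injected callables are hypothetical names, and gzip compression is an assumption about how the dump might be kept under the size limit.

```python
import gzip
from typing import Callable, Optional

# Hypothetical cap, taken from the "< 2MB" figure in the suggestion above.
MAX_DUMP_BYTES = 2 * 1024 * 1024


def compress_if_small_enough(raw: bytes, limit: int = MAX_DUMP_BYTES) -> Optional[bytes]:
    """Return the gzip-compressed dump, or None if it is still not below the limit."""
    packed = gzip.compress(raw)
    return packed if len(packed) < limit else None


def on_boot_timeout(
    send_key: Callable[[str], None],
    take_memory_dump: Callable[[], bytes],
    upload: Callable[[str, bytes], None],
) -> bool:
    """Collect debug data after a boot/reboot timeout.

    Returns True if a memory dump was uploaded, False if it was too big.
    """
    # Press "esc" first so console messages hidden by the splash screen show up.
    send_key("esc")
    packed = compress_if_small_enough(take_memory_dump())
    if packed is None:
        return False  # skip oversized dumps to protect disk space
    upload("memdump.gz", packed)
    return True
```

Passing the key press, dump and upload steps in as callables keeps the sketch independent of any particular backend; a failure hook would supply the real implementations.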

Checklist

  • Have a mock method in the backend base class so the memory dump method can be called safely without crashing
  • Have the qemu backend support memory dumps
  • Have the WebUI/Worker upload the memory dumps
  • Have the WebUI display the memory dumps
  • Have the WebUI register and handle memory dumps and disk files so that gru can run cleanups when needed
  • Have the WebUI display the command line needed to respawn the VM
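
The first two checklist items might look something like the sketch below. It is written in Python purely for illustration and does not reflect the real backend code: `BaseBackend`, `QemuBackend`, `save_memory_dump` and the `send_qmp` transport are hypothetical names. The only real interface assumed is QEMU's QMP command `dump-guest-memory` with its `paging` and `protocol` arguments.

```python
import json
from typing import Callable, Optional


class BaseBackend:
    """Hypothetical backend base class."""

    def save_memory_dump(self, filename: str) -> Optional[str]:
        # Default for backends without dump support: do nothing and return
        # None instead of raising, so a failure hook can call this on any
        # backend without crashing.
        return None


class QemuBackend(BaseBackend):
    """Hypothetical qemu backend that asks QEMU for a guest memory dump."""

    def __init__(self, send_qmp: Callable[[str], None]) -> None:
        # send_qmp is an assumed transport: a callable that delivers one
        # JSON-encoded QMP command to the running QEMU process.
        self._send_qmp = send_qmp

    def save_memory_dump(self, filename: str) -> Optional[str]:
        command = {
            "execute": "dump-guest-memory",
            "arguments": {"paging": False, "protocol": f"file:{filename}"},
        }
        self._send_qmp(json.dumps(command))
        return filename
```

With this shape, a caller can treat a `None` return value as "no dump available" and skip the upload step.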

Subtasks 6 (0 open, 6 closed)

action #15170: boot_to_desktop should use same error analysis approach as first_boot | Resolved | okurz | 2016-11-30
action #13874: [Build 2141] test reboot_gnome fails in reboot, should press "esc" to show details | Resolved | mkravec | 2016-09-23
action #16286: online_migration_setup should use same emergency handling as first_boot | Resolved | michalnowak | 2017-01-27
action #17638: [migration] online_migration_setup should use some error investigation like e.g. first_boot, reboot_gnome, etc. | Resolved | qmsu | 2017-03-09
action #17196: [tw][gnome-live] test fails in reboot_gnome to show logout dialog | Resolved | dimstar | 2017-02-19
openQA Project - action #36601: Display the command to spawn a VM for virtualization backends | Rejected | okurz | 2018-05-28

Related issues 6 (1 open, 5 closed)

Related to openQA Tests - action #14086: [Build 2160] test zypper_patch fails to reboot on ppc64le | Resolved | mitiao | 2016-10-06
Related to openQA Tests - action #13896: collect linuxrc logs on installation startup problems / turn off plymouth to debug startup problems | Resolved | okurz | 2016-09-26
Related to openQA Project - action #12246: [tools]upload of log files can fail sometimes (was: https://openqa.suse.de/tests/412464 has no X-related log files) | Resolved | szarate | 2016-06-07
Related to openQA Project - action #12836: preserve disk image / virtual machine / keep them running in case of failures on demand | Workable | - | 2016-07-24
Related to openQA Tests - action #16520: [qam][opensuse][sle][functional] enhance logging and debugging in case of failed shutdown, e.g. press 'esc' on plymouth splash screen | Resolved | nicksinger | 2017-02-06 | 2017-11-08
Related to openQA Tests - action #34609: [sle][functional][u][medium] Improve Implementation of workaround for bsc#1083646 and debug output in reconnect_s390 on S390-KVM | Rejected | mgriessmeier | 2018-04-10 | 2018-04-24
