action #40544

closed

[OpenQA][IPMI backend] IPMI worker cannot survive reboot on Dell SUT

Added by xlai over 5 years ago. Updated almost 5 years ago.

Status: Resolved
Priority: Normal
Assignee:
Category: -
Target version: -
Start date: 2018-09-03
Due date:
% Done: 0%
Estimated time:

Description

We have two Dell machines, vh003.qa2.suse.asia and vh004.qa2.suse.asia. When they are bound to an IPMI worker, jobs on these two machines cannot survive a reboot. For example, after host installation, when the machine boots into the new OS, the SOL console shows only a black screen and does not respond at all. The same happens with any other simple reboot.

After debugging, John and Jerry found that the reset_console operation causes this failure: the existing SOL console connection is not properly cleaned up, which makes the new SOL console setup fail.

John and Jerry also have two proposed solutions, which are open for discussion. I will let them describe these in more detail in later comments.
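
To illustrate the cleanup problem described above, here is a minimal sketch of tearing down a stale SOL session before opening a new one. It is not one of the two proposals mentioned in this ticket, which are not spelled out here, and it is not the openQA backend code. It assumes `ipmitool` with the `lanplus` interface; the helper names (`ipmi_base_cmd`, `sol_reset_commands`, `reset_sol_console`) and the credentials are hypothetical.

```python
import subprocess
from typing import Callable, List


def ipmi_base_cmd(host: str, user: str, password: str) -> List[str]:
    """Build the common ipmitool prefix for an IPMI-over-LAN (lanplus) session."""
    return ["ipmitool", "-I", "lanplus", "-H", host, "-U", user, "-P", password]


def sol_reset_commands(host: str, user: str, password: str) -> List[List[str]]:
    """Return the [deactivate, activate] command lines for a SOL console reset."""
    base = ipmi_base_cmd(host, user, password)
    return [base + ["sol", "deactivate"], base + ["sol", "activate"]]


def reset_sol_console(
    host: str,
    user: str,
    password: str,
    run: Callable[..., object] = subprocess.run,
) -> List[str]:
    """Drop any existing SOL session, then return the command for a new one.

    `sol deactivate` can exit non-zero when no session is active, so its
    exit status is deliberately ignored (check=False). `sol activate` is
    interactive, so it is returned for the caller to spawn in a terminal
    instead of being run here.
    """
    deactivate, activate = sol_reset_commands(host, user, password)
    run(deactivate, check=False)
    return activate
```

A caller would start the returned `activate` command in whatever terminal or pty wrapper it uses for the console. Passing `run` as a parameter keeps the sketch testable without a real BMC.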


Related issues 1 (0 open, 1 closed)

Related to openQA Tests - action #32746: [sle][tools][remote-backends][hard] Incomplete job because console isn't responding correctly. Half-open socket on IPMI (Resolved, okurz, 2018-03-05)

Actions #1

Updated by XGWang0 over 5 years ago

I have a fix for this issue. I will file the PR ASAP if my local tests on the Dell and Supermicro machines show no problems.

Actions #2

Updated by cachen over 5 years ago

  • Related to action #32746: [sle][tools][remote-backends][hard] Incomplete job because console isn't responding correctly. Half-open socket on IPMI added
Actions #4

Updated by cachen over 5 years ago

The PR is on hold because of poo#32746; we are trying to fix this fully in the backend.

Actions #5

Updated by coolo over 5 years ago

  • Assignee changed from szarate to XGWang0
  • Priority changed from High to Normal
Actions #6

Updated by okurz almost 5 years ago

  • Category set to Regressions/Crashes
Actions #7

Updated by okurz almost 5 years ago

  • Project changed from openQA Project to openQA Infrastructure
  • Category deleted (Regressions/Crashes)
Actions #8

Updated by xlai almost 5 years ago

  • Status changed from New to Resolved