Project

General

Profile

Actions

action #18826

closed

action #18144: [tools] restart ipmi management controller before every ipmi job

[tools] Investigate serial over lan disconnects for ipmi

Added by RBrownSUSE over 7 years ago. Updated over 7 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
-
Target version:
Start date:
2017-03-29
Due date:
% Done:

70%

Estimated time:

Description

Found while working on issue 18144

ipmi console sometimes reports timeouts or excess errors, disconnecting its serial over LAN

Current attempt to resolve this is to enable serial keepalive, deployed to openqaw2 on 26 Apr 15:45

If this doesn't work, there are other keepalive options, or else the rather expensive approach of monitoring if the SOL is live and reconnecting within the backend.

Actions #2

Updated by coolo over 7 years ago

but the virt tests are okayish

Actions #3

Updated by RBrownSUSE over 7 years ago

Yes, noted, investigating

Actions #4

Updated by RBrownSUSE over 7 years ago

  • Status changed from New to In Progress
Actions #5

Updated by RBrownSUSE over 7 years ago

  • Status changed from In Progress to Resolved

Serial disconnect issues resolved by https://github.com/os-autoinst/os-autoinst/pull/777

Serial keep alive workaround removed as no longer beneficial with auto reconnect

No iKVM issues reported since regular nightly restart of the card, so no evidence that the whole mc controller needs to be restarted on every job

Actions

Also available in: Atom PDF