action #135491

closed

fozzie and quinn unable to access PXE server or iPXE server (TFTP open timeout)

Added by Julie_CAO over 1 year ago. Updated over 1 year ago.

Status: Resolved
Priority: High
Assignee:
Category: -
Start date: 2023-09-11
Due date:
% Done: 0%
Estimated time:
Tags:

Description

Observation

fozzie and quinn are in NUE1. They failed to reach the generic static iPXE menu (the same one used with O3). I changed their DHCP config to point to the kernel QA team's baremetal-support services (https://gitlab.suse.de/qa-sle/qanet-configs/-/merge_requests/75), but they still failed to reach this server.

"TFTP open timeout" is reported:
(see attached screenshot TFTP_error.png)

Could you please take a look?

Problem

atftpd on qanet is apparently stuck
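
To confirm whether the TFTP daemon on qanet answers at all, a read request can be sent by hand and the reply inspected. The following is a minimal sketch, not a tool used in this ticket: it assumes plain TFTP over UDP port 69 as described in RFC 1350, and the function names and the file name in the usage note are hypothetical.

```python
import socket
import struct

# TFTP opcodes (RFC 1350, plus OACK from RFC 2347)
OP_RRQ, OP_DATA, OP_ERROR, OP_OACK = 1, 3, 5, 6


def build_rrq(filename: str, mode: str = "octet") -> bytes:
    """Build a TFTP read request: opcode, filename, NUL, mode, NUL."""
    return (
        struct.pack("!H", OP_RRQ)
        + filename.encode("ascii") + b"\x00"
        + mode.encode("ascii") + b"\x00"
    )


def classify_reply(packet: bytes) -> str:
    """Map the opcode of a server reply to a short label."""
    if len(packet) < 2:
        return "malformed"
    (opcode,) = struct.unpack("!H", packet[:2])
    if opcode == OP_DATA:
        return "data"
    if opcode == OP_ERROR:
        return "error"
    if opcode == OP_OACK:
        return "oack"
    return "unexpected"


def probe_tftp(host: str, filename: str, port: int = 69, timeout: float = 5.0) -> str:
    """Send one RRQ and report what came back, or 'timeout' if nothing did."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.settimeout(timeout)
        sock.sendto(build_rrq(filename), (host, port))
        try:
            packet, _addr = sock.recvfrom(2048)
        except socket.timeout:
            return "timeout"
        except OSError:
            # e.g. an ICMP "port unreachable" surfaced by the OS
            return "unreachable"
        return classify_reply(packet)
```

For example, `probe_tftp("qanet", "some/boot/file")` returning "timeout" would match the symptom seen on fozzie and quinn. A result of "error" (such as file not found) still shows that the daemon is alive and answering.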

Suggestions

  • Try to identify stuck server processes, restart them, lazy-unmount NFS shares, and so on
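
One way to look for stuck server processes is to list everything in uninterruptible sleep (state "D"), which is where processes blocked on a hung NFS mount typically end up. This is a minimal sketch assuming a Linux host with /proc mounted. The helper names are hypothetical, and the ticket does not confirm that the daemon was actually in this state.

```python
import os


def parse_stat_line(line: str):
    """Extract (pid, comm, state) from the contents of /proc/<pid>/stat.

    The command name is wrapped in parentheses and may itself contain
    spaces or ")", so the last ")" is used as the delimiter.
    """
    open_idx = line.index("(")
    close_idx = line.rindex(")")
    pid = int(line[:open_idx].strip())
    comm = line[open_idx + 1:close_idx]
    state = line[close_idx + 1:].split()[0]
    return pid, comm, state


def find_uninterruptible(proc_root: str = "/proc"):
    """Return a sorted list of (pid, comm) for processes in state D."""
    stuck = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue
        stat_path = os.path.join(proc_root, entry, "stat")
        try:
            with open(stat_path, encoding="utf-8", errors="replace") as handle:
                pid, comm, state = parse_stat_line(handle.read())
        except (OSError, ValueError, IndexError):
            # Process exited while scanning, or the file was unreadable
            continue
        if state == "D":
            stuck.append((pid, comm))
    return sorted(stuck)
```

Running `find_uninterruptible()` on the affected host gives the candidates to restart. A process that stays in state D across several runs is likely waiting on I/O that will not complete until the underlying mount is dealt with, which is where the lazy unmount comes in.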

Rollback steps

  • Unsilence the alert alertname=Packet loss between worker hosts and other hosts

Files

TFTP_error.png (50.4 KB) Julie_CAO, 2023-09-11 09:30