action #135491
Updated by okurz 11 months ago
## Observation fozzie and quinn are in NUE1, they failed to access the generic static iPXE menu(the same on with O3), I changed its dhcp config to kernel qa team's baremetal-support services(https://gitlab.suse.de/qa-sle/qanet-configs/-/merge_requests/75), they still failed to reach out this server. "TFTP open timeout" is reported: ![TFTP_error](TFTP_error.png) Could you please take a look? ## Problem atfpd on qanet apparently stuck ## Suggestions * Try to identify stuck server processes, restart, lazy unmount NFS shares and such ## Rollback steps * Unsilence alert `alertname=Packet loss between worker hosts and other hosts alert`