action #135491

Updated by okurz 8 months ago

## Observation 
 fozzie and quinn are in NUE1. They failed to access the generic static iPXE menu (the same one as with O3). I changed their DHCP config to point to the kernel QA team's baremetal-support services (https://gitlab.suse.de/qa-sle/qanet-configs/-/merge_requests/75), but they still failed to reach this server.  

 "TFTP open timeout" is reported: 
 ![TFTP_error](TFTP_error.png) 

 Could you please take a look? 

 ## Problem 
 atftpd on qanet is apparently stuck 
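
 To check from another host whether the TFTP daemon answers at all, a minimal probe could look like the sketch below. It sends a read request as described in RFC 1350 and treats any DATA or ERROR reply as "the daemon is alive". The function names, the IPv4-only socket, the 3 second timeout and the file name used in the usage note are assumptions for illustration, not taken from the actual setup.

```python
import socket
import struct

OP_RRQ, OP_DATA, OP_ERROR = 1, 3, 5  # TFTP opcodes from RFC 1350


def build_rrq(filename: str, mode: str = "octet") -> bytes:
    """Build a TFTP read request: opcode, filename, NUL, mode, NUL."""
    return (
        struct.pack("!H", OP_RRQ)
        + filename.encode("ascii") + b"\0"
        + mode.encode("ascii") + b"\0"
    )


def tftp_responds(host: str, filename: str, port: int = 69, timeout: float = 3.0) -> bool:
    """Return True if the server sends any DATA or ERROR packet in time.

    An ERROR reply (e.g. "file not found") still proves the daemon is
    answering, so it counts as a response. IPv4 only (assumption).
    """
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.settimeout(timeout)
        sock.sendto(build_rrq(filename), (host, port))
        try:
            reply, _addr = sock.recvfrom(516)  # 4-byte header + 512-byte block
        except socket.timeout:
            return False  # matches the "TFTP open timeout" symptom
    if len(reply) < 2:
        return False
    (opcode,) = struct.unpack("!H", reply[:2])
    return opcode in (OP_DATA, OP_ERROR)
```

 Usage would be something like `tftp_responds("<tftp-server>", "ipxe.efi")`, where both the host placeholder and the file name are hypothetical. A `False` result only says that no reply arrived within the timeout; it does not distinguish a stuck daemon from a firewall or routing problem.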

 ## Suggestions 
 * Try to identify stuck server processes, restart them, lazy-unmount NFS shares and such 
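
 One way to look for stuck processes is to list everything in uninterruptible sleep (state `D`), which is where processes blocked on a hung NFS mount typically end up. The following is a rough sketch under that assumption, reading the Linux `/proc/<pid>/stat` files directly; the function name and the `proc_root` parameter are made up for illustration.

```python
from pathlib import Path


def find_uninterruptible(proc_root: str = "/proc") -> list[tuple[int, str]]:
    """Return (pid, command) pairs for processes currently in state 'D'."""
    stuck = []
    for entry in Path(proc_root).iterdir():
        if not entry.name.isdigit():
            continue  # skip non-process entries such as "self" or "meminfo"
        try:
            stat = (entry / "stat").read_text(errors="replace")
        except OSError:
            continue  # process exited in the meantime or is not readable
        # Format is "pid (comm) state ..."; comm may itself contain spaces
        # or parentheses, so split on the first "(" and the last ")".
        start, end = stat.find("("), stat.rfind(")")
        if start == -1 or end == -1:
            continue
        fields = stat[end + 1:].split()
        if fields and fields[0] == "D":
            stuck.append((int(entry.name), stat[start + 1:end]))
    return sorted(stuck)


if __name__ == "__main__":
    for pid, comm in find_uninterruptible():
        print(pid, comm)
```

 Short-lived `D` states are normal during regular I/O, so a process is only a candidate for being stuck if it shows up repeatedly across several runs.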

 ## Rollback steps 
 * Unsilence alert `alertname=Packet loss between worker hosts and other hosts alert`
