Project

General

Profile

action #128345

Updated by livdywan about 1 year ago

 
 ## Observation 

 from **OpenQA logreport for ariel.suse-dmz.opensuse.org**: 

 ``` 
 [2023-04-27T07:29:38.886066Z] [error] Worker 30538 has no heartbeat (400 seconds), restarting 
 ``` 

 ## Acceptance criteria 
 * **AC1:** Heart beat messages are not observed in log reports 

  

 ## Suggestions 
 * "Worker $PID" refers to the process on the web UI since this is where we use Mojo workers (prefork HTTP server) and nowhere else 
 * Heartbeats are a recurring write of a small message from the prefork workers to the manager every few seconds, this must have been blocked by a blocking operation (like a syscall or database query) 
 * Check the WebUI logs for relevant messages

Back