action #163928

Updated by okurz 11 days ago

## Observation 
 https://stats.openqa-monitor.qa.suse.de/d/WebuiDb/webui-summary?orgId=1&viewPanel=78&from=1720996087861&to=1720999161474 

 I took a look at the logs, which I attached to the ticket. 

 I can't spot the actual problem. The system seems to have performed an update, and it recovered after the services were restarted. 
 The unresponsiveness lasted from 00:42 to 01:05 (>20 min). 

 Looking at the logs, I see some errors from telegraf: 

 ``` 
 openqa telegraf[6820]: 2024-07-14T22:54:50Z E! [inputs.http] Error in plugin: [url=https://openqa.suse.de/admin/*]: Get "https://openqa.suse.de/admin/*": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
 ``` 

 and many debug messages from openQA like these: 

 ``` 
 Jul 15 00:54:29 openqa openqa[12024]: [debug] [pid:12024] _carry_over_candidate(14928963): ignoring job 14855612 with repeated problem
 Jul 15 00:54:29 openqa openqa[12024]: [debug] [pid:12024] _carry_over_candidate(14928963): checking take over from 14834954: _failure_reason=GOOD
 ```
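
 As a cross-check of the timestamps: the telegraf line is stamped in UTC (`Z` suffix), while the journal lines and the reported window appear to be in local time. A minimal Python sketch, assuming the host logs in CEST (UTC+2), which is not stated in the ticket, converts the telegraf timestamp and the `from`/`to` values of the Grafana URL to that offset and checks them against the 00:42 to 01:05 window. The variable names are only illustrative.

```python
from datetime import datetime, timedelta, timezone

# Assumption: journal timestamps and the reported window are in CEST (UTC+2).
CEST = timezone(timedelta(hours=2))

# Timestamp copied from the telegraf log line (UTC).
telegraf_utc = datetime(2024, 7, 14, 22, 54, 50, tzinfo=timezone.utc)
telegraf_local = telegraf_utc.astimezone(CEST)

# Reported window of unresponsiveness, read as local time on 2024-07-15.
window_start = datetime(2024, 7, 15, 0, 42, tzinfo=CEST)
window_end = datetime(2024, 7, 15, 1, 5, tzinfo=CEST)

# "from" and "to" query parameters of the Grafana URL (milliseconds since epoch).
dash_from = datetime.fromtimestamp(1720996087861 / 1000, tz=timezone.utc)
dash_to = datetime.fromtimestamp(1720999161474 / 1000, tz=timezone.utc)

# Does the telegraf timeout fall inside the reported window?
in_window = window_start <= telegraf_local <= window_end
# Does the dashboard range cover the whole reported window?
dashboard_covers_window = dash_from <= window_start and window_end <= dash_to

print(telegraf_local.isoformat(), in_window, dashboard_covers_window)
```

 Under that assumption, the telegraf timeout at 22:54:50Z corresponds to 00:54:50 local time, which is inside the reported window and within a minute of the `_carry_over_candidate` messages above.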
