Project

General

Profile

action #119281

Updated by livdywan about 2 years ago

## Observation 

 time of grafana alert 3:46 

 available (percentage): 0.039 
 -> [OK] at 4:37 

 Grafana alert paused 

 https://stats.openqa-monitor.qa.suse.de/d/GDbaremetal-support/dashboard-for-baremetal-support?tab=alert&orgId=1&from=1666484239091&to=1666498211158 

 Actually it seems all of the figures are missing some data in the affected timeframe i.e. CPU, ping, and others. This shouldn't be relevant though as we only alert on actual data. What we see in the above timeframe is a low-memory condition after host boot for roughly one hour 

 ## Acceptance criteria 

 * **AC1:** No more alerts after automatic restart 

 ## Suggestions 

 * Login into qamaster.qa.suse.de running the VMs, either ssh and then use virsh or virt-manager (everybody that has access to the OSD salt managed infrastructure can login here as well) 
 * Increase the VM memory assignment for the host baremetal-support

Back