action #119281

[alert] baremetal-support: Memory usage alert size:M

Added by robert.richardson 3 months ago. Updated 3 months ago.

Status: Resolved
Priority: High
Target version:
Start date: 2022-10-24
Due date:
% Done: 0%
Estimated time:
Tags:

Description

Observation

Time of Grafana alert: 3:46

available (percentage): 0.039
-> [OK] at 4:37

Grafana alert paused

https://stats.openqa-monitor.qa.suse.de/d/GDbaremetal-support/dashboard-for-baremetal-support?tab=alert&orgId=1&from=1666484239091&to=1666498211158

Actually, it seems all of the figures are missing some data in the affected timeframe, e.g. CPU, ping, and others. This shouldn't be relevant, though, as we only alert on actual data. What we see in the above timeframe is a low-memory condition after host boot that lasted roughly one hour.
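To cross-check a low-memory condition like this directly on the guest, something along these lines could be used. It is only a sketch: the ticket does not show the query behind the Grafana panel, so treating "available (percentage)" as MemAvailable relative to MemTotal from /proc/meminfo is an assumption, and `mem_available_percent` is a hypothetical helper name.

```shell
#!/bin/sh
# mem_available_percent [MEMINFO_FILE]   (hypothetical helper)
# Prints MemAvailable as a percentage of MemTotal, with one decimal place.
# Reads /proc/meminfo by default; a different file can be passed for testing.
mem_available_percent() {
    awk '
        /^MemTotal:/     { total = $2 }
        /^MemAvailable:/ { avail = $2 }
        END { if (total > 0) printf "%.1f\n", avail * 100 / total }
    ' "${1:-/proc/meminfo}"
}
```

Calling `mem_available_percent` without arguments on baremetal-support shortly after boot would show whether available memory is still as low as the alert reported.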

Acceptance criteria

  • AC1: No more alerts after automatic restart

Suggestions

  • Log in to qamaster.qa.suse.de, which runs the VMs, either via ssh and then virsh, or with virt-manager (everybody who has access to the OSD salt-managed infrastructure can log in here as well)
  • Increase the VM memory assignment for the host baremetal-support
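
Before changing anything, the current memory assignment can be read on the VM host. A minimal sketch, assuming the usual `virsh dominfo` output with a `Max memory:` line reported in KiB; `domain_max_mem_kib` is a hypothetical helper name, not something from this ticket.

```shell
#!/bin/sh
# domain_max_mem_kib DOMAIN   (hypothetical helper)
# Prints the numeric value of the "Max memory:" line from `virsh dominfo`.
# Assumes the line looks like "Max memory:     4194304 KiB".
domain_max_mem_kib() {
    virsh dominfo "$1" | awk '/^Max memory:/ { print $3 }'
}
```

For example, `domain_max_mem_kib baremetal-support` run on qamaster.qa.suse.de would print the configured maximum in KiB, which can be compared before and after the change.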

History

#1 Updated by okurz 3 months ago

  • Priority changed from Normal to High

#2 Updated by cdywan 3 months ago

  • Subject changed from [alert] baremetal-support: Memory usage alert to [alert] baremetal-support: Memory usage alert size:M
  • Description updated (diff)
  • Status changed from New to Workable

#3 Updated by robert.richardson 3 months ago

  • Status changed from Workable to Resolved
  • Assignee set to robert.richardson

Bumped virtual RAM to 4 GiB

virsh shutdown baremetal-support
virsh setmaxmem baremetal-support 4G --config
virsh setmem baremetal-support 4G --config
virsh start baremetal-support

https://stats.openqa-monitor.qa.suse.de/d/GDbaremetal-support/dashboard-for-baremetal-support?orgId=1&from=1666952700000&to=1666953000000
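
For repeating the resize above, note that `virsh shutdown` only requests a shutdown and returns before the guest is off, so a `virsh start` issued immediately afterwards can fail. The following is a hedged sketch that polls `virsh domstate` until the domain reports "shut off" before continuing. The function names and the 120-second timeout are assumptions for illustration, not what was actually run here.

```shell
#!/bin/sh
# wait_for_shutoff DOMAIN [TIMEOUT_SECONDS]   (hypothetical helper)
# Polls `virsh domstate` once per second until the domain is "shut off".
# Returns 0 once it is shut off, 1 if the timeout expires first.
wait_for_shutoff() {
    domain=$1
    timeout=${2:-120}
    elapsed=0
    while [ "$elapsed" -lt "$timeout" ]; do
        state=$(virsh domstate "$domain" 2>/dev/null)
        [ "$state" = "shut off" ] && return 0
        sleep 1
        elapsed=$((elapsed + 1))
    done
    return 1
}

# resize_memory DOMAIN SIZE   (hypothetical helper)
# Same steps as in the comment above, with a wait between shutdown and start.
resize_memory() {
    domain=$1
    size=$2
    virsh shutdown "$domain" || return 1
    wait_for_shutoff "$domain" 120 || return 1
    # --config changes the persistent definition, applied on the next start
    virsh setmaxmem "$domain" "$size" --config || return 1
    virsh setmem "$domain" "$size" --config || return 1
    virsh start "$domain"
}
```

With these definitions, `resize_memory baremetal-support 4G` would perform the same change as the four commands above.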
