Project

General

Profile

Actions

action #119281

closed

[alert] baremetal-support: Memory usage alert size:M

Added by robert.richardson over 1 year ago. Updated over 1 year ago.

Status:
Resolved
Priority:
High
Target version:
Start date:
2022-10-24
Due date:
% Done:

0%

Estimated time:
Tags:

Description

Observation

time of grafana alert 3:46

available (percentage): 0.039
-> [OK] at 4:37

Grafana alert paused

https://stats.openqa-monitor.qa.suse.de/d/GDbaremetal-support/dashboard-for-baremetal-support?tab=alert&orgId=1&from=1666484239091&to=1666498211158

Actually it seems all of the figures are missing some data in the affected timeframe i.e. CPU, ping, and others. This shouldn't be relevant though as we only alert on actual data. What we see in the above timeframe is a low-memory condition after host boot for roughly one hour

Acceptance criteria

  • AC1: No more alerts after automatic restart

Suggestions

  • Login into qamaster.qa.suse.de running the VMs, either ssh and then use virsh or virt-manager (everybody that has access to the OSD salt managed infrastructure can login here as well)
  • Increase the VM memory assignment for the host baremetal-support
Actions #1

Updated by okurz over 1 year ago

  • Priority changed from Normal to High
Actions #2

Updated by livdywan over 1 year ago

  • Subject changed from [alert] baremetal-support: Memory usage alert to [alert] baremetal-support: Memory usage alert size:M
  • Description updated (diff)
  • Status changed from New to Workable
Actions #3

Updated by robert.richardson over 1 year ago

  • Status changed from Workable to Resolved
  • Assignee set to robert.richardson

Bumped virtual RAM to 4 GiB

virsh shutdown baremetal-support
virsh setmaxmem baremetal-support 4G --config
virsh setmem baremetal-support 4G --config
virsh start baremetal-support

https://stats.openqa-monitor.qa.suse.de/d/GDbaremetal-support/dashboard-for-baremetal-support?orgId=1&from=1666952700000&to=1666953000000

Actions

Also available in: Atom PDF