action #119281

closed

[alert] baremetal-support: Memory usage alert size:M

Added by robert.richardson about 2 years ago. Updated about 2 years ago.

Status: Resolved
Priority: High
Start date: 2022-10-24
Due date:
% Done: 0%
Estimated time:
Tags:

Description

Observation

Time of Grafana alert: 3:46

available (percentage): 0.039
-> [OK] at 4:37

Grafana alert paused

https://stats.openqa-monitor.qa.suse.de/d/GDbaremetal-support/dashboard-for-baremetal-support?tab=alert&orgId=1&from=1666484239091&to=1666498211158

All of the panels, including CPU and ping, appear to be missing some data in the affected timeframe. This should not be relevant, though, as we only alert on actual data. What we see in that timeframe is a low-memory condition lasting roughly one hour after host boot.

Acceptance criteria

  • AC1: No more alerts after automatic restart

Suggestions

  • Log in to qamaster.qa.suse.de, which runs the VMs, either via ssh and then virsh, or via virt-manager (everybody who has access to the OSD salt-managed infrastructure can log in there as well)
  • Increase the VM memory assignment for the host baremetal-support
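
Before changing the assignment it can help to check what the VM currently has. The following is a minimal sketch, assuming `virsh dominfo` prints a line of the form `Max memory: <number> KiB`; the helper name `domain_max_mem_mib` is hypothetical and not part of this ticket.

```shell
#!/bin/sh
# Hypothetical helper: print a domain's maximum memory in MiB.
# Assumes `virsh dominfo` reports a "Max memory: <n> KiB" line.
domain_max_mem_mib() {
    virsh dominfo "$1" | awk '/^Max memory:/ { printf "%d\n", $3 / 1024 }'
}

# Example (run on the VM host):
#   domain_max_mem_mib baremetal-support
```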
Actions #1

Updated by okurz about 2 years ago

  • Priority changed from Normal to High
Actions #2

Updated by livdywan about 2 years ago

  • Subject changed from [alert] baremetal-support: Memory usage alert to [alert] baremetal-support: Memory usage alert size:M
  • Description updated (diff)
  • Status changed from New to Workable
Actions #3

Updated by robert.richardson about 2 years ago

  • Status changed from Workable to Resolved
  • Assignee set to robert.richardson

Bumped virtual RAM to 4 GiB

virsh shutdown baremetal-support
virsh setmaxmem baremetal-support 4G --config
virsh setmem baremetal-support 4G --config
virsh start baremetal-support
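
Note that `virsh shutdown` only requests a guest shutdown and may return before the guest has powered off, in which case a `virsh start` issued right afterwards can fail. A minimal sketch of a wait step follows, assuming `virsh domstate` prints `shut off` for a stopped domain; the helper name `wait_for_shutoff` and the 120-second default timeout are hypothetical and were not used in this ticket.

```shell
#!/bin/sh
# Hypothetical helper: wait until a libvirt domain reports "shut off".
# usage: wait_for_shutoff DOMAIN [TIMEOUT_SECONDS]
# Returns 0 once the domain is shut off, 1 if the timeout expires first.
wait_for_shutoff() {
    domain=$1
    timeout=${2:-120}
    elapsed=0
    while [ "$elapsed" -lt "$timeout" ]; do
        state=$(virsh domstate "$domain" 2>/dev/null)
        if [ "$state" = "shut off" ]; then
            return 0
        fi
        sleep 1
        elapsed=$((elapsed + 1))
    done
    return 1
}

# Example (run on the VM host):
#   virsh shutdown baremetal-support
#   wait_for_shutoff baremetal-support && virsh start baremetal-support
```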

https://stats.openqa-monitor.qa.suse.de/d/GDbaremetal-support/dashboard-for-baremetal-support?orgId=1&from=1666952700000&to=1666953000000
