Project

General

Profile

Actions

action #119281

closed

[alert] baremetal-support: Memory usage alert size:M

Added by robert.richardson about 2 years ago. Updated about 2 years ago.

Status:
Resolved
Priority:
High
Start date:
2022-10-24
Due date:
% Done:

0%

Estimated time:
Tags:

Description

Observation

time of grafana alert 3:46

available (percentage): 0.039
-> [OK] at 4:37

Grafana alert paused

https://stats.openqa-monitor.qa.suse.de/d/GDbaremetal-support/dashboard-for-baremetal-support?tab=alert&orgId=1&from=1666484239091&to=1666498211158

Actually it seems all of the figures are missing some data in the affected timeframe i.e. CPU, ping, and others. This shouldn't be relevant though as we only alert on actual data. What we see in the above timeframe is a low-memory condition after host boot for roughly one hour

Acceptance criteria

  • AC1: No more alerts after automatic restart

Suggestions

  • Login into qamaster.qa.suse.de running the VMs, either ssh and then use virsh or virt-manager (everybody that has access to the OSD salt managed infrastructure can login here as well)
  • Increase the VM memory assignment for the host baremetal-support
Actions

Also available in: Atom PDF