Project

General

Profile

Actions

action #164284

open

[FIRING:1] worker-arm1 (worker-arm1: System load alert openQA worker-arm1 salt system_load_alert_worker-arm1 worker) size:S

Added by livdywan 5 days ago. Updated 1 day ago.

Status:
Workable
Priority:
High
Assignee:
-
Category:
Regressions/Crashes
Target version:
Start date:
Due date:
% Done:

0%

Estimated time:

Description

Observation

The load was exceeding our expected limits for 15 minutes, see https://stats.openqa-monitor.qa.suse.de/d/WDworker-arm1/worker-dashboard-worker-arm1?orgId=1

Acceptance Criteria

  • AC1: No alerts about high load for normal openQA workloads on worker-arm1

Suggestions

  • Look for cues on what caused the high load at the time
  • Let's not increase the load limits in grafana for now
    • Confirm that no jobs were failing or incomplete because of the load
    • Decrease the load limits in the worker i.e. workerconf.sls
Actions #1

Updated by okurz 5 days ago

  • Priority changed from Normal to High
  • Target version set to Ready
Actions #2

Updated by livdywan 1 day ago

  • Subject changed from [FIRING:1] worker-arm1 (worker-arm1: System load alert openQA worker-arm1 salt system_load_alert_worker-arm1 worker) to [FIRING:1] worker-arm1 (worker-arm1: System load alert openQA worker-arm1 salt system_load_alert_worker-arm1 worker) size:S
  • Description updated (diff)
  • Status changed from New to Workable
Actions

Also available in: Atom PDF