Project

General

Profile

action #108743

openQA Project - coordination #80142: [saga][epic] Scale out: Redundant/load-balancing deployments of openQA, easy containers, containers on kubernetes

qa-power8-5-kvm minions alert is heart-broken

Added by okurz 3 months ago. Updated 3 months ago.

Status:
Resolved
Priority:
High
Assignee:
Target version:
Start date:
2022-03-22
Due date:
% Done:

0%

Estimated time:

Description

Observations

worker-dashboard-qa-power8-5-kvm shows a broken heart for Minion Jobs.

Rollback steps

  • Un-pause alert

Related issues

Copied from openQA Infrastructure - action #108740: qa-power8-5-kvm minions alert is heart-broken 💔️Rejected2022-03-22

History

#1 Updated by okurz 3 months ago

  • Copied from action #108740: qa-power8-5-kvm minions alert is heart-broken 💔️ added

#2 Updated by nicksinger 3 months ago

  • Status changed from New to In Progress
  • Assignee set to nicksinger

Apparently we accumulated 107 failed minion jobs. Most of them where older then 1 year according to the minion dashboard. I cleaned them now as they are too old to react on anyway. We now have 7 failed jobs left every other day with a fail in locking the database. I remember there was some work done but it might be just alright.

#3 Updated by nicksinger 3 months ago

  • Status changed from In Progress to Resolved

Also available in: Atom PDF