Project

General

Profile

Actions

action #108743

closed

openQA Project (public) - coordination #80142: [saga][epic] Scale out: Redundant/load-balancing deployments of openQA, easy containers, containers on kubernetes

qa-power8-5-kvm minions alert is heart-broken

Added by okurz almost 3 years ago. Updated almost 3 years ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
-
Start date:
2022-03-22
Due date:
% Done:

0%

Estimated time:

Description

Observations

worker-dashboard-qa-power8-5-kvm shows a broken heart for Minion Jobs.

Rollback steps

  • Un-pause alert

Related issues 1 (0 open1 closed)

Copied from openQA Infrastructure (public) - action #108740: qa-power8-5-kvm minions alert is heart-broken 💔️Rejectedokurz2022-03-22

Actions
Actions #1

Updated by okurz almost 3 years ago

  • Copied from action #108740: qa-power8-5-kvm minions alert is heart-broken 💔️ added
Actions #2

Updated by nicksinger almost 3 years ago

  • Status changed from New to In Progress
  • Assignee set to nicksinger

Apparently we accumulated 107 failed minion jobs. Most of them where older then 1 year according to the minion dashboard. I cleaned them now as they are too old to react on anyway. We now have 7 failed jobs left every other day with a fail in locking the database. I remember there was some work done but it might be just alright.

Actions #3

Updated by nicksinger almost 3 years ago

  • Status changed from In Progress to Resolved
Actions

Also available in: Atom PDF