action #157726

open

openQA Project - coordination #110833: [saga][epic] Scale up: openQA can handle a schedule of 100k jobs with 1k worker instances

openQA Project - coordination #108209: [epic] Reduce load on OSD

osd-deployment | Failed pipeline for master (worker3[6-9].oqa.prg2.suse.org)

Added by livdywan about 1 month ago. Updated about 1 month ago.

Status: Blocked
Priority: Normal
Assignee:
Category: Regressions/Crashes
Target version:
Start date: 2024-03-18
Due date:
% Done: 0%
Estimated time:

Description

Observation

https://gitlab.suse.de/openqa/osd-deployment/-/jobs/2415705

worker37.oqa.prg2.suse.org:
    Minion did not return. [Not connected]
worker36.oqa.prg2.suse.org:
    Minion did not return. [Not connected]
worker38.oqa.prg2.suse.org:
    Minion did not return. [Not connected]
worker39.oqa.prg2.suse.org:
    Minion did not return. [Not connected]
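
To pull the affected hosts out of a job log like the excerpt above, a minimal Python sketch could look like the following. It assumes the two-line layout shown here (an unindented `host:` line followed by an indented result line); the function name `unresponsive_minions` and the `SAMPLE` constant are hypothetical and not part of the osd-deployment pipeline.

```python
NOT_RETURNED = "Minion did not return."


def unresponsive_minions(log_text):
    """Return minion IDs whose result line starts with 'Minion did not return.'.

    Assumes each minion is reported as an unindented 'host:' line followed by
    one or more indented result lines, as in the excerpt from the job log.
    """
    hosts = []
    current = None
    for line in log_text.splitlines():
        if line and not line[0].isspace() and line.rstrip().endswith(":"):
            # Unindented line ending in ':' starts a new minion section.
            current = line.rstrip()[:-1]
        elif current and line.strip().startswith(NOT_RETURNED):
            hosts.append(current)
            current = None  # record each minion at most once
    return hosts


# Sample copied from the Observation section of this ticket.
SAMPLE = """\
worker37.oqa.prg2.suse.org:
    Minion did not return. [Not connected]
worker36.oqa.prg2.suse.org:
    Minion did not return. [Not connected]
worker38.oqa.prg2.suse.org:
    Minion did not return. [Not connected]
worker39.oqa.prg2.suse.org:
    Minion did not return. [Not connected]
"""

if __name__ == "__main__":
    for host in sorted(unresponsive_minions(SAMPLE)):
        print(host)
```

The result lists exactly the four workers named in the ticket title, worker3[6-9].oqa.prg2.suse.org.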

Acceptance criteria

  • AC1: osd-deployment passes again

Suggestions

Rollback steps


Related issues 2 (1 open, 1 closed)

Related to openQA Infrastructure - action #157666: OSD unresponsive and then not starting any more jobs on 2024-03-21 (Resolved, okurz, 2024-03-12)
Related to openQA Project - coordination #157669: websockets+scheduler improvements (New, 2023-08-31)

Actions #1

Updated by okurz about 1 month ago

  • Status changed from New to In Progress
  • Assignee set to okurz
Actions #2

Updated by okurz about 1 month ago

  • Related to action #157666: OSD unresponsive and then not starting any more jobs on 2024-03-21 added
Actions #3

Updated by okurz about 1 month ago

  • Parent task set to #108209
Actions #4

Updated by okurz about 1 month ago

  • Description updated (diff)
  • Status changed from In Progress to Blocked
  • Priority changed from High to Normal
  • Target version changed from Ready to future
Actions #5

Updated by okurz about 1 month ago

Actions #6

Updated by okurz about 1 month ago

Blocked on #157669
