action #179038

open

coordination #154777: [saga][epic] Shareable os-autoinst and test distribution plugins

coordination #162131: [epic] future version control related features in openQA

Gracious handling of longer remote git clones outages size:S

Added by robert.richardson 16 days ago. Updated 1 day ago.

Status: Workable
Priority: Normal
Category: Feature requests
Target version:
Start date: 2025-03-17
Due date: 2025-04-08 (Due in 6 days)
% Done: 0%
Estimated time:

Description

Motivation

Currently, git_clone minion jobs fail when GitLab is temporarily unreachable (see #178492). By introducing a proper error-handling mechanism, we can ensure:

  • Temporary outages do not cause unnecessary job failures or alerts.

User Story

As a test engineer and openQA operator,
I want openQA to handle short-lived GitLab outages without causing mass Minion job failures,
so that users do not experience unnecessary disruption.

Acceptance Criteria

  • AC1: Temporary remote git outages don't cause failing minion jobs
  • AC2: Remote git repositories are still updated after short-lived request failures, e.g. outages in the range of seconds

Suggestions

  • Damage is likely limited. If we can't sync needles, nobody can edit needles.
  • Jobs end up incomplete if there's an ongoing issue with git_clone minion jobs
  • We could decide to eventually give up, continue anyway, and let jobs run
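
One way to read the suggestions above together with AC1 and AC2 is a bounded retry with backoff: retry while the outage looks temporary, then give up and let the caller continue. The following is only a minimal sketch of that idea in Python, not openQA's actual implementation. The names `TransientGitError` and `sync_with_retries`, and all default values, are hypothetical and chosen for illustration.

```python
import time
from typing import Callable


class TransientGitError(Exception):
    """Hypothetical marker for failures worth retrying (e.g. remote unreachable)."""


def sync_with_retries(
    fetch: Callable[[], None],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Call fetch() until it succeeds or max_attempts is reached.

    Returns True if the repository was updated, False if we gave up.
    The caller can then decide to continue with the existing checkout
    instead of failing the job. All defaults are assumptions.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            fetch()
            return True
        except TransientGitError:
            if attempt == max_attempts:
                break  # give up; do not sleep after the final attempt
            # Exponential backoff: 1s, 2s, 4s, ... capped at max_delay
            delay = min(base_delay * 2 ** (attempt - 1), max_delay)
            sleep(delay)
    return False
```

A caller might treat a `False` result as "continue with the last synced state and log a warning" rather than as a job failure. Errors that are not transient (anything other than `TransientGitError` in this sketch) still propagate immediately.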

Related issues 2 (2 open, 0 closed)

Related to openQA Infrastructure (public) - action #178492: [alert] Many failing `git_clone` Minion jobs auto_review:"Error detecting remote default branch name":retry size:S (Blocked, robert.richardson, 2025-03-07)

Copied to openQA Project (public) - action #179185: Detection of long-time remote git clone outages size:S (Workable, 2025-03-17)

Actions #1

Updated by robert.richardson 16 days ago

  • Description updated (diff)
Actions #2

Updated by robert.richardson 16 days ago

  • Related to action #178492: [alert] Many failing `git_clone` Minion jobs auto_review:"Error detecting remote default branch name":retry size:S added
Actions #3

Updated by robert.richardson 16 days ago

  • Description updated (diff)
Actions #4

Updated by okurz 16 days ago

  • Target version changed from Ready to Tools - Next
Actions #5

Updated by okurz 14 days ago

  • Subject changed from Improve GitLab Outage Handling in openQA to Gracious handling of longer remote git clones outages size:S
  • Description updated (diff)
Actions #6

Updated by okurz 14 days ago

  • Parent task set to #162131
Actions #7

Updated by okurz 14 days ago

  • Copied to action #179185: Detection of long-time remote git clone outages size:S added
Actions #8

Updated by okurz 13 days ago

  • Status changed from New to Workable
  • Target version changed from Tools - Next to Ready

Needed for #178492

Actions #9

Updated by robert.richardson 13 days ago

  • Assignee set to robert.richardson
Actions #10

Updated by robert.richardson 9 days ago

  • Status changed from Workable to In Progress
Actions #11

Updated by openqa_review 9 days ago

  • Due date set to 2025-04-08

Setting due date based on mean cycle time of SUSE QE Tools

Actions #12

Updated by livdywan 1 day ago

  • Status changed from In Progress to Workable