Project

General

Profile

Actions

action #130633

closed

Better documentation on jenkins.qa.suse.de alerts and recovery

Added by livdywan over 1 year ago. Updated over 1 year ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Start date:
Due date:
% Done:

0%

Estimated time:
Tags:

Description

Motivation

It seems the alert regarding "packet loss" is not very clear. And maybe when there's many alerts it's not obvious how to address it.

Acceptance criteria

  • AC1: The alert is understood by the team
  • AC1: There's documentation about how to recover jenkins when it's down

Suggestions


Related issues 1 (0 open1 closed)

Copied from openQA Infrastructure (public) - action #128561: salt managed host being down does not trigger any alert (was: jenkins.qa.suse.de stuck in emergency mode but no alert) size:MResolveddheidler2023-05-032023-07-04

Actions
Actions

Also available in: Atom PDF