Project

General

Profile

Actions

action #163340

open

OBSRSync regularily fails minion jobs - nobody cares, tools gets alerted (e.g. "Munin - minion Minion Jobs") size:M

Added by nicksinger 6 months ago. Updated about 24 hours ago.

Status:
Feedback
Priority:
Normal
Assignee:
Category:
Regressions/Crashes
Start date:
Due date:
% Done:

0%

Estimated time:

Description

Observation

emails with the subject Munin - minion Minion Jobs and content like this:

opensuse.org :: openqa.opensuse.org :: Minion Jobs - see https://openqa.opensuse.org/minion/jobs?state=failed
        CRITICALs: failed is 501.00 (outside range [:500]).

We also see the same on OSD, see https://progress.opensuse.org/issues/163340#note-6 for some examples.
Looking at https://openqa.opensuse.org/minion/jobs?state=failed a lot of obs_rsync_run jobs fail:

---
args:
- project: openSUSE:Slowroll:Build:2
  url: https://api.opensuse.org/public/build/openSUSE:Slowroll:Build:2/_result?package=000product
attempts: 1
children: []
created: 2024-07-04T08:33:32.788609Z
delayed: 2024-07-04T08:33:32.788609Z
expires: ~
finished: 2024-07-04T08:33:33.044735Z
id: 4068028
lax: 0
notes:
  gru_id: 20378045
  project_lock: 1
parents: []
priority: 100
queue: default
result:
  code: 256
  message: |-
    rsync: [sender] change_dir "/openSUSE:Slowroll:Build:2/images/x86_64/kiwi-templates-Minimal:kvm-and-xen" (in openqa) failed: No such file or directory (2)
    rsync error: some files/attrs were not transferred (see previous errors) (code 23) at main.c(1835) [Receiver=3.2.3]
    read_files.sh failed for openSUSE:Slowroll:Build:2 in enviroment openSUSE:Slowroll:Build:2
retried: ~
retries: 0
started: 2024-07-04T08:33:32.792054Z
state: failed
task: obs_rsync_run
time: 2024-07-04T11:25:28.671226Z
worker: 2253

Acceptance criteria

  • AC1: We don't receive those e-mails anymore (unless there is really an actionable problem)
  • AC2: Errors are visible on the web UI pages under "OBS Sync"

Suggestions

Rollback steps


Related issues 3 (1 open2 closed)

Related to openQA Infrastructure (public) - action #155743: OBSRSync fails to sync openSUSE:Factory:PowerPC:ToTest (was: WARNINGs: failed is 452.00 in Munin - minion Minion Jobs on o3)Blockedlivdywan2024-02-21

Actions
Related to openQA Infrastructure (public) - action #163067: [alert] Munin - minion Minion Jobs - see https://openqa.opensuse.org/minion/jobs?state=failed - opensuse.org :: openqa.opensuse.orgRejected2024-07-01

Actions
Related to QA (public) - action #112871: obs_rsync_run Minion tasks fail with no error message size:MResolvedlivdywan

Actions
Actions

Also available in: Atom PDF