action #127754

closed

osd nfs-server needed to be restarted but we got no alerts size:M

Added by tinita about 1 year ago. Updated 11 months ago.

Status: Resolved
Priority: Normal
Assignee:
Category: -
Target version:
Start date:
Due date:
% Done: 0%
Estimated time:
Tags:

Description

Observation

See #121573 and https://suse.slack.com/archives/C02CANHLANP/p1681228800740289

s390zp18:/var/lib/openqa/share/factory # cd hdd/fixed/
-bash: cd: hdd/fixed/: Stale file handle

Suggestions

  • Research whether "Stale file handle" errors on NFS can be prevented or handled better. Maybe all machines need to be upgraded to a newer OS? s390zp18 runs SLE12SP5 (with a long uptime and likely no automatic upgrades).
  • Research monitoring and alerting for NFS mounts or file handles
  • Try to reproduce the problem, e.g. with s390zp18 and OSD. Maybe it is related to reboots of machines?
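
As a starting point for the monitoring suggestion, a stale handle could be detected by checking whether a `stat` call on a path fails with `ESTALE`. The following is only a minimal sketch, not an existing openQA or salt check: the function names `is_stale` and `check_paths` are hypothetical, and the path in the usage note is the one from the observation above.

```python
import errno
import os


def is_stale(path):
    """Return True if accessing `path` fails with ESTALE (stale file handle)."""
    try:
        os.stat(path)
    except OSError as err:
        # Other errors (e.g. ENOENT for a missing path) are not treated as stale.
        return err.errno == errno.ESTALE
    return False


def check_paths(paths):
    """Return the subset of `paths` that currently report a stale file handle."""
    return [p for p in paths if is_stale(p)]
```

For example, `check_paths(["/var/lib/openqa/share/factory/hdd/fixed"])` would return that path if it showed the error from the observation, and an alert could be raised whenever the returned list is non-empty. Note that `os.stat` can block on an unresponsive hard-mounted NFS share, so a real check would probably need a timeout around it.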

Related issues 3 (2 open, 1 closed)

Related to openQA Project - action #65450: workers on o3 power did not restart after upgrade as NFS mount point was stale "Ignoring host 'http://openqa1-opensuse': Working directory does not exist" (Workable, 2020-04-08)

Related to openQA Infrastructure - action #51836: Manage (parts) of s390 kvm instances (formerly s390p7 and s390p8) with salt (Resolved, okurz, 2019-05-22)

Copied from openQA Project - action #121573: Asset/HDD goes missing while job is running (New, 2022-12-06)
