Project

General

Profile

Actions

action #114733

closed

openqaworker-arm-3 not consistently reachable

Added by livdywan almost 2 years ago. Updated almost 2 years ago.

Status:
Rejected
Priority:
Normal
Assignee:
Category:
-
Target version:
Start date:
2022-07-27
Due date:
% Done:

0%

Estimated time:

Description

Observation

Originally I was thinking this might be a temporary issue as the "Grafana webhook actions" pipelines were failing. So I re-tried the pipeline. Then talking about it with Tina made me realize I couldn't connect to openqaworker-arm-3 via osd, but from localhost it was fine i.e.

  • ssh: connect to host openqaworker-arm-3 port 22: No route to host
  • lost connection

And eventually my attempt to connect from localhost would just hang forever.

I suspect there's a DNS issue but really guessing so far.

Actions #1

Updated by okurz almost 2 years ago

  • Status changed from New to Rejected
  • Assignee set to okurz
  • Target version set to Ready

Better to be handled as part of #114586. openqaworker-arm-3 is our most unstable one and is often crashing and can not be pinged. As long as we can at least recover it using the remote controlled PDU everything should be fine.

Actions

Also available in: Atom PDF