Project

General

Profile

Actions

action #135230

closed

salt pillars pipelines failing due to Temporary failure in name resolution

Added by livdywan over 1 year ago. Updated over 1 year ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
-
Start date:
2023-09-06
Due date:
% Done:

0%

Estimated time:

Description

Observation

https://gitlab.suse.de/openqa/salt-pillars-openqa/-/jobs/1808645

openqaworker-arm-2.suse.de:
    telegraf is fine
openqa.suse.de:
    2023-09-06T08:13:41Z E! [inputs.http] Error in plugin: [url=https://openqa.suse.de/admin/influxdb/minion]: Get "https://openqa.suse.de/admin/influxdb/minion": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
[...]
atus 2 - /usr/bin/ping: qa-jump.qe.nue2.suse.org: Name or service not known
    2023-09-06T08:13:38Z E! [telegraf] Error running agent: input plugins recorded 2 errors
    2023-09-06T08:13:41Z E! [inputs.http] Error in plugin: [url=https://openqa.suse.de/admin/influxdb/minion]: Get "https://openqa.suse.de/admin/influxdb/minion": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
    2023-09-06T08:13:41Z E! [inputs.http] Error in plugin: [url=https://openqa.suse.de/admin/influxdb/jobs]: Get "https://openqa.suse.de/admin/influxdb/jobs": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
    2023-09-06T08:13:41Z E! [telegraf] Error running agent: input plugins recorded 2 errors

Acceptance criteria

  • AC1: salt pillars pipelines succeed

Suggestions

  • Report an eng-infra ticket

Related issues 2 (0 open2 closed)

Related to openQA Infrastructure (public) - action #134879: reverse DNS resolution PTR for openqa.oqa.prg2.suse.org. yields "3(NXDOMAIN)" for PRG1 workers (NUE1+PRG2 are fine) size:MResolvedokurz2023-08-31

Actions
Has duplicate openQA Infrastructure (public) - action #135206: [tools] GitlabCI telegraf step on salt-states-openqa failedRejectednicksinger2023-09-05

Actions
Actions #2

Updated by livdywan over 1 year ago

  • Status changed from New to Blocked
  • Assignee set to nicksinger
Actions #3

Updated by okurz over 1 year ago

There had also been user reports which could be related to DNS resolution problems in https://suse.slack.com/archives/C02CANHLANP/p1693986939350679 and https://suse.slack.com/archives/C02CANHLANP/p1693988984111069

Actions #5

Updated by nicksinger over 1 year ago

  • Has duplicate action #135206: [tools] GitlabCI telegraf step on salt-states-openqa failed added
Actions #6

Updated by nicksinger over 1 year ago

  • Project changed from QA (public) to openQA Infrastructure (public)
Actions #7

Updated by nicksinger over 1 year ago

  • Related to action #134879: reverse DNS resolution PTR for openqa.oqa.prg2.suse.org. yields "3(NXDOMAIN)" for PRG1 workers (NUE1+PRG2 are fine) size:M added
Actions #8

Updated by okurz over 1 year ago

  • Tags changed from infra, arm, FC Basement to infra, gitlab, reactive
  • Parent task deleted (#130955)
Actions #9

Updated by okurz over 1 year ago

  • Tags changed from infra, gitlab, reactive to infra, gitlab, reactive work
Actions #10

Updated by nicksinger over 1 year ago

  • Status changed from Blocked to Resolved

so the issue was a wrong configuration on the DNS servers for qa.suse.cz hosted e.g. on qanet.suse.cz. Martin was able to resolve the issue in https://progress.opensuse.org/issues/134879#note-12 and we have a working pipeline again: https://gitlab.suse.de/openqa/salt-states-openqa/-/jobs/1810909

Actions

Also available in: Atom PDF