action #135230
salt pillars pipelines failing due to Temporary failure in name resolution
Status: Resolved
Priority: Urgent
Assignee:
Category: -
Target version:
Start date: 2023-09-06
Due date:
% Done: 0%
Estimated time:
Tags:
Description
Observation
https://gitlab.suse.de/openqa/salt-pillars-openqa/-/jobs/1808645
openqaworker-arm-2.suse.de:
telegraf is fine
openqa.suse.de:
2023-09-06T08:13:41Z E! [inputs.http] Error in plugin: [url=https://openqa.suse.de/admin/influxdb/minion]: Get "https://openqa.suse.de/admin/influxdb/minion": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
[...]
atus 2 - /usr/bin/ping: qa-jump.qe.nue2.suse.org: Name or service not known
2023-09-06T08:13:38Z E! [telegraf] Error running agent: input plugins recorded 2 errors
2023-09-06T08:13:41Z E! [inputs.http] Error in plugin: [url=https://openqa.suse.de/admin/influxdb/minion]: Get "https://openqa.suse.de/admin/influxdb/minion": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
2023-09-06T08:13:41Z E! [inputs.http] Error in plugin: [url=https://openqa.suse.de/admin/influxdb/jobs]: Get "https://openqa.suse.de/admin/influxdb/jobs": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
2023-09-06T08:13:41Z E! [telegraf] Error running agent: input plugins recorded 2 errors
Acceptance criteria
- AC1: salt pillars pipelines succeed
Suggestions
- Report an eng-infra ticket
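To narrow down a "Name or service not known" failure before reporting it, forward resolution can be checked directly from an affected host. The following is only a minimal sketch, assuming Python 3 is available on that host; the function name `check_resolution` and its `resolver` parameter are hypothetical helpers for illustration, not part of any existing pipeline.

```python
import socket


def check_resolution(hostnames, resolver=socket.getaddrinfo):
    """Map each hostname to its sorted resolved addresses, or to an error string.

    `resolver` defaults to the system resolver and can be swapped out
    (for example in tests) with any callable taking (host, port).
    """
    results = {}
    for name in hostnames:
        try:
            infos = resolver(name, None)
            # Each entry is (family, type, proto, canonname, sockaddr);
            # sockaddr[0] is the address string.
            results[name] = sorted({info[4][0] for info in infos})
        except socket.gaierror as err:
            # Raised for failures such as "Name or service not known"
            # or "Temporary failure in name resolution".
            results[name] = f"FAILED: {err}"
    return results
```

For example, calling `check_resolution(["openqa.suse.de", "qa-jump.qe.nue2.suse.org"])` on a worker would show which of the two names from the log above fails to resolve there.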
Updated by livdywan over 1 year ago
- Status changed from New to Blocked
- Assignee set to nicksinger
Updated by okurz over 1 year ago
There had also been user reports which could be related to DNS resolution problems in https://suse.slack.com/archives/C02CANHLANP/p1693986939350679 and https://suse.slack.com/archives/C02CANHLANP/p1693988984111069
Updated by osukup over 1 year ago
Isn't this the same as https://progress.opensuse.org/issues/135206?
Updated by nicksinger over 1 year ago
- Has duplicate action #135206: [tools] GitlabCI telegraf step on salt-states-openqa failed added
Updated by nicksinger over 1 year ago
- Project changed from QA (public) to openQA Infrastructure (public)
Updated by nicksinger over 1 year ago
- Related to action #134879: reverse DNS resolution PTR for openqa.oqa.prg2.suse.org. yields "3(NXDOMAIN)" for PRG1 workers (NUE1+PRG2 are fine) size:M added
Updated by okurz over 1 year ago
- Tags changed from infra, arm, FC Basement to infra, gitlab, reactive
- Parent task deleted (#130955)
Updated by okurz over 1 year ago
- Tags changed from infra, gitlab, reactive to infra, gitlab, reactive work
Updated by nicksinger over 1 year ago
- Status changed from Blocked to Resolved
The issue was a misconfiguration on the DNS servers for qa.suse.cz, hosted e.g. on qanet.suse.cz. Martin was able to resolve it in https://progress.opensuse.org/issues/134879#note-12 and we have a working pipeline again: https://gitlab.suse.de/openqa/salt-states-openqa/-/jobs/1810909