Project

General

Profile

Actions

action #153880

closed

openQA Infrastructure - coordination #168895: [saga][epic][infra] Support SUSE PRG office move while ensuring business continuity

openQA Infrastructure - coordination #168898: [epic][infra] Support SUSE PRG office datacenter "PRG1" move while ensuring business continuity

https://openqa.suse.de/tests/13277880#step/patterns/96 not being able to resolve download.suse.de, likely DNS problems in PRG1

Added by okurz 10 months ago. Updated 10 months ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
Regressions/Crashes
Target version:
Start date:
Due date:
% Done:

0%

Estimated time:
Tags:

Description

Observation

https://openqa.suse.de/tests/13277880#step/patterns/96 not being able to resolve download.suse.de

I guess we need to take one more deep look into the situation: https://monitor.qa.suse.de/d/nRDab3Jiz/openqa-jobs-test?orgId=1&viewPanel=24&from=now-2d&to=now shows again a significant increase in the failure ratio.

The job got triggered on w17 which does not have "tap" so the scheduler should not have picked that machine The job is not multi-machine

Rollback steps

run salt '*.qa.suse.cz' cmd.run "sed -i 's/NETCONFIG_DNS_STATIC_SERVERS.*/NETCONFIG_DNS_STATIC_SERVERS=""/' /etc/sysconfig/network/config && netconfig update -f && cat /etc/resolv.conf" and check that the output mentions nameserver 10.100.96.1 as first server again


Related issues 2 (1 open1 closed)

Related to openQA Infrastructure - action #138275: Ensure that there is proper ownership and maintainership for qanet.qa.suse.czBlockedokurz2023-10-19

Actions
Copied from openQA Project - action #152389: significant increase in MM-test failure ratio 2023-12-11: test fails in multipath_iscsi and other multi-machine scenarios due to MTU size auto_review:"ping with packet size 1350 failed, problems with MTU" size:MResolvedmkittler2023-12-11

Actions
Actions

Also available in: Atom PDF