action #132827

closed

[tools][qe-core]test fails in rsync_client/salt-master, DNS resolve issue with workers "sapworker*" on multi-machine tests size:M

Added by rfan1 over 1 year ago. Updated 2 months ago.

Status: Resolved
Priority: Normal
Assignee:
Category: Regressions/Crashes
Start date: 2023-07-17
Due date:
% Done: 0%
Estimated time:
Tags:

Description

Observation

Some tests are failing due to a DNS resolution issue on the "sapworker*" workers, especially multi-machine tests. Can someone help check?

Example failures:
https://openqa.suse.de/tests/11593878#step/salt_master/15
http://openqa.suse.de/tests/11594635#step/rsync_client/12

Reproducible

Failed test links

Expected result

I ran the rsync tests on another worker without any issue: http://openqa.suse.de/tests/11594925#dependencies

Rollback steps

  • Add back production worker class on all OSD machines mentioning #132827

Further details

There may be network problems with the "sapworker*" workers. Based on my tests (at least for the rsync test), the same test passes on "worker5" but fails on "sapworker1".
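To compare a passing and a failing worker, a small resolution check could be run on each of them. The following is only a sketch: the function name `check_dns` is made up for illustration, and the hostnames to test are not known from this ticket, so they are taken from the command line rather than hard-coded.

```python
import socket
import sys


def check_dns(hostnames, resolver=socket.getaddrinfo):
    """Return {hostname: first resolved address, or None if resolution failed}."""
    results = {}
    for name in hostnames:
        try:
            # getaddrinfo returns 5-tuples; the last element is the socket
            # address, whose first field is the IP address string.
            results[name] = resolver(name, None)[0][4][0]
        except OSError:
            # socket.gaierror (a subclass of OSError) signals a failed lookup.
            results[name] = None
    return results


if __name__ == "__main__":
    # Hostnames are passed on the command line; none are hard-coded here.
    for host, address in check_dns(sys.argv[1:]).items():
        print(f"{host}: {address or 'FAILED to resolve'}")
```

Running the same script with the same hostnames on "worker5" and on "sapworker1" would show whether the difference is really in name resolution or somewhere else in the network setup.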

Suggestions

  • First ensure that all openQA workers have the salt state applied cleanly, e.g. sudo salt --no-color -C 'G@roles:worker' state.apply
  • Maybe the failure handling can be improved on the os-autoinst side, e.g. with a better "die" message/reason
  • As a temporary measure, consider disabling the "tap" class on affected workers, e.g. rename it to tap_pooXXX
  • Debug multi-machine capabilities according to http://open.qa/docs/#_verify_the_setup
  • Ensure that our salt states provide everything that is needed to run stable multi-machine tests
  • Add back production worker classes for all affected machines openqaworker1, sapworker{1-7}, e.g. qesapworker-prg1-5
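The temporary renaming of the "tap" class, and the later rollback, can be sketched as a small string transformation. This assumes the worker class is stored as a comma-separated string, as in openQA's WORKER_CLASS setting. The helper name `rename_worker_class` and the example class list are hypothetical, and `tap_pooXXX` is kept as the placeholder used in this ticket.

```python
def rename_worker_class(worker_class, old="tap", new="tap_pooXXX"):
    """Replace one exact class name in a comma-separated worker class string."""
    # Split on commas and drop surrounding whitespace and empty entries.
    classes = [c.strip() for c in worker_class.split(",") if c.strip()]
    # Only an exact match is renamed, so an already renamed class is left alone.
    return ",".join(new if c == old else c for c in classes)


if __name__ == "__main__":
    original = "qemu_x86_64,tap"  # hypothetical class list
    disabled = rename_worker_class(original)
    print(disabled)
    # Rollback: swap the arguments to restore the production class.
    print(rename_worker_class(disabled, old="tap_pooXXX", new="tap"))
```

Matching on the exact class name rather than a substring keeps the rollback symmetric: applying the function with swapped arguments restores the original string.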

Related issues 4 (1 open, 3 closed)

Related to openQA Tests (public) - action #132932: [qe-core] test fails in t01_basic - eth0 stays in setup-in-progres | New | 2023-07-18

Related to openQA Project (public) - action #133025: Configure Virtual Interfaces instructions do not work on Leap 15.5 size:M | Resolved | dheidler | 2023-07-19 | 2023-10-31

Related to openQA Infrastructure (public) - action #132137: Setup new PRG2 openQA worker for osd size:M | Resolved | mkittler | 2023-06-29

Blocked by openQA Infrastructure (public) - action #133127: Frankencampus network broken + GitlabCi failed --> uploading artefacts | Resolved | okurz | 2023-07-20
