coordination #161735

open

openQA Project (public) - coordination #112862: [saga][epic] Future ideas for easy multi-machine handling: MM-tests as first-class citizens

[epic] Better error detection on GRE tunnel misconfiguration

Added by okurz 7 months ago. Updated 5 months ago.

Status: Blocked
Priority: Normal
Assignee:
Category: Feature requests
Target version:
Start date: 2024-06-21
Due date:
% Done: 0%
Estimated time: (Total: 0.00 h)
Tags:

Description

Motivation

See #160646 and #161381

Acceptance criteria

  • AC1: The backend and/or test code can point more clearly to the likely causes of an error
  • AC2: Similar future issues are prevented with better CI checks

Suggestions

  • Monitor the contents of the salt mine to better understand when it breaks and why
  • Implement sanity checks on the worker to verify proper peer configuration
  • Change the MTU-size check in the test distribution to make the error message clearer in case not even the smallest MTU size works (e.g. "The network connection within the SUT does not work at all." and maybe for tap-based tests "Check the MM-setup, e.g. GRE tunnels")
  • Get rid of the salt mine completely for "workername" <-> IP lookup
    • Problem: Currently the pillar data does not contain the FQDN of the other workers.
    • We already have "## FQDN: …" in many cases, so it would be easy to make that a mandatory key for all workers, or at least for those where we expect the tap class to be usable
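
A sanity check like the one suggested above could be sketched as follows in Python. This is only an illustration under stated assumptions: it assumes that the generated gre_tunnel_preup.sh sets each tunnel peer through a "remote_ip=<address>" token on a non-comment line, and the helper names remote_ips and has_only_empty_remote_ips are hypothetical, not part of any existing tooling.

```python
import re
import sys

# Assumption: each tunnel peer appears as a "remote_ip=<address>" token.
# The value ends at whitespace or a quote character.
REMOTE_IP = re.compile(r"remote_ip=([^\s\"']*)")


def remote_ips(script_text):
    """Return the value of every remote_ip= assignment, skipping comment lines."""
    values = []
    for line in script_text.splitlines():
        if line.lstrip().startswith("#"):
            continue
        values.extend(REMOTE_IP.findall(line))
    return values


def has_only_empty_remote_ips(script_text):
    """True if the script has remote_ip= statements and all of them are empty."""
    values = remote_ips(script_text)
    return bool(values) and all(value == "" for value in values)


if __name__ == "__main__":
    # Usage: pass one or more script paths; exit non-zero if any looks broken.
    broken = []
    for path in sys.argv[1:]:
        with open(path, encoding="utf-8") as handle:
            if has_only_empty_remote_ips(handle.read()):
                broken.append(path)
    for path in broken:
        print(f"{path}: all remote_ip= statements are empty", file=sys.stderr)
    sys.exit(1 if broken else 0)
```

A script without any remote_ip= statement is deliberately not flagged here, because a worker without tunnel peers may be legitimate; whether that case should also fail is a design decision left open.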

WARNING

  • Do not touch the key of a worker in workerconf.sls - a lot of other states depend on it!

Subtasks 2 (2 open, 0 closed)

action #162734: Simple script detecting gre_tunnel_preup.sh with only empty remote_ip= statements during salt CI pipelines size:M (Workable, 2024-06-21)

action #162737: Put #161735-4, #161735-5, #161735-6 about "Better error detection in GRE tunnel config" into tasks size:S (Workable, 2024-06-21)

Related issues 1 (0 open, 1 closed)

Copied from openQA Infrastructure (public) - action #161381: multi-machine test network issues reported 2024-06-03 due to missing content in the salt mine size:S (Resolved, mkittler, 2024-06-03, 2024-06-18)
