Project

General

Custom queries

Profile

Actions

action #151310

closed

coordination #112862: [saga][epic] Future ideas for easy multi-machine handling: MM-tests as first-class citizens

coordination #111929: [epic] Stable multi-machine tests covering multiple physical workers

[regression] significant increase of parallel_failed+failed since 2023-11-21 size:M

Added by okurz about 1 year ago. Updated 11 months ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
Regressions/Crashes
Target version:
Start date:
2023-11-23
Due date:
% Done:

0%

Estimated time:

Description

Motivation

As visible on https://monitor.qa.suse.de/d/nRDab3Jiz/openqa-jobs-test?orgId=1&from=1700508932604&to=1700724085546&viewPanel=24
since about 2023-11-21 there is again a significant increase of multi-machine tests which should be investigated, mitigated, fixed and prevented.

Acceptance criteria

Suggestions

  • Start to look into the issue early as waiting longer makes everything harder for us :)
  • Lookup common failure sources and find out if it's actually not test or product regressions.
  • Ask common stakeholders and/or test reviewers if they know something
  • Review recent infrastructure changes which might be possibly related
  • Mitigate, fix and prevent the issues you find
  • Consider using the scientific method https://progress.opensuse.org/projects/openqav3/wiki/#Further-decision-steps-working-on-test-issues
  • Use SQL queries to find out what failures are most common
    • Consider using this opportunity to document one or two examples of how we commonly do that

Related issues 2 (0 open2 closed)

Related to openQA Tests (public) - action #151612: [kernel][tools] test fails in suseconnect_scc - SUT times out trying to reach https://scc.suse.comResolvedmkittler2023-11-28

Actions
Related to openQA Project (public) - action #152389: significant increase in MM-test failure ratio 2023-12-11: test fails in multipath_iscsi and other multi-machine scenarios due to MTU size auto_review:"ping with packet size 1350 failed, problems with MTU" size:MResolvedmkittler2023-12-11

Actions
#1

Updated by livdywan about 1 year ago

  • Subject changed from [regression] significant increase of parallel_failed+failed since 2023-11-21 to [regression] significant increase of parallel_failed+failed since 2023-11-21 size:M
  • Description updated (diff)
  • Status changed from New to Workable
#2

Updated by mkittler about 1 year ago

  • Assignee set to mkittler
#11

Updated by mkittler about 1 year ago

  • Status changed from Workable to In Progress
#12

Updated by openqa_review about 1 year ago

  • Due date set to 2023-12-12
#17

Updated by mkittler about 1 year ago

  • Status changed from In Progress to Feedback
#22

Updated by okurz about 1 year ago

  • Related to action #151612: [kernel][tools] test fails in suseconnect_scc - SUT times out trying to reach https://scc.suse.com added
#23

Updated by okurz about 1 year ago

  • Parent task set to #111929
#24

Updated by okurz about 1 year ago

  • Status changed from Feedback to In Progress
#27

Updated by mkittler about 1 year ago

  • Status changed from In Progress to Resolved
#28

Updated by okurz about 1 year ago

  • Related to action #152389: significant increase in MM-test failure ratio 2023-12-11: test fails in multipath_iscsi and other multi-machine scenarios due to MTU size auto_review:"ping with packet size 1350 failed, problems with MTU" size:M added
#29

Updated by okurz 11 months ago

  • Due date deleted (2023-12-12)
Actions

Also available in: Atom PDF