Project

General

Profile

Actions

action #46502

closed

[tools] Support "record_info" in serial_failure_detection

Added by okurz almost 6 years ago. Updated over 3 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
Enhancement to existing tests
Start date:
2019-01-22
Due date:
2021-05-21
% Done:

0%

Estimated time:
Difficulty:

Description

Motivation

See https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/6589#issuecomment-456419001 . As the serial failure detection supports only "soft" and "hard" now we can fail the job or use "soft" but for this we want to always have a bugref so probably we should support to just set an info popup without altering the test result.

Acceptance criteria

  • AC1: serial issue detection can be used with a notice popup in test details but without altering the test result

Related issues 1 (0 open1 closed)

Related to openQA Tests (public) - action #45530: [aarch64] system_workarounds.pm triggers lib/known_bugs serial detection which abort whole test suiteClosedggardet_arm2018-12-24

Actions
Actions #1

Updated by okurz almost 6 years ago

  • Related to action #45530: [aarch64] system_workarounds.pm triggers lib/known_bugs serial detection which abort whole test suite added
Actions #2

Updated by szarate almost 5 years ago

We don't want to paint it green

Actions #3

Updated by okurz over 4 years ago

@szarate Not sure what you want to tell with your statement. Of course I am with you to not just mark "known bugs" by record_info instead of a soft-fail. Have you followed https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/6589#issuecomment-456419001 and the whole discussion after that? The problem was with "noteworthy conditions" that can not have a workable bug behind where people also added record_soft_failure without bug references. Do you consider this feature worthwhile to follow up with or would you prefer an alternative?

Actions #4

Updated by tjyrinki_suse about 4 years ago

  • Subject changed from [functional][u] Support "record_info" in serial_failure_detection to [qe-core][functional] Support "record_info" in serial_failure_detection
Actions #5

Updated by okurz over 3 years ago

As I was asked for further details, let me try to explain: With serial_failure_detection I mean the logic called in https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/6589/files#diff-45750ef51e5e8360e7ef27b51599ba4b396c661c6ef82d1fd76cecd3ebec195cR27 and the reason why we created https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/6589 should explain the idea as well: soft-fail also according to http://open.qa/docs/ means "known, non-critical issue", so it should only be used if we have a bugref. only this way we can ensure it is "known". Otherwise jobs will end up all yellow but no one can do anything about it, leading to alarm-fatigue

Actions #6

Updated by VANASTASIADIS over 3 years ago

So, to clarify: the code in https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/6589/files#diff-45750ef51e5e8360e7ef27b51599ba4b396c661c6ef82d1fd76cecd3ebec195cR27 and https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/6589/files only changes what is being pushed to serial failures. serial failures is then passed to the backend (basetest.pm) via main.pm, and it's used to fail or softfail according to it's type attribute.

In my understanding, what is being asked is to add a third type option - apart from the currently allowed soft and hard - and then in the backend where the actual parsing, regex matching and (soft)failing takes place, account for that third type using record_info to notify users and then proceeding without failing or softfailing.

Is that understanding correct?

Actions #7

Updated by VANASTASIADIS over 3 years ago

  • Assignee set to VANASTASIADIS
Actions #8

Updated by VANASTASIADIS over 3 years ago

  • Status changed from New to In Progress
Actions #9

Updated by okurz over 3 years ago

VANASTASIADIS wrote:

In my understanding, what is being asked is to add a third type option - apart from the currently allowed soft and hard - and then in the backend where the actual parsing, regex matching and (soft)failing takes place, account for that third type using record_info to notify users and then proceeding without failing or softfailing.

Is that understanding correct?

that sounds like a good idea, yes.

Actions #10

Updated by okurz over 3 years ago

Given that within the tools team we have a different focus for now and are exceeding our WIP limit I suggest you unassign again. Can you?

Actions #11

Updated by livdywan over 3 years ago

Apparently there was no comment here following my chat with Bill:

  • There's the implementation in basetest.pm in os-autoinst which Bill would have to tackle from a Tools pov
  • There's the distri part where the desired behavior can be implemented (qe-core)
  • Why do we distinguish hardhard vs. softfail vs. "not a failure, not a bug" the point of that third, new option seems unclear
  • We need a category that only provides more data for debugging even if by default it will not be used/needed
Actions #12

Updated by okurz over 3 years ago

  • Subject changed from [qe-core][functional] Support "record_info" in serial_failure_detection to [tools] Support "record_info" in serial_failure_detection
  • Target version changed from future to Ready

Yes, I understand that now that this is related to os-autoinst. Then it's good. So we will take it over to the SUSE QE Tools team backlog.

Actions #13

Updated by openqa_review over 3 years ago

  • Due date set to 2021-05-21

Setting due date based on mean cycle time of SUSE QE Tools

Actions #14

Updated by VANASTASIADIS over 3 years ago

  • Status changed from In Progress to Resolved
Actions #15

Updated by okurz over 3 years ago

thank you!

Actions

Also available in: Atom PDF