action #37354

closed

[opensuse][functional][u][sporadic][medium] test fails in desktop_runner is unstable

Added by JERiveraMoya almost 6 years ago. Updated almost 6 years ago.

Status: Resolved
Priority: Normal
Assignee:
Category: Bugs in existing tests
Target version: SUSE QA - Milestone 17
Start date: 2018-06-14
Due date: 2018-07-17
% Done: 0%
Estimated time:
Difficulty:

Description

Observation

openQA test in scenario opensuse-Tumbleweed-DVD-x86_64-update_Leap_42.2_kde@64bit fails in desktop_runner

After typing "true", the test does not come back to the desktop.
It also fails when the box at the top does not disappear and the desktop needle does not match: https://openqa.opensuse.org/tests/690612#step/desktop_runner/4

Reproducible

Fails since (at least) Build 20180608

Expected result

Last good: 20180606 (or more recent)

Acceptance criteria

  • AC1: The test module "desktop_runner" should be made stable, e.g. less than 1/100 fails in "desktop_runner" -> AT1: Run 100 jobs to verify, could be on o3
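
A minimal sketch of how the AC1/AT1 check could be evaluated once the job results are collected. The function names and the default threshold argument are hypothetical and only illustrate the arithmetic; they are not part of openQA or of the test distribution. The last helper assumes that jobs fail independently of each other, which may not hold on a loaded server.

```python
def failure_rate(failures: int, runs: int) -> float:
    """Observed fraction of failed jobs in a batch."""
    if runs <= 0:
        raise ValueError("runs must be positive")
    if not 0 <= failures <= runs:
        raise ValueError("failures must be between 0 and runs")
    return failures / runs


def meets_ac1(failures: int, runs: int, threshold: float = 1 / 100) -> bool:
    """True if the observed failure rate is strictly below the threshold.

    The default of 1/100 is taken from AC1; the strict comparison is an
    assumption about how "less than 1/100" should be read.
    """
    return failure_rate(failures, runs) < threshold


def chance_of_clean_batch(true_fail_rate: float, runs: int) -> float:
    """Probability that all `runs` jobs pass, assuming independent jobs."""
    return (1.0 - true_fail_rate) ** runs


if __name__ == "__main__":
    print(meets_ac1(0, 100))   # a batch of 100 jobs without failures
    print(meets_ac1(1, 100))   # a single failure already reaches 1/100
    print(round(chance_of_clean_batch(0.01, 100), 3))
```

Read strictly, a batch of 100 jobs only satisfies AC1 when none of them fails in "desktop_runner". Under the independence assumption, a module that really fails once in 100 runs would still pass such a batch roughly a third of the time, so one clean batch is a weak confirmation on its own.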

Further details

Always latest result in this scenario: latest


Related issues 3 (0 open, 3 closed)

Related to openQA Tests - coordination #35302: [qe-core][opensuse][functional][epic][sporadic] Various unstable tests on o3 (Resolved, SLindoMansilla, 2018-04-26)

Related to openQA Tests - action #39011: [functional] Low performance on openqa production server (Resolved, okurz, 2018-08-01)

Blocked by openQA Tests - action #31351: [functional][u][medium] force_cron_run does not actually run any crons (occasionally) (Resolved, zluo, 2018-02-03, 2018-07-03)

Actions #1

Updated by JERiveraMoya almost 6 years ago

  • Related to coordination #35302: [qe-core][opensuse][functional][epic][sporadic] Various unstable tests on o3 added
Actions #2

Updated by JERiveraMoya almost 6 years ago

  • Subject changed from [opensuse][functional][sporadic] test fails in desktop_runner is unstable to [opensuse][functional][u][sporadic] test fails in desktop_runner is unstable
Actions #3

Updated by JERiveraMoya almost 6 years ago

  • Description updated (diff)
Actions #4

Updated by okurz almost 6 years ago

  • Due date set to 2018-07-03
  • Status changed from New to Workable
  • Target version set to Milestone 17
Actions #5

Updated by okurz almost 6 years ago

  • Target version changed from Milestone 17 to Milestone 17
Actions #6

Updated by riafarov almost 6 years ago

  • Subject changed from [opensuse][functional][u][sporadic] test fails in desktop_runner is unstable to [opensuse][functional][u][sporadic][medium] test fails in desktop_runner is unstable
Actions #7

Updated by zluo almost 6 years ago

  • Status changed from Workable to In Progress
  • Assignee set to zluo

take over

Actions #8

Updated by zluo almost 6 years ago

After investigating and doing more than 40 test runs on normal workers and remote workers (loewe), I can see that this issue is related to low performance on the server.

So we need to work around the issue with send_key 'ret' in sub x11_start_program of susedistribution.pm.

PR:

https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/5275

Actions #9

Updated by zluo almost 6 years ago

  • Assignee deleted (zluo)

I don't know a better way to work around the issue related to the openQA production server. The statistics only show a problem with low performance on the server.

Actions #10

Updated by okurz almost 6 years ago

I am sorry, but repeatedly hitting the same key does not sound like a good solution. If the problem can only be reproduced in production for now, then I recommend gathering good statistics on production. https://progress.opensuse.org/projects/openqatests/wiki/Wiki#Statistical-investigation mentions one approach for that. We had pretty stable tests for a while, so either the product degraded in stability or we have unstable openQA workers. Both need to be fixed, but not in os-autoinst-distri-opensuse. I do not consider it a long-term scalable approach to provide quirky workarounds in tests.

Actions #11

Updated by okurz almost 6 years ago

  • Status changed from In Progress to Workable
Actions #12

Updated by SLindoMansilla almost 6 years ago

  • Status changed from Workable to In Progress
  • Assignee set to SLindoMansilla
Actions #14

Updated by SLindoMansilla almost 6 years ago

By the way, I found a sync problem in send_key_until_needle_match: https://github.com/os-autoinst/os-autoinst/pull/982

Actions #15

Updated by SLindoMansilla almost 6 years ago

The last round of statistical investigation shows that the problem happens in 45 out of 100 jobs.

Actions #16

Updated by okurz almost 6 years ago

Ok, with this analysis as written in #35685#note-32, I guess it is better that we work on #31351 first. It would be best if you paired up with zluo on this.

Actions #17

Updated by SLindoMansilla almost 6 years ago

  • Blocked by action #31351: [functional][u][medium] force_cron_run does not actually run any crons (occasionally) added
Actions #18

Updated by SLindoMansilla almost 6 years ago

  • Status changed from In Progress to Blocked
  • Assignee deleted (SLindoMansilla)

Let's avoid people working on this until this issue gets solved: force_cron_run does not actually run any crons https://progress.opensuse.org/issues/31351

Actions #19

Updated by SLindoMansilla almost 6 years ago

The PR I was working on before being blocked: https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/5303

Actions #20

Updated by okurz almost 6 years ago

  • Assignee set to okurz

@SLindoMansilla please don't keep unassigned blocked tickets. It's ok if you want to assign to me.

Actions #21

Updated by mgriessmeier almost 6 years ago

  • Due date changed from 2018-07-03 to 2018-07-17
Actions #22

Updated by okurz almost 6 years ago

Same as #35685, let's see about stability now that #31351 has been resolved.

Actions #23

Updated by okurz almost 6 years ago

  • Description updated (diff)
Actions #24

Updated by okurz almost 6 years ago

  • Status changed from Blocked to Resolved

Seems we covered this now properly with #35685 -> desktop_runner stable

Actions #25

Updated by okurz over 5 years ago

  • Related to action #39011: [functional] Low performance on openqa production server added