action #69787

closed

[qe-core][qam][sporadic] test fails in rsync_client not waiting for the server long enough to sync on the barrier, auto_review:"(?s)cause_of_death.*barrier.*rsync_setup.*lock owner already finished.*Test died.*mydie.*lockapi"

Added by okurz over 3 years ago. Updated over 3 years ago.

Status: Resolved
Priority: Normal
Assignee: -
Category: Bugs in existing tests
Target version: -
Start date:
Due date:
% Done: 0%
Estimated time:
Difficulty:

Description

Observation

openQA test in scenario sle-15-Server-DVD-Updates-x86_64-qam-rsync-client@64bit fails in rsync_client with

[2020-08-10T16:49:17.503 CEST] [debug] barrier wait 'rsync_setup'
[2020-08-10T16:49:17.503 CEST] [debug] tests/console/rsync_client.pm:28 called lockapi::barrier_wait
[2020-08-10T16:49:17.503 CEST] [debug] <<< testapi::record_info(title="Paused", output="Wait for rsync_setup (on parent job)", result="ok")
[2020-08-10T16:49:17.584 CEST] [info] ::: lockapi::_try_lock: Retry 1 of 7...
[2020-08-10T16:49:27.662 CEST] [info] ::: lockapi::_try_lock: Retry 2 of 7...
[2020-08-10T16:49:37.736 CEST] [info] ::: lockapi::_try_lock: Retry 3 of 7...
[2020-08-10T16:49:47.810 CEST] [info] ::: lockapi::_try_lock: Retry 4 of 7...
[2020-08-10T16:49:57.876 CEST] [info] ::: lockapi::_try_lock: Retry 5 of 7...
[2020-08-10T16:50:07.944 CEST] [info] ::: lockapi::_try_lock: Retry 6 of 7...
[2020-08-10T16:50:18.008 CEST] [info] ::: lockapi::_try_lock: Retry 7 of 7...
[2020-08-10T16:50:28.008 CEST] [debug] tests/console/rsync_client.pm:28 called lockapi::barrier_wait
[2020-08-10T16:50:28.009 CEST] [debug] <<< bmwqemu::mydie(cause_of_death="barrier 'rsync_setup': lock owner already finished")
[2020-08-10T16:50:28.089 CEST] [info] ::: basetest::runtest: # Test died: mydie at /usr/lib/os-autoinst/lockapi.pm line 41.

whereas the corresponding server log shows

[2020-08-10T16:54:21.049 CEST] [debug] barrier wait 'rsync_setup'
[2020-08-10T16:54:21.049 CEST] [debug] tests/console/rsync_server.pm:67 called lockapi::barrier_wait
[2020-08-10T16:54:21.049 CEST] [debug] <<< testapi::record_info(title="Paused", output="Wait for rsync_setup (on parent job)", result="ok")
[2020-08-10T16:54:21.091 CEST] [debug] barrier 'rsync_setup' not released, sleeping 5s
[2020-08-10T16:54:26.121 CEST] [debug] barrier 'rsync_setup' not released, sleeping 5s
…
[2020-08-10T16:59:32.771 CEST] [debug] barrier 'rsync_setup' not released, sleeping 5s
[2020-08-10T16:59:36.015 CEST] [debug] backend got TERM
[2020-08-10T16:59:36.015 CEST] [debug] autotest received signal TERM, saving results of current test before exiting

So the client gave up waiting after roughly 70 seconds (16:49:17 to 16:50:28), whereas the server only reached the barrier at 16:54:21, almost four minutes after the client had already died.
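The timing gap can be checked directly from the timestamps quoted in the two logs. The following is only a small sketch of that arithmetic; the variable names are made up for illustration and nothing here is part of os-autoinst or the test code.

```python
from datetime import datetime

# Timestamp layout used in the log excerpts above (timezone suffix omitted).
FMT = "%Y-%m-%dT%H:%M:%S.%f"


def parse(ts: str) -> datetime:
    """Parse one log timestamp such as '2020-08-10T16:49:17.503'."""
    return datetime.strptime(ts, FMT)


# Values copied from the client and server logs in this ticket.
client_start = parse("2020-08-10T16:49:17.503")  # client calls barrier_wait
client_died = parse("2020-08-10T16:50:28.009")   # client dies via mydie
server_start = parse("2020-08-10T16:54:21.049")  # server calls barrier_wait
server_term = parse("2020-08-10T16:59:36.015")   # server backend gets TERM

# How long the client waited before giving up.
client_wait = (client_died - client_start).total_seconds()
# How long after the client's first attempt the server reached the barrier.
server_lag = (server_start - client_start).total_seconds()
# How long the client had already been dead when the server arrived.
gap = (server_start - client_died).total_seconds()
# How long the server then waited in vain before being terminated.
server_wait = (server_term - server_start).total_seconds()

print(f"client waited {client_wait:.1f}s, server arrived after {server_lag:.1f}s")
print(f"server arrived {gap:.1f}s too late and waited {server_wait:.1f}s")
```

With these inputs the client's window is about 70.5 s, while the server needed about 303.5 s to reach the barrier, so the two wait periods never overlapped.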

Reproducible

Fails often but not always. @dzedro tends to label such failures with #65118. All tests labeled that way can be investigated for this issue.

Expected result

A good case: 20200810-1

The test should be robust enough that both the client and the server side wait long enough to cover the time the other side needs to reach the synchronisation point.
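One generic way to express "wait long enough" is a polling loop bounded by an overall deadline rather than by a fixed number of retries. The sketch below is an assumption-laden illustration only: `wait_until`, `FakeClock` and the 600-second timeout are hypothetical and are not the lockapi implementation or a proposed fix for the test.

```python
import time
from typing import Callable


def wait_until(
    is_ready: Callable[[], bool],
    timeout_s: float,
    interval_s: float,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll is_ready() every interval_s seconds until it is true or timeout_s elapsed."""
    deadline = clock() + timeout_s
    while True:
        if is_ready():
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)


class FakeClock:
    """Hypothetical test double so the demo runs instantly instead of sleeping."""

    def __init__(self) -> None:
        self.now = 0.0

    def time(self) -> float:
        return self.now

    def sleep(self, seconds: float) -> None:
        self.now += seconds


# Delay between the client's first attempt and the server reaching the
# barrier, taken from the log timestamps in this ticket.
SERVER_ARRIVAL_S = 303.546


def simulate(timeout_s: float) -> bool:
    """Replay the observed scenario with a 10 s polling interval."""
    fake = FakeClock()
    return wait_until(
        is_ready=lambda: fake.now >= SERVER_ARRIVAL_S,
        timeout_s=timeout_s,
        interval_s=10.0,
        clock=fake.time,
        sleep=fake.sleep,
    )


short_window = simulate(70.0)   # comparable to the window seen in the client log
long_window = simulate(600.0)   # hypothetical, more generous window
print(short_window, long_window)
```

In this simulation a window of about 70 seconds misses the server, while a window longer than the observed five-minute delay lets both sides meet. The right value for the real test depends on how long the server setup can take, which this ticket does not state.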

Further details

Always latest result in this scenario: latest


Related issues: 1 (0 open, 1 closed)

Related to openQA Tests - action #80570: [qe-core][qem][sporadic] test fails in smt_client1 auto_review:"mydie.*acquiring barrier 'smt_setup': lock owner already finished":retry (Status: Rejected, Assignee: zluo)

Actions #1

Updated by tjyrinki_suse over 3 years ago

  • Status changed from New to Workable
  • Assignee deleted (klorenzova)
  • Start date deleted (2020-08-10)

Need to find another assignee.

Actions #2

Updated by tjyrinki_suse over 3 years ago

  • Subject changed from [qam][sporadic] test fails in rsync_client not waiting for the server long enough to sync on the barrier, auto_review:"(?s)cause_of_death.*barrier.*rsync_setup.*lock owner already finished.*Test died.*mydie.*lockapi" to [qe-core][qam][sporadic] test fails in rsync_client not waiting for the server long enough to sync on the barrier, auto_review:"(?s)cause_of_death.*barrier.*rsync_setup.*lock owner already finished.*Test died.*mydie.*lockapi"
Actions #3

Updated by tjyrinki_suse over 3 years ago

  • Status changed from Workable to Resolved

All 21 runs in the last 3 weeks have passed, so this looks fixed now.

Actions #4

Updated by okurz over 3 years ago

  • Related to action #80570: [qe-core][qem][sporadic] test fails in smt_client1 auto_review:"mydie.*acquiring barrier 'smt_setup': lock owner already finished":retry added