action #95848

closed

[sporadic] CircleCI: openQA t/05-scheduler-full.t flaky size:M

Added by tinita almost 3 years ago. Updated almost 3 years ago.

Status: Resolved
Priority: Urgent
Assignee:
Category: Regressions/Crashes
Target version:
Start date: 2021-07-22
Due date:
% Done: 0%
Estimated time:

Description

Problem

t/05-scheduler-full.t fails more often than it used to (it is already listed as "unstable").

https://app.circleci.com/pipelines/github/os-autoinst/openQA/7142/workflows/7a75593c-a719-4da8-934e-4c4433f67dc2/jobs/67405/steps

[09:54:43] t/05-scheduler-full.t ..................... 2/? 
    #   Failed test 'running job set to done if its worker re-connects claiming not to work on it anymore'
    #   at t/05-scheduler-full.t line 220.
    #          got: 'running'
    #     expected: 'done'

    #   Failed test 'running job incompleted if its worker re-connects claiming not to work on it anymore'
    #   at t/05-scheduler-full.t line 222.
    #          got: 'none'
    #     expected: 'incomplete'

    #   Failed test 'reason is set'
    #   at t/05-scheduler-full.t line 224.
    #                   undef
    #     doesn't match '(?^:abandoned: associated worker .+:\d+ re-connected but abandoned the job)'
    # Looks like you failed 3 tests of 12.
[09:54:43] t/05-scheduler-full.t ..................... 3/? 
#   Failed test 're-scheduling and incompletion of jobs when worker rejects jobs or goes offline'
#   at t/05-scheduler-full.t line 228.
[09:54:43] t/05-scheduler-full.t ..................... 4/? 
[09:54:43] t/05-scheduler-full.t ..................... 5/? 
# Looks like you failed 1 test of 5.
[09:54:43] t/05-scheduler-full.t ..................... Dubious, test returned 1 (wstat 256, 0x100)
Failed 1/5 subtests 

https://app.circleci.com/pipelines/github/os-autoinst/openQA/7147/workflows/fe33092f-ebf0-4224-a1a9-234babeff802/jobs/67468/steps

[14:04:04] t/05-scheduler-full.t ..................... 1/? 
    #   Failed test 'no more jobs need to be allocated'
    #   at t/05-scheduler-full.t line 133.
    #          got: '1'
    #     expected: '0'
    # Looks like you failed 1 test of 4.
[14:04:04] t/05-scheduler-full.t ..................... 2/? 
#   Failed test 'Scheduler worker job allocation'
#   at t/05-scheduler-full.t line 137.

Acceptance Criteria

AC 1: t/05-scheduler-full.t is stable for 10 runs after applying the solution
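
One possible way to check AC 1 locally is to repeat the test and stop at the first failure. The following is only a minimal sketch: the helper name `run_stable` is hypothetical, and the `prove -l` invocation in the trailing comment is an assumption about how the test is started from an openQA checkout, not the project's documented procedure.

```shell
#!/bin/sh
# Hypothetical helper: run a command COUNT times in a row and
# stop at the first failing run.
# Usage: run_stable COUNT COMMAND [ARGS...]
run_stable() {
    count=$1
    shift
    i=1
    while [ "$i" -le "$count" ]; do
        echo "run $i/$count: $*"
        if ! "$@"; then
            echo "failed on run $i of $count" >&2
            return 1
        fi
        i=$((i + 1))
    done
    echo "all $count runs passed"
}

# Assumed invocation from the root of an openQA checkout (not verified here):
# run_stable 10 prove -l t/05-scheduler-full.t
```

A local pass over 10 runs would not by itself show stability on CircleCI, since the failures above were observed there, so the same repetition would likely need to happen in CI as well.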

Suggestions

  • Just an observation: It became more unstable when many of our tests on CircleCI became slower and unstable

Related issues 1 (0 open, 1 closed)

Related to openQA Project - action #95851: [sporadic] CircleCI: openQA t/43-scheduling-and-worker-scalability.t flaky size:M | Resolved | mkittler | 2021-07-22 | 2021-08-07
