Project

General

Profile

Actions

action #39845

closed

Results of tests with very short duration (~<10s) are not displayed

Added by cfconrad over 6 years ago. Updated about 5 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
Feature requests
Target version:
Start date:
2018-08-16
Due date:
% Done:

100%

Estimated time:

Description

If the execution of the job takes approximately less then 10s the results are not displayed in the openqa web ui.
When enlarge the execution time with "script_run('sleep 8');" results are displayed.

I noticed this only with the ssh backend (https://github.com/os-autoinst/os-autoinst/pull/1012), which is in development.

Failed job: http://10.86.1.52/tests/36


Related issues 1 (0 open1 closed)

Related to openQA Project (public) - action #58826: Result not rendered in detail view on short (e.g. <10s) test-modules, if job is still runningResolvedtinita2019-10-29

Actions
Actions #1

Updated by coolo over 6 years ago

This is most likely because the worker didn't yet see that there is something running at all. Unusual problem :)

Actions #2

Updated by coolo about 6 years ago

  • Subject changed from [tool] Results of tests with very short duration (~<10s) are not displayed to Results of tests with very short duration (~<10s) are not displayed
  • Target version set to Ready
Actions #3

Updated by andriinikitin almost 6 years ago

My non-expert investigation leans to conclusion that it is "by design": - start_time is updated when first message is received from worker. And "update status" messages are coming every 10 seconds as defined here https://github.com/os-autoinst/openQA/blob/64ccc82ec49796560ac09d5efa3fe8105a1655fc/lib/OpenQA/Worker/Common.pm#L66

So the quickest/simplest solution may be to somehow send simple message to the WebService "immediately" after job start.

But I would prefer solution when Worker is sending own explicit start and finish times to WebService. E.g it may do it in first/last message or explicit message packets. WebService may collect own timestamp version of 'first/last' messages received, which may be usable e.g. to understand eventual latency.

Actions #4

Updated by okurz over 5 years ago

  • Category changed from 132 to Feature requests
Actions #5

Updated by tinita over 5 years ago

How can this be reproduced?
How can I create a job that takes less than 10s?

And which part is not displayed in the ui?

Actions #6

Updated by cfconrad over 5 years ago

hi,

create a non sense test like https://github.com/cfconrad/os-autoinst-distri-opensuse/blob/sandbox_clemix/tests/clemix/nop.pm

And trigger just this, you will get a result like: http://cfconrad-vm.qa.suse.de/tests/5993
When waiting these 10 seconds before, you get: http://cfconrad-vm.qa.suse.de/tests/5994

Actions #7

Updated by tinita over 5 years ago

cfconrad wrote:

create a non sense test like https://github.com/cfconrad/os-autoinst-distri-opensuse/blob/sandbox_clemix/tests/clemix/nop.pm

Ah, I see, i thought I always have to run the "boot_to_desktop" test first, that's why my test took longer in total.

I was able to reproduce it now with your test, thanks!

Actions #8

Updated by tinita over 5 years ago

  • Target version changed from Ready to Current Sprint
Actions #9

Updated by tinita over 5 years ago

  • Status changed from New to In Progress
  • Assignee set to tinita
Actions #10

Updated by tinita over 5 years ago

Like Andrii said, the first status call to isotovideo happens too late, so it doesn't get a response anymore.
As discussed with Sebastian, I am working on a replacement of the status call via socket. Instead it is using a status file that survives the end of isotovideo.

Actions #12

Updated by pvorel about 5 years ago

  • Description updated (diff)
Actions #14

Updated by tinita about 5 years ago

Current Status:

I created the PRs which fixed the issue by using a status file.
However, one of the tests (t/33-developer_mode.t) was failing sometimes.
The status call we were using before had some side effect (which seems to be a timing thing only, but not sure).

I have now spent a long time debugging this (and learning to know a lot of our code during this), but the reason is still unclear.
We can't merge my PR until this is fixed.

We don't have a plan yet what to do about it.

Actions #15

Updated by tinita about 5 years ago

The bug I mentioned is very probably fixed (PR https://github.com/os-autoinst/os-autoinst/pull/1230 still in review).
Then I can rebase my PRs for this issue.

Actions #16

Updated by livdywan about 5 years ago

Isn't this actually Low priority? On the other hand gh#os-autoinst/os-autoinst-distri-opensuse#8329 seems to be blocked by it.

Actions #17

Updated by pvorel about 5 years ago

  • Priority changed from Low to Normal

cdywan wrote:

Isn't this actually Low priority? On the other hand gh#os-autoinst/os-autoinst-distri-opensuse#8329 seems to be blocked by it.

Yes, please we're waiting for this to be fixed. BTW we might use gh#os-autoinst/os-autoinst-distri-opensuse#8329 to fix very often broken all LTP on o3 (#51743, https://openqa.opensuse.org/tests/1064280#next_previous).

Actions #18

Updated by tinita about 5 years ago

We're on it. Sorry, it was blocked very long by a bug in os-autoinst that needed to be fixed first.
Second, it introduces a new way of communication between the openQA worker and isotovideo, so both repos were updated and we couldn't merge the second PR before the first was merged.

Second PR is in review and should be merged soon. https://github.com/os-autoinst/openQA/pull/2327

Actions #19

Updated by pvorel about 5 years ago

@tinita: thanks a lot for working on it :)

Actions #21

Updated by tinita about 5 years ago

  • Status changed from In Progress to Feedback
  • % Done changed from 0 to 100
Actions #22

Updated by tinita about 5 years ago

cfconrad it was deployed to https://openqa.opensuse.org/, can you test?

Actions #23

Updated by cfconrad about 5 years ago

hi @tinita, I run it in my own instance with latest openqa installed.
Looks good, nice!
http://cfconrad-vm.qa.suse.de/tests/6136
EDIT
http://cfconrad-vm.qa.suse.de/tests/6141 <= real test run

Regarding your hint, I took a look to the details page during run. And I had the attached intermediate state.
Don't know if this is something which should be covered as well.
http://imagebin.nue.suse.com/2476

Do you need some test run on openqa as well?

Actions #24

Updated by tinita about 5 years ago

@cfconrad That in the intermediate state the short tests don't show is an additional issue. During working on this issue I couldn't figure out why it's happening.
Could you open a new issue for that? Thanks!

Actions #25

Updated by cfconrad about 5 years ago

  • Related to action #58826: Result not rendered in detail view on short (e.g. <10s) test-modules, if job is still running added
Actions #27

Updated by tinita about 5 years ago

  • Status changed from Feedback to Resolved

Thanks!

Actions

Also available in: Atom PDF