action #10460
closed
*Huge* gaps in assert_screen
Description
https://openqa.suse.de/tests/190772 is a very surprising failure: while waiting for the rebootnow needle, the test suddenly finds itself in gdm. Looking at the logs, we missed 15 seconds between two consecutive checks (the counter jumps from 1914 to 1899):
no match 1915
MATCH(rebootnow-20131217:0.00)
MATCH(rebootnow-20141204:0.00)
MATCH(rebootnow-20150409:0.00)
MATCH(rebootnow-390x-20150709:0.00)
no match 1914
MATCH(rebootnow-20131217:0.00)
MATCH(rebootnow-20141204:0.00)
MATCH(rebootnow-20150409:0.00)
MATCH(rebootnow-390x-20150709:0.00)
no match 1899
no change 1899
no change 1898
This gap occurred about half an hour before the test was done, which puts it at around 2am according to the journal:
Jan 28 01:55:14 openqaworker1 worker[13862]: got job 190772: 00190772-sle-12SP2-Staging:D-Server-DVD-x86_64-Build0006-RAID1
Jan 28 01:55:14 openqaworker1 worker[13862]: 28004: WORKING 190772
Jan 28 02:37:55 openqaworker1 worker[13862]: cleaning up 00190772-sle-12SP2-Staging:D-Server-DVD-x86_64-Build0006-RAID1...
There is nothing too unusual in the log and, if I count this right, it had roughly 8 jobs running. So what is going on?
But I think we need to mark this gap in the result somehow. We need to treat any single assert_screen match that takes more than 2 seconds as a fatal failure. This also applies to the grub needle. If we have too many needles to check within 2 seconds, we need to clean up the needle pool.
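To illustrate what marking the gap could look like, here is a minimal sketch in Python that scans a log excerpt like the one above and reports every place where the counter drops by more than 2 between two checks. It assumes that the number after "no match" / "no change" is the remaining timeout in seconds, which fits the 15 second jump but is not confirmed here. The function name `find_gaps` and the `max_gap` parameter are hypothetical and are not part of os-autoinst.

```python
import re

# Matches countdown lines such as "no match 1915" or "no change 1899".
# Assumption: the trailing number is the remaining timeout in seconds.
COUNTDOWN = re.compile(r"^no (?:match|change) (\d+)$")


def find_gaps(log_lines, max_gap=2):
    """Return (before, after) pairs where the counter dropped by more than max_gap."""
    gaps = []
    previous = None
    for line in log_lines:
        m = COUNTDOWN.match(line.strip())
        if not m:
            continue  # skip MATCH(...) lines and anything else
        current = int(m.group(1))
        if previous is not None and previous - current > max_gap:
            gaps.append((previous, current))
        previous = current
    return gaps


# Shortened excerpt from the log quoted in this ticket.
excerpt = [
    "no match 1915",
    "MATCH(rebootnow-20131217:0.00)",
    "no match 1914",
    "MATCH(rebootnow-20131217:0.00)",
    "no match 1899",
    "no change 1899",
    "no change 1898",
]

print(find_gaps(excerpt))  # [(1914, 1899)]
```

A check along these lines could run over the finished log, or the same comparison could be done inside the matching loop so that the gap is recorded in the result at the moment it happens.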