action #73273

open

coordination #102906: [saga][epic] Increased stability of tests with less "known failures", known incompletes handled automatically within openQA

coordination #102909: [epic] Prevent more incompletes already within os-autoinst or openQA

job incompletes with auto_review:"setup failure: Cache service status error from API.* file is not a database .*":retry

Added by Xiaojing_liu about 4 years ago. Updated about 3 years ago.

Status:
Workable
Priority:
Low
Assignee:
-
Category:
Feature requests
Target version:
QA (public, currently private due to #173521) - future
Start date:
2020-10-13
Due date:
% Done:
0%

Estimated time:

Description

Observation

Job https://openqa.suse.de/tests/4812517 is incomplete. The worker log shows:

[2020-10-13T04:14:16.0500 CEST] [debug] [pid:36910] Updating status so job 4812517 is not considered dead.
[2020-10-13T04:14:16.0501 CEST] [debug] [pid:36910] REST-API call: POST http://openqa.suse.de/api/v1/jobs/4812517/status
[2020-10-13T04:14:21.0568 CEST] [debug] [pid:36910] Updating status so job 4812517 is not considered dead.
[2020-10-13T04:14:21.0569 CEST] [debug] [pid:36910] REST-API call: POST http://openqa.suse.de/api/v1/jobs/4812517/status
[2020-10-13T04:14:21.0630 CEST] [error] [pid:36910] Unable to setup job 4812517: Cache service status error from API: Minion job #43206 failed: DBD::SQLite::db do failed: file is not a database at /usr/lib/perl5/vendor_perl/5.26.1/Mojo/SQLite.pm line 85.

See https://openqa.suse.de/tests/4812517/file/worker-log.txt for more details.
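
The auto_review expression in the subject of this ticket can be checked against the failure reason. The following is a minimal sketch, assuming the expression is applied as a regular-expression search over the job's reason text. The reason string is the one quoted in the original subject of this ticket; the function name `matches_auto_review` is hypothetical and not part of openQA.

```python
import re

# Expression from the ticket subject (without the trailing ":retry" action).
AUTO_REVIEW_PATTERN = re.compile(
    r"setup failure: Cache service status error from API.* file is not a database .*"
)


def matches_auto_review(reason):
    """Return True if the job's failure reason matches the ticket's expression."""
    return AUTO_REVIEW_PATTERN.search(reason) is not None


# Reason text as quoted in the original ticket subject.
reason = (
    "setup failure: Cache service status error from API: Minion job #43206 failed: "
    "DBD::SQLite::db do failed: file is not a database at "
    "/usr/lib/perl5/vendor_perl/5.26.1/Mojo/SQLite.pm line 85."
)
print(matches_auto_review(reason))
```

The generalized expression no longer depends on the Minion job number or the Perl module path, so it should also match other jobs that fail with the same SQLite error.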

Acceptance criteria

  • AC1: The issue is handled automatically, or the test maintainer is informed what they did wrong

Suggestions

  • Try to handle this problem with an automatic recovery within openQA, unless the problem was introduced by test maintainers (unlikely)

Workaround

Delete the cache database on the affected worker and retry the job.
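
The workaround could be scripted roughly as follows. This is only a sketch under stated assumptions, not how openQA handles it: it relies on the documented fact that every SQLite 3 database file starts with the 16-byte header `SQLite format 3\0`, and it treats any other non-empty file as "not a database". The function names and the path passed in are hypothetical, and the cache service should presumably be stopped before the file is removed.

```python
import os

# First 16 bytes of every SQLite 3 database file.
SQLITE_MAGIC = b"SQLite format 3\x00"


def is_sqlite_database(path):
    """Return True if the file has a valid SQLite 3 header or is empty."""
    with open(path, "rb") as handle:
        header = handle.read(len(SQLITE_MAGIC))
    # SQLite treats a zero-byte file as a new, empty database.
    return header == b"" or header == SQLITE_MAGIC


def remove_if_corrupt(path):
    """Delete the database and its -wal/-shm siblings if the header is invalid.

    Returns True if the database file was removed, False otherwise.
    """
    if not os.path.exists(path) or is_sqlite_database(path):
        return False
    for suffix in ("", "-wal", "-shm"):
        try:
            os.remove(path + suffix)
        except FileNotFoundError:
            pass  # sibling files only exist in some journal modes
    return True
```

A call such as `remove_if_corrupt(cache_db_path)`, where `cache_db_path` is a placeholder for the worker's cache database file, would remove the file only when its header is invalid. A valid header does not prove the rest of the database is intact, so this check covers the "file is not a database" case from this ticket but not necessarily the "database disk image is malformed" case from #67000.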


Related issues 1 (0 open, 1 closed)

Related to openQA Project (public) - action #67000: Job incompletes due to malformed worker cache database disk image with auto_review:"Cache service status error.*(database disk image is malformed|Specified job ID is invalid).*":retry (Resolved, mkittler, 2020-05-18)

Actions #1

Updated by okurz about 4 years ago

  • Description updated (diff)
  • Category set to Feature requests
  • Status changed from New to Workable
  • Priority changed from Normal to Low
  • Target version set to Ready
Actions #2

Updated by Xiaojing_liu about 4 years ago

  • Subject changed from job incompletes with auto_review:"setup failure: Cache service status error from API: Minion job #43206 failed: DBD::SQLite::db do failed: file is not a database at /usr/lib/perl5/vendor_perl/5.26.1/Mojo/SQLite.pm line 85." to job incompletes with auto_review:"setup failure: Cache service status error from API: Minion job.* failed: DBD::SQLite::db do failed: file is not a database .*"
Actions #3

Updated by Xiaojing_liu about 4 years ago

  • Subject changed from job incompletes with auto_review:"setup failure: Cache service status error from API: Minion job.* failed: DBD::SQLite::db do failed: file is not a database .*" to job incompletes with auto_review:"setup failure: Cache service status error from API.* file is not a database .*"
Actions #4

Updated by okurz about 4 years ago

  • Related to action #67000: Job incompletes due to malformed worker cache database disk image with auto_review:"Cache service status error.*(database disk image is malformed|Specified job ID is invalid).*":retry added
Actions #5

Updated by okurz about 4 years ago

  • Subject changed from job incompletes with auto_review:"setup failure: Cache service status error from API.* file is not a database .*" to job incompletes with auto_review:"setup failure: Cache service status error from API.* file is not a database .*":retry
  • Description updated (diff)
  • Target version changed from Ready to future
Actions #6

Updated by okurz about 4 years ago

  • Parent task set to #62420
Actions #7

Updated by okurz about 4 years ago

The underlying problem was introduced by #67000, which we consider solved. We should still keep this feature request in mind to improve the situation if the worker cache fails under similar conditions, e.g. by automatically retriggering the job or "un-assigning" it from the worker before it incompletes.

Actions #8

Updated by okurz about 3 years ago

  • Parent task changed from #62420 to #102909