action #75220
closedall jobs run on openqaworker8 incomplete: "Cache service status error from API: Minion job .*failed: .*(database disk image is malformed|not a database)":retry
0%
Description
again many incompletes on openqaworker8 due to malformed sqlite database. But right now it seems like many jobs are actually running fine but I think no one changed anything. I just triggered an explicit pipeline for "auto-review". At the time of writing it is still running.
Updated by okurz about 4 years ago
- Copied from action #73342: all jobs run on openqaworker8 incomplete: "Cache service status error from API: Minion job .*failed: .*(database disk image is malformed|not a database)":retry added
Updated by okurz about 4 years ago
All incompletes were labeled with #67000 and retriggered and auto-review passed this step but then more severe problems have piled up and I have not seen the alert about "minion jobs" on that worker in before: https://stats.openqa-monitor.qa.suse.de/d/WDopenqaworker8/worker-dashboard-openqaworker8?from=1603455577704&to=1603521237406&fullscreen&panelId=65104
I will just reboot the machine and see what happens.
EDIT: Machine is back up but suffering from missing or borked network connection same as in #75016
Updated by okurz about 4 years ago
- Status changed from In Progress to Resolved
At least the problem of cache file was resolved with rebooting which reformated the complete NVMe based pool+cache partition
Updated by okurz about 4 years ago
- Related to action #78058: [Alerting] Incomplete jobs of last 24h alert - again many incompletes due to corrupted cache, on openqaworker8 added