Project

General

Profile

Actions

action #96710

closed

Error `Can't call method "write" on an undefined value` shows up in worker log leading to incompletes

Added by mkittler over 2 years ago. Updated over 2 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
-
Target version:
Start date:
2021-08-10
Due date:
2021-08-31
% Done:

0%

Estimated time:

Description

observation

Since 2021-08-10T05:00 the number of incompletes is increasing (see https://monitor.qa.suse.de/d/nRDab3Jiz/openqa-jobs-test?viewPanel=14&orgId=1&from=1628546400000&to=now). The reason is e.g. Reason: abandoned: associated worker openqaworker3:11 re-connected but abandoned the job. Judging by the worker log there might be a problem with the worker code:

Aug 10 12:01:46 openqaworker3 worker[13786]: [debug] [pid:13786] REST-API call: POST http://openqa.suse.de/api/v1/jobs/6789995/status
Aug 10 12:01:51 openqaworker3 worker[13786]: [debug] [pid:13786] Updating status so job 6789995 is not considered dead.
Aug 10 12:01:51 openqaworker3 worker[13786]: [debug] [pid:13786] REST-API call: POST http://openqa.suse.de/api/v1/jobs/6789995/status
Aug 10 12:01:54 openqaworker3 worker[13786]: [debug] [pid:13786] Stopping job 6789995 from openqa.suse.de: 06789995-sle-15-SP1-Server-DVD-Updates-x86_64-Build20210810-1-qam_kernel_multipath_supportserver@64bit - reason: abort
Aug 10 12:01:54 openqaworker3 worker[13786]: [debug] [pid:13786] REST-API call: POST http://openqa.suse.de/api/v1/jobs/6789995/status
Aug 10 12:01:54 openqaworker3 worker[13786]: [info] [pid:13786] +++ worker notes +++
Aug 10 12:01:54 openqaworker3 worker[13786]: [info] [pid:13786] End time: 2021-08-10 10:01:54
Aug 10 12:01:54 openqaworker3 worker[13786]: [info] [pid:13786] Result: abort
Aug 10 12:01:54 openqaworker3 worker[13786]: [debug] [pid:13786] Job 6789995 stopped as incomplete
Aug 10 12:01:54 openqaworker3 worker[13786]: [warn] [pid:13786] Unable to upload results of the job because no command server URL or worker ID have been set.
Aug 10 12:01:54 openqaworker3 worker[13786]: [debug] [pid:13786] Upload concluded (no current module)
Aug 10 12:01:54 openqaworker3 worker[13786]: [debug] [pid:13786] REST-API call: POST http://openqa.suse.de/api/v1/jobs/6789995/set_done?reason=abort&result=incomplete&worker_id=966
Aug 10 12:01:54 openqaworker3 worker[13786]: [debug] [pid:13786] Job 6789995 from openqa.suse.de finished - reason: abort
Aug 10 12:01:54 openqaworker3 worker[13786]: [debug] [pid:13786] Cleaning up for next job
Aug 10 12:01:54 openqaworker3 worker[13786]: [debug] [pid:13786] Informing openqa.suse.de that we are going offline
Aug 10 12:01:54 openqaworker3 worker[13786]: Mojo::Reactor::Poll: I/O watcher failed: Can't call method "write" on an undefined value at /usr/lib/perl5/vendor_perl/5.26.1/Mojo/UserAgent.pm line 350.
Aug 10 12:01:54 openqaworker3 worker[13786]: [error] [pid:13786] Another error occurred when trying to stop gracefully due to an error. Trying to kill ourself forcefully now.
Aug 10 12:01:54 openqaworker3 systemd[1]: openqa-worker-auto-restart@11.service: Service RestartSec=100ms expired, scheduling restart.
Aug 10 12:24:05 openqaworker3 worker[16563]: [info] [pid:16563] Output of rsync:
Aug 10 12:24:05 openqaworker3 worker[16563]: [info] [#4187] Calling: rsync -avHP --timeout 1800 rsync://openqa.suse.de/tests/ --delete /var/lib/openqa/cache/openqa.suse.de/tests/
Aug 10 12:24:05 openqaworker3 worker[16563]: receiving incremental file list
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/.git/
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/.git/FETCH_HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [116B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/.git/ORIG_HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [116B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/.git/index
Aug 10 12:24:05 openqaworker3 worker[16563]: [116B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/.git/logs/HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [116B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/.git/refs/heads/
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/products/opensuse/needles/.git/
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/products/opensuse/needles/.git/FETCH_HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [117B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/products/opensuse/needles/.git/ORIG_HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [117B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/products/opensuse/needles/.git/index
Aug 10 12:24:05 openqaworker3 worker[16563]: [117B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/products/opensuse/needles/.git/logs/HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [117B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/products/opensuse/needles/.git/refs/heads/
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/products/sle/needles/.git/
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/products/sle/needles/.git/FETCH_HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [117B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/products/sle/needles/.git/ORIG_HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/products/sle/needles/.git/index
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/products/sle/needles/.git/logs/HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sle/products/sle/needles/.git/refs/heads/
Aug 10 12:24:05 openqaworker3 worker[16563]: sll/.git/
Aug 10 12:24:05 openqaworker3 worker[16563]: sll/.git/FETCH_HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sll/.git/ORIG_HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sll/.git/index
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sll/.git/logs/HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sll/.git/refs/heads/
Aug 10 12:24:05 openqaworker3 worker[16563]: sll/products/sll/needles/.git/
Aug 10 12:24:05 openqaworker3 worker[16563]: sll/products/sll/needles/.git/FETCH_HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sll/products/sll/needles/.git/ORIG_HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sll/products/sll/needles/.git/index
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sll/products/sll/needles/.git/logs/HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: sll/products/sll/needles/.git/refs/heads/
Aug 10 12:24:05 openqaworker3 worker[16563]: vmdp/.git/
Aug 10 12:24:05 openqaworker3 worker[16563]: vmdp/.git/FETCH_HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: vmdp/.git/ORIG_HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: vmdp/.git/index
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: vmdp/.git/logs/HEAD
Aug 10 12:24:05 openqaworker3 worker[16563]: [118B blob data]
Aug 10 12:24:05 openqaworker3 worker[16563]: vmdp/.git/refs/heads/
Aug 10 12:24:05 openqaworker3 worker[16563]: sent 302,978 bytes  received 3,122,744 bytes  9,746.01 bytes/sec
Aug 10 12:24:05 openqaworker3 worker[16563]: total size is 14,122,452,144  speedup is 4,122.47
Aug 10 12:24:05 openqaworker3 worker[16563]: [info] [pid:16563] Finished to rsync tests
Aug 10 12:24:05 openqaworker3 worker[16563]: [debug] [pid:16563] Symlinked from "/var/lib/openqa/cache/openqa.suse.de/tests/sle" to "/var/lib/openqa/pool/11/sle"
Aug 10 12:24:05 openqaworker3 worker[16563]: [info] [pid:16563] Preparing cgroup to start isotovideo
Aug 10 12:24:05 openqaworker3 worker[16563]: [info] [pid:16563] Using cgroup /sys/fs/cgroup/systemd/openqa.slice/openqa-worker.slice/openqa-worker-auto-restart@11.service/6790307
Aug 10 12:24:05 openqaworker3 worker[16563]: [info] [pid:16563] Starting isotovideo container
Aug 10 12:24:05 openqaworker3 worker[16563]: [debug] [pid:16563] Registered process:17981
Aug 10 12:24:05 openqaworker3 worker[16563]: [info] [pid:16563] isotovideo has been started (PID: 17981)
Aug 10 12:24:05 openqaworker3 worker[16563]: [info] [pid:17981] 17981: WORKING 6790307
Aug 10 12:24:05 openqaworker3 worker[16563]: [debug] [pid:17981] +++ worker notes +++
Aug 10 12:24:05 openqaworker3 worker[16563]: [debug] [pid:16563] Running job 6790307 from openqa.suse.de: 06790307-sle-15-SP3-Server-DVD-Updates-x86_64-Build20210810-1-mru-iscsi_server_normal_auth_backstore_hdd@64bit.
Aug 10 12:24:05 openqaworker3 worker[16563]: Mojo::Reactor::Poll: I/O watcher failed: Can't call method "write" on an undefined value at /usr/lib/perl5/vendor_perl/5.26.1/Mojo/UserAgent.pm line 350.
Aug 10 12:24:05 openqaworker3 worker[16563]: [error] [pid:16563] Stopping because a critical error occurred.
Aug 10 12:24:05 openqaworker3 worker[16563]: [error] [pid:16563] Another error occurred when trying to stop gracefully due to an error. Trying to kill ourself forcefully now.
Aug 10 12:24:05 openqaworker3 worker[16563]: [info] [pid:16563] Isotovideo exit status: 0
Aug 10 12:24:05 openqaworker3 worker[16563]: [debug] [pid:16563] Stopping job 6790307 from openqa.suse.de: 06790307-sle-15-SP3-Server-DVD-Updates-x86_64-Build20210810-1-mru-iscsi_server_normal_auth_backstore_hdd@64bit - reason: done
Aug 10 12:24:05 openqaworker3 worker[16563]: [debug] [pid:16563] REST-API call: POST http://openqa.suse.de/api/v1/jobs/6790307/status
Aug 10 12:24:06 openqaworker3 systemd[1]: openqa-worker-auto-restart@11.service: Service RestartSec=100ms expired, scheduling restart.

We're currently also faced with #96557 and #96554 so there might be a relation - although these incompletes look quite different now.


Related issues 3 (0 open3 closed)

Related to openQA Infrastructure - coordination #96447: [epic] Failed systemd services and job age alertsResolvedokurz2021-08-04

Actions
Related to openQA Infrastructure - action #97043: job queue hitting new record 14k jobsResolvedokurz2021-08-17

Actions
Copied to openQA Infrastructure - action #96713: Slow grep in openqa-label-known-issues leads to high CPU usageResolvedokurz2021-09-10

Actions
Actions

Also available in: Atom PDF