action #166415

[alert] diesel could not reach qa-jump.qe.nue2.suse.org causing ping alert to fire

Added by mkittler 4 months ago. Updated 4 months ago.

Status: New
Priority: Normal
Assignee: -
Category: -
Target version:
Start date: 2024-09-05
Due date:
% Done: 0%
Estimated time:

Description

Related panel/timeframe: https://stats.openqa-monitor.qa.suse.de/d/EML0bpuGk/monitoring?orgId=1&from=1725467338522&to=1725527966118

In that timeframe the logs contain Ethernet/IPv6-related messages:

Sep 04 16:50:58 diesel systemd[1]: systemd-timedated.service: Deactivated successfully.
Sep 04 16:51:32 diesel worker[20565]: [warn] [pid:20565] The average load 10.91 is exceeding the configured threshold of 10. - checking again for web UI 'openqa.suse.de' in 164.03 s
Sep 04 16:51:33 diesel worker[21866]: [debug] [pid:21866] Refusing to grab job from openqa.suse.de: The average load 12.62 is exceeding the configured threshold of 10.
Sep 04 16:51:34 diesel worker[20567]: [debug] [pid:20567] Refusing to grab job from openqa.suse.de: The average load 11.75 is exceeding the configured threshold of 10.
Sep 04 16:51:34 diesel worker[21841]: [warn] [pid:21841] The average load 10.91 is exceeding the configured threshold of 10. - checking again for web UI 'openqa.suse.de' in 65.13 s
Sep 04 16:51:46 diesel worker[21866]: [warn] [pid:21866] The average load 10.73 is exceeding the configured threshold of 10. - checking again for web UI 'openqa.suse.de' in 238.11 s
Sep 04 16:52:16 diesel worker[20566]: [warn] [pid:20566] The average load 10.39 is exceeding the configured threshold of 10. - checking again for web UI 'openqa.suse.de' in 174.86 s
Sep 04 16:52:24 diesel worker[20567]: [warn] [pid:20567] The average load 10.34 is exceeding the configured threshold of 10. - checking again for web UI 'openqa.suse.de' in 192.10 s
Sep 04 16:52:39 diesel worker[21841]: [warn] [pid:21841] The average load 10.17 is exceeding the configured threshold of 10. - checking again for web UI 'openqa.suse.de' in 259.98 s
Sep 04 16:52:43 diesel wickedd[2313]: eth3: updated address 2a07:de40:a102:5:9abe:94ff:fe04:4817/64 in ipv6:auto lease (owner auto)
Sep 04 16:52:43 diesel wickedd[2313]: eth3: updated address 2a07:de40:a102:5:9abe:94ff:fe04:4817/64 in ipv6:auto lease (owner auto)
Sep 04 16:52:44 diesel wickedd[2313]: eth3: verifying adressses for ipv6:auto lease in state applying: success [after 0m0.000s]
Sep 04 16:52:44 diesel wickedd[2313]: Preparing xml lease data for '/var/lib/wicked/lease-eth3-auto-ipv6.xml'
Sep 04 16:52:44 diesel wickedd[2313]: Writing lease to temporary file for '/var/lib/wicked/lease-eth3-auto-ipv6.xml'
Sep 04 16:52:44 diesel wickedd[2313]: Lease written to file '/var/lib/wicked/lease-eth3-auto-ipv6.xml'
Sep 04 16:52:44 diesel wickedd[2313]: eth3: writing lease file for ipv6:auto lease in state granted: success [after 0m0.000s]
Sep 04 16:52:44 diesel wickedd[2313]: eth3: created netconfig batch file to update lease ipv6:auto in state granted
Sep 04 16:52:44 diesel wickedd[2313]: netconfig batch add: modify -i eth3 -s wicked-auto-ipv6 -I /run/wicked/leaseinfo.eth3.auto.ipv6
Sep 04 16:52:44 diesel wickedd[2313]: netconfig batch add: update
Sep 04 16:52:44 diesel wickedd[2313]: eth3: applying system config for ipv6:auto lease in state granted: deferred [since 0m0.000s]
Sep 04 16:52:44 diesel wickedd[2313]: ni_process_reap: reaping child process 36809 (/etc/wicked/extensions/netconfig batch)
Sep 04 16:52:44 diesel wickedd[2313]: subprocess 36809 (/etc/wicked/extensions/netconfig batch) exited with status 0 [0m0.159s]
Sep 04 16:52:44 diesel wickedd[2313]: handle_other_event(generic-updated)
Sep 04 16:52:44 diesel wickedd[2313]: sending event "genericUpdated"
Sep 04 16:52:44 diesel wickedd[2313]: finished finished install job[3465](2) on device eth3[5] for lease ipv6:auto state granted
Sep 04 16:52:44 diesel wickedd[2313]: eth3: applying system config for ipv6:auto lease in state granted: success [after 0m0.165s]
Sep 04 16:52:44 diesel wickedd[2313]: eth3: updater for lease ipv6:auto in state granted finished: success [0m0.700s]
Sep 04 16:52:44 diesel wickedd[2313]: eth3: ipv6:auto lease updated (state granted), sending addressAcquired event
Sep 04 16:52:44 diesel wickedd[2313]: sending device event "addressAcquired" for /org/opensuse/Network/Interface/5; uuid=<088bc666-6e14-0b00-0909-000027000000>
Sep 04 16:52:44 diesel wickedd-nanny[2328]: process event signal addressAcquired from /org/opensuse/Network/Interface/5; uuid=<088bc666-6e14-0b00-0909-000027000000>
Sep 04 16:52:44 diesel wickedd-nanny[2328]: eth3: processed event addressAcquired; state=starting, policy=policy__eth3
Sep 04 16:52:44 diesel wickedd-nanny[2328]: eth3: waiting for more addressAcquired events...
Sep 04 16:52:44 diesel wickedd-nanny[2328]: eth3: waiting for callbacks:
Sep 04 16:52:44 diesel wickedd-nanny[2328]:         088bc666-6e14-0b00-0909-000026000000 event=addressAcquired
Sep 04 16:52:44 diesel wickedd-nanny[2328]: eth3: state=lldp-up want=network-up, wait-for=addrconf-up
Sep 04 16:52:44 diesel wickedd-nanny[2328]: waiting for 1 devices to become ready (1 explicitly requested)
Sep 04 16:52:44 diesel wickedd-nanny[2328]: ni_nanny_recheck(erspan0[0], unmanaged)
Sep 04 16:52:44 diesel wickedd-nanny[2328]: erspan0: no applicable policies
Sep 04 16:52:49 diesel worker[22287]: [warn] [pid:22287] The average load 10.06 is exceeding the configured threshold of 10. - checking again for web UI 'openqa.suse.de' in 109.89 s
Sep 04 16:54:33 diesel worker[21963]: [debug] [pid:21963] Accepting job 15340647 from openqa.suse.de.
Sep 04 16:54:33 diesel worker[21963]: [debug] [pid:21963] Setting job 15340647 from openqa.suse.de up
Sep 04 16:54:33 diesel worker[21963]: [debug] [pid:21963] Preparing Mojo::IOLoop::ReadWriteProcess::Session
Sep 04 16:54:33 diesel worker[21963]: [info] [pid:21963] +++ setup notes +++

Judging by the other logs, the machine was under heavy load at the time but did not generally show connectivity problems (e.g. the worker had no connection problems). In other cases this also happens without high load, while the worker is uploading results without problems:

Sep 04 17:16:55 diesel worker[20565]: [debug] [pid:20565] Upload concluded (at bci_test_docker)
Sep 04 17:16:56 diesel worker[21866]: [debug] [pid:21866] REST-API call: POST http://openqa.suse.de/api/v1/jobs/15340753/status
Sep 04 17:16:56 diesel worker[21866]: [debug] [pid:21866] Upload concluded (at boot_to_desktop)
Sep 04 17:16:57 diesel worker[20566]: [debug] [pid:20566] Updating status so job 15340825 is not considered dead.
Sep 04 17:16:57 diesel worker[20566]: [debug] [pid:20566] REST-API call: POST http://openqa.suse.de/api/v1/jobs/15340825/status
Sep 04 17:16:57 diesel worker[20567]: [debug] [pid:20567] Updating status so job 15340815 is not considered dead.
Sep 04 17:16:57 diesel worker[20567]: [debug] [pid:20567] REST-API call: POST http://openqa.suse.de/api/v1/jobs/15340815/status
Sep 04 17:16:57 diesel worker[21841]: [debug] [pid:21841] Updating status so job 15340823 is not considered dead.
Sep 04 17:16:57 diesel worker[21841]: [debug] [pid:21841] REST-API call: POST http://openqa.suse.de/api/v1/jobs/15340823/status
Sep 04 17:16:59 diesel wickedd[2313]: eth3: updated address 2a07:de40:a102:5:9abe:94ff:fe04:4817/64 in ipv6:auto lease (owner auto)
Sep 04 17:16:59 diesel wickedd[2313]: eth3: updated address 2a07:de40:a102:5:9abe:94ff:fe04:4817/64 in ipv6:auto lease (owner auto)
Sep 04 17:17:00 diesel wickedd[2313]: eth3: verifying adressses for ipv6:auto lease in state applying: success [after 0m0.000s]
Sep 04 17:17:00 diesel wickedd[2313]: Preparing xml lease data for '/var/lib/wicked/lease-eth3-auto-ipv6.xml'
Sep 04 17:17:00 diesel wickedd[2313]: Writing lease to temporary file for '/var/lib/wicked/lease-eth3-auto-ipv6.xml'
Sep 04 17:17:00 diesel wickedd[2313]: Lease written to file '/var/lib/wicked/lease-eth3-auto-ipv6.xml'
Sep 04 17:17:00 diesel wickedd[2313]: eth3: writing lease file for ipv6:auto lease in state granted: success [after 0m0.000s]
Sep 04 17:17:00 diesel wickedd[2313]: eth3: created netconfig batch file to update lease ipv6:auto in state granted
Sep 04 17:17:00 diesel wickedd[2313]: netconfig batch add: modify -i eth3 -s wicked-auto-ipv6 -I /run/wicked/leaseinfo.eth3.auto.ipv6
Sep 04 17:17:00 diesel wickedd[2313]: netconfig batch add: update
Sep 04 17:17:00 diesel wickedd[2313]: eth3: applying system config for ipv6:auto lease in state granted: deferred [since 0m0.000s]
Sep 04 17:17:00 diesel wickedd[2313]: ni_process_reap: reaping child process 40747 (/etc/wicked/extensions/netconfig batch)
Sep 04 17:17:00 diesel wickedd[2313]: subprocess 40747 (/etc/wicked/extensions/netconfig batch) exited with status 0 [0m0.268s]
Sep 04 17:17:00 diesel wickedd[2313]: handle_other_event(generic-updated)
Sep 04 17:17:00 diesel wickedd[2313]: sending event "genericUpdated"
Sep 04 17:17:00 diesel wickedd[2313]: finished finished install job[3468](2) on device eth3[5] for lease ipv6:auto state granted
Sep 04 17:17:00 diesel wickedd[2313]: eth3: applying system config for ipv6:auto lease in state granted: success [after 0m0.297s]
Sep 04 17:17:00 diesel wickedd[2313]: eth3: updater for lease ipv6:auto in state granted finished: success [0m0.855s]
Sep 04 17:17:00 diesel wickedd[2313]: eth3: ipv6:auto lease updated (state granted), sending addressAcquired event
Sep 04 17:17:00 diesel wickedd[2313]: sending device event "addressAcquired" for /org/opensuse/Network/Interface/5; uuid=<088bc666-6e14-0b00-0909-000027000000>
Sep 04 17:17:00 diesel wickedd-nanny[2328]: process event signal addressAcquired from /org/opensuse/Network/Interface/5; uuid=<088bc666-6e14-0b00-0909-000027000000>
Sep 04 17:17:00 diesel wickedd-nanny[2328]: eth3: processed event addressAcquired; state=starting, policy=policy__eth3
Sep 04 17:17:00 diesel wickedd-nanny[2328]: eth3: waiting for more addressAcquired events...
Sep 04 17:17:00 diesel wickedd-nanny[2328]: eth3: waiting for callbacks:
Sep 04 17:17:00 diesel wickedd-nanny[2328]:         088bc666-6e14-0b00-0909-000026000000 event=addressAcquired
Sep 04 17:17:00 diesel wickedd-nanny[2328]: eth3: state=lldp-up want=network-up, wait-for=addrconf-up
Sep 04 17:17:00 diesel wickedd-nanny[2328]: waiting for 1 devices to become ready (1 explicitly requested)
Sep 04 17:17:00 diesel wickedd-nanny[2328]: ni_nanny_recheck(erspan0[0], unmanaged)
Sep 04 17:17:00 diesel wickedd-nanny[2328]: erspan0: no applicable policies
Sep 04 17:17:01 diesel worker[21963]: [debug] [pid:21963] REST-API call: POST http://openqa.suse.de/api/v1/jobs/15340818/status
Sep 04 17:17:01 diesel worker[22287]: [debug] [pid:22287] REST-API call: POST http://openqa.suse.de/api/v1/jobs/15340816/status
Sep 04 17:17:01 diesel worker[21963]: [debug] [pid:21963] Upload concluded (at boot_to_desktop)
Sep 04 17:17:01 diesel worker[40959]: [debug] [pid:40959] Optimizing /var/lib/openqa/pool/5/testresults/boot_to_desktop-4.png
Sep 04 17:17:02 diesel worker[40959]: [debug] [pid:40959] Uploading artefact boot_to_desktop-4.png as 86460e48e05a1d2af0ab5f99542afd59
Sep 04 17:17:02 diesel worker[40959]: [debug] [pid:40959] Optimizing /var/lib/openqa/pool/5/testresults/.thumbs/boot_to_desktop-4.png
Sep 04 17:17:02 diesel worker[40959]: [debug] [pid:40959] Uploading artefact boot_to_desktop-4.png as 86460e48e05a1d2af0ab5f99542afd59
Sep 04 17:17:02 diesel worker[22287]: [debug] [pid:22287] Upload concluded (up to boot_to_desktop)
Sep 04 17:17:05 diesel worker[20565]: [debug] [pid:20565] REST-API call: POST http://openqa.suse.de/api/v1/jobs/15340811/status

Otherwise I couldn't find anything related in the worker logs.
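
To check the load correlation more systematically than by reading excerpts, something like the following could be run over an exported journal. This is only a rough sketch: it assumes the line format shown in the excerpts above, assumes the year 2024 (the journal lines carry no year), and the helper names `parse` and `max_load_near` as well as the 2-minute window are made up for illustration. It pairs every wickedd "ipv6:auto lease updated" message with the highest worker load logged nearby.

```python
import re
from datetime import datetime, timedelta

# Journal prefix as seen in the excerpts, e.g.
# "Sep 04 16:52:44 diesel wickedd[2313]: <message>"
LINE_RE = re.compile(r"^(\w{3} \d{2} \d{2}:\d{2}:\d{2}) (\S+) ([\w-]+)\[(\d+)\]: (.*)$")
LOAD_RE = re.compile(r"The average load ([\d.]+) is exceeding")
LEASE_MARKER = "ipv6:auto lease updated"


def parse(lines, year=2024):
    """Return (lease_timestamps, [(timestamp, load), ...]).

    The year is an assumption because journal short format omits it.
    Month names are parsed with %b, so an English/C locale is assumed.
    """
    leases, loads = [], []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # skip anything that is not a journal line
        stamp, _host, unit, _pid, msg = m.groups()
        ts = datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S")
        if unit == "wickedd" and LEASE_MARKER in msg:
            leases.append(ts)
        elif unit == "worker":
            load = LOAD_RE.search(msg)
            if load:
                loads.append((ts, float(load.group(1))))
    return leases, loads


def max_load_near(leases, loads, window=timedelta(minutes=2)):
    """For each lease update, the highest load logged within +/- window, or None."""
    result = []
    for lease_ts in leases:
        nearby = [load for ts, load in loads if abs(ts - lease_ts) <= window]
        result.append((lease_ts, max(nearby) if nearby else None))
    return result


if __name__ == "__main__":
    import sys

    lease_events, load_readings = parse(sys.stdin)
    for ts, load in max_load_near(lease_events, load_readings):
        print(ts.isoformat(), "max load nearby:", load)
```

Fed with the journal text on stdin, a `None` next to a lease update would match the second excerpt, where the lease update happens without any load warning from the worker. Note that the worker only logs the load when it exceeds the threshold, so `None` means "no warning logged", not "load was zero".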

#1

Updated by livdywan 4 months ago

  • Tags changed from alert, reactive work to alert, reactive work, infra
  • Target version set to future

I would have expected an active alert or a silence for this. Maybe it is sporadic.
