Project

General

Profile

Actions

action #105855

closed

[easy][beginner] openqa-worker@.service should handle NTP unavailability gracefully

Added by dheidler almost 3 years ago. Updated over 2 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
Feature requests
Target version:
Start date:
2022-02-02
Due date:
2022-07-22
% Done:

0%

Estimated time:

Description

Observation

-- Logs begin at Thu 2022-01-06 12:00:01 UTC, end at Wed 2022-02-02 16:18:15 UTC. --
Jan 06 12:00:37 localhost systemd[1]: Starting openQA Worker #4...
Jan 06 12:00:37 localhost systemd[1]: Started openQA Worker #4.
Jan 06 12:00:40 localhost worker[1306]: [info] worker 4:
Jan 06 12:00:40 localhost worker[1306]:  - config file:           /etc/openqa/workers.ini
Jan 06 12:00:40 localhost worker[1306]:  - worker hostname:       localhost
Jan 06 12:00:40 localhost worker[1306]:  - isotovideo version:    24
Jan 06 12:00:40 localhost worker[1306]:  - websocket API version: 1
Jan 06 12:00:40 localhost worker[1306]:  - web UI hosts:          http://10.131.0.32
Jan 06 12:00:40 localhost worker[1306]:  - class:                 generalhw_RPi4
Jan 06 12:00:40 localhost worker[1306]:  - no cleanup:            no
Jan 06 12:00:40 localhost worker[1306]:  - pool directory:        /var/lib/openqa/pool/4
Jan 06 12:00:40 localhost worker[1306]: [info] Project dir for host http://10.131.0.32 is /var/lib/openqa/share
Jan 06 12:00:40 localhost worker[1306]: [info] Registering with openQA http://10.131.0.32
Jan 06 12:00:40 localhost worker[1306]: [error] Failed to register at http://10.131.0.32 - 403 response: timestamp mismatch - ignoring server
Jan 06 12:00:40 localhost worker[1306]: [error] Stopping because registration with all configured web UI hosts failed
Jan 06 12:00:40 localhost systemd[1]: openqa-worker@4.service: Succeeded.

This worker is running on a raspberry pi without a real time clock.
The output is from today (Feb 02) and chronyd is enabled on that machine.
Still the worker doesn't come up because the worker is started before NTP sync is done.

Acceptance criteria

  • AC1: openQA worker does not fail with an error and does not report an error when a time source is not (yet) available

Suggestions

  • Make the code itself resilient to time not being available yet, e.g. just wait until it becomes available

Further details

entrance level issue


Files

g.svg (906 KB) g.svg dheidler, 2022-07-07 09:08

Related issues 2 (1 open1 closed)

Related to openQA Infrastructure (public) - action #111926: osd-deployment pipeline failed: test 481 -le 0, due to job age alert, likely just the raspberry pi based tests stuck in scheduleResolvednicksinger2022-06-01

Actions
Copied to openQA Project (public) - action #113240: openqa-worker@.service returns 0 even when it fails to startNew2022-02-02

Actions
Actions

Also available in: Atom PDF