action #90635

NTP alerts coinciding with reboots of ppc64le host

Added by okurz 4 months ago. Updated 4 months ago.

Status: Resolved
Priority: Normal
Assignee:
Target version:
Start date: 2021-04-02
Due date:
% Done: 0%
Estimated time:
Description

Observation

Multiple alerts fired for qa-power8-5-kvm, as visible on
https://stats.openqa-monitor.qa.suse.de/d/WDQA-Power8-5-kvm/worker-dashboard-qa-power8-5-kvm?editPanel=65102&tab=alert&viewPanel=65102&orgId=1&from=1617299599160&to=1617310314965
The alerts seem to coincide with reboots that I triggered as an experiment. It seems that the NTP offset exceeds its threshold because many values are missing during a reboot. Likely we can live with a longer pending time to prevent this problem.

Acceptance criteria

  • AC1: NTP offset alert does not trigger on repeated reboots

History

#2 Updated by okurz 4 months ago

https://stats.openqa-monitor.qa.suse.de/d/WDQA-Power8-5-kvm/worker-dashboard-qa-power8-5-kvm?editPanel=65102&tab=alert&viewPanel=65102&orgId=1&from=1617386200044&to=1617387310538

ntp_offset_fine_within_12m.png

The screenshot shows that the machine qa-power8-5-kvm is offline for 9 minutes and comes back with a clock offset of nearly 1s. The offset returns to a sane level after 12m in total, so the new alert threshold of 20m is not exceeded.
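To illustrate why a longer pending time helps, here is a minimal Python sketch of a "condition must hold for N minutes" check. It is not the actual alert rule from the dashboard: the function `alert_fires`, the 0.1s offset limit and the choice to count missing values as violations are all assumptions made for the example. Only the 9 minutes offline, the nearly 1s offset, the 12m recovery and the 20m setting come from the observation above.

```python
from datetime import timedelta


def alert_fires(samples, limit_s, pending):
    """Return True if the condition is violated continuously for `pending`.

    samples: list of (elapsed, offset_seconds) sorted by time, where
    offset_seconds is None for a missing value (host offline).
    Assumption: a missing value counts as a violation (pessimistic case).
    """
    breach_start = None
    for elapsed, offset in samples:
        violating = offset is None or abs(offset) > limit_s
        if not violating:
            breach_start = None  # condition recovered, reset the timer
            continue
        if breach_start is None:
            breach_start = elapsed
        if elapsed - breach_start >= pending:
            return True
    return False


# One sample per minute, modelled after the observation:
# 9 minutes offline, then nearly 1s offset, sane again after 12 minutes.
samples = []
for minute in range(30):
    if minute < 9:
        offset = None      # host is rebooting, no data
    elif minute < 12:
        offset = 0.9       # clock offset of nearly 1s after boot
    else:
        offset = 0.001     # back to a sane level
    samples.append((timedelta(minutes=minute), offset))

LIMIT_S = 0.1  # hypothetical offset limit, not taken from the dashboard

fires_with_20m = alert_fires(samples, LIMIT_S, timedelta(minutes=20))
fires_with_5m = alert_fires(samples, LIMIT_S, timedelta(minutes=5))
print(fires_with_20m, fires_with_5m)
```

With these assumed inputs the 12m excursion ends before a 20m pending time elapses, while a short pending time such as 5m would still trigger on every reboot.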
