Project

General

Profile

action #120780

[Alerting] InfluxDB not reachable, turned ok after some minutes

Added by okurz 3 months ago. Updated 2 months ago.

Status:
Resolved
Priority:
Normal
Assignee:
Target version:
Start date:
2022-11-20
Due date:
% Done:

0%

Estimated time:

Description

Observation

Received email "[Alerting] InfluxDB not reachable" at 2022-11-20 03:37. https://stats.openqa-monitor.qa.suse.de/d/EML0bpuGk/monitoring?tab=alert&viewPanel=2&orgId=1&from=1668911669806&to=1668912007654&editPanel=2 shows the alert to go to pending 03:35 and alerting 2m later but within the same minute to ok. So apparently during our weekly reboot some startup took a little bit longer exceeding the 2m threshold. We should be more forgiving.


Related issues

Copied to openQA Infrastructure - action #120783: [Alerting] failed systemd service on worker11, os-autoinst-openvswitch. Failed at system boot, turned ok after some hours size:MResolved2022-11-20

History

#2 Updated by okurz 3 months ago

  • Due date set to 2022-12-02
  • Status changed from New to Feedback

#3 Updated by okurz 3 months ago

  • Tags set to alert, reactive work

#4 Updated by okurz 3 months ago

  • Copied to action #120783: [Alerting] failed systemd service on worker11, os-autoinst-openvswitch. Failed at system boot, turned ok after some hours size:M added

#5 Updated by okurz 2 months ago

  • Due date deleted (2022-12-02)
  • Status changed from Feedback to Resolved

merged and deployed. I expect no further problems soon.

Also available in: Atom PDF