Project

General

Profile

action #116758

Help with adding monitoring for the SLE maintenance update queue size:M

Added by okurz 3 months ago. Updated about 2 months ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Target version:
Start date:
2022-09-19
Due date:
2022-10-13
% Done:

0%

Estimated time:

Description

Motivation

Lately the SLE maintenance update queue was too long hence people want to add monitoring. https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/732 already tried but CI jobs fail. We should help.

Acceptance criteria

  • AC1: The maintenance update queue is visible in grafana on monitor.qa.suse.de

Suggestions

  • Review https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/731
  • Offer help to Zoltan to check if the data is now already correctly present in influxdb, debug telegraf with the according config otherwise, e.g. call journalctl -u telegraf on monitor.qa.suse.de . Then add an according dashboard+panel to grafana
  • Alternative: Do it for him
  • Optional: Improve the original python script, e.g. make it more generic, read configuration from salt pillars, etc.

History

#1 Updated by okurz 2 months ago

  • Subject changed from Help with https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/732 to add monitoring for the SLE maintenance update queue to Help with https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/732 to add monitoring for the SLE maintenance update queue size:M
  • Description updated (diff)
  • Status changed from New to Workable

#2 Updated by okurz 2 months ago

  • Priority changed from Normal to Urgent

We want to be more collaborative and support if others contribute to the tools we maintain so that we do not need to do everything on our own which is good. But we need to prioritze this before the patience of users run out :)

#3 Updated by cdywan 2 months ago

  • Subject changed from Help with https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/732 to add monitoring for the SLE maintenance update queue size:M to Help with adding monitoring for the SLE maintenance update queue size:M
  • Status changed from Workable to In Progress
  • Assignee set to cdywan

I'll look into this

#5 Updated by openqa_review 2 months ago

  • Due date set to 2022-10-13

Setting due date based on mean cycle time of SUSE QE Tools

#6 Updated by cdywan 2 months ago

  • Status changed from In Progress to Resolved

I assume we're good. There's a working dashboard. Thank you everyone for contributing.

#7 Updated by okurz 2 months ago

  • Status changed from Resolved to In Progress

please make sure the dashboard is saved in salt otherwise we might loose it on reinstalls of monitor.qa.suse.de

#9 Updated by tinita 2 months ago

  • Assignee changed from cdywan to tinita

I'm assigning this to me until Cris is back

#11 Updated by tinita about 2 months ago

  • Status changed from In Progress to Feedback

#12 Updated by tinita about 2 months ago

  • Status changed from Feedback to Resolved

#13 Updated by tinita about 2 months ago

  • Assignee changed from tinita to nicksinger

#14 Updated by okurz about 2 months ago

  • Status changed from Resolved to In Progress

tinita wrote:

MR merged.

The new dashboard can be found here: https://stats.openqa-monitor.qa.suse.de/d/rGAgUYoGo/sle-maintenance-update-metrics?orgId=1

That's the wrong one. That's the older WIP dashboard. We need to ensure the JSON file is somehow read like the other files so that it's rolled out by salt.

#15 Updated by nicksinger about 2 months ago

  • Status changed from In Progress to Resolved

I've added the dashboard to the deployment with https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/756 and now it is available at:
https://stats.openqa-monitor.qa.suse.de/d/HWsOjIIxu/sle-maintenance-update-metrics

I consider this done for now.
btw: the AC stated that the dashboard is available in grafana. Strictly speaking this was already fulfilled.

Also available in: Atom PDF