Project

General

Profile

Actions

action #116758

closed

Help with adding monitoring for the SLE maintenance update queue size:M

Added by okurz about 2 years ago. Updated about 2 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
-
Start date:
2022-09-19
Due date:
2022-10-13
% Done:

0%

Estimated time:

Description

Motivation

Lately the SLE maintenance update queue was too long hence people want to add monitoring. https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/732 already tried but CI jobs fail. We should help.

Acceptance criteria

  • AC1: The maintenance update queue is visible in grafana on monitor.qa.suse.de

Suggestions

  • Review https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/731
  • Offer help to Zoltan to check if the data is now already correctly present in influxdb, debug telegraf with the according config otherwise, e.g. call journalctl -u telegraf on monitor.qa.suse.de . Then add an according dashboard+panel to grafana
  • Alternative: Do it for him
  • Optional: Improve the original python script, e.g. make it more generic, read configuration from salt pillars, etc.
Actions #1

Updated by okurz about 2 years ago

  • Subject changed from Help with https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/732 to add monitoring for the SLE maintenance update queue to Help with https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/732 to add monitoring for the SLE maintenance update queue size:M
  • Description updated (diff)
  • Status changed from New to Workable
Actions #2

Updated by okurz about 2 years ago

  • Priority changed from Normal to Urgent

We want to be more collaborative and support if others contribute to the tools we maintain so that we do not need to do everything on our own which is good. But we need to prioritze this before the patience of users run out :)

Actions #3

Updated by livdywan about 2 years ago

  • Subject changed from Help with https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/732 to add monitoring for the SLE maintenance update queue size:M to Help with adding monitoring for the SLE maintenance update queue size:M
  • Status changed from Workable to In Progress
  • Assignee set to livdywan

I'll look into this

Actions #5

Updated by openqa_review about 2 years ago

  • Due date set to 2022-10-13

Setting due date based on mean cycle time of SUSE QE Tools

Actions #6

Updated by livdywan about 2 years ago

  • Status changed from In Progress to Resolved

I assume we're good. There's a working dashboard. Thank you everyone for contributing.

Actions #7

Updated by okurz about 2 years ago

  • Status changed from Resolved to In Progress

please make sure the dashboard is saved in salt otherwise we might loose it on reinstalls of monitor.qa.suse.de

Actions #9

Updated by tinita about 2 years ago

  • Assignee changed from livdywan to tinita

I'm assigning this to me until Cris is back

Actions #11

Updated by tinita about 2 years ago

  • Status changed from In Progress to Feedback
Actions #12

Updated by tinita about 2 years ago

  • Status changed from Feedback to Resolved
Actions #13

Updated by tinita about 2 years ago

  • Assignee changed from tinita to nicksinger
Actions #14

Updated by okurz about 2 years ago

  • Status changed from Resolved to In Progress

tinita wrote:

MR merged.

The new dashboard can be found here: https://stats.openqa-monitor.qa.suse.de/d/rGAgUYoGo/sle-maintenance-update-metrics?orgId=1

That's the wrong one. That's the older WIP dashboard. We need to ensure the JSON file is somehow read like the other files so that it's rolled out by salt.

Actions #15

Updated by nicksinger about 2 years ago

  • Status changed from In Progress to Resolved

I've added the dashboard to the deployment with https://gitlab.suse.de/openqa/salt-states-openqa/-/merge_requests/756 and now it is available at:
https://stats.openqa-monitor.qa.suse.de/d/HWsOjIIxu/sle-maintenance-update-metrics

I consider this done for now.
btw: the AC stated that the dashboard is available in grafana. Strictly speaking this was already fulfilled.

Actions

Also available in: Atom PDF