Project

General

Profile

Actions

action #35533

closed

action #18164: [devops][tools] monitoring of openqa worker instances

[tools] Monitoring of openqa worker instances via existing SUSE Infra services

Added by acarvajal over 6 years ago. Updated over 5 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Target version:
-
Start date:
2018-04-25
Due date:
% Done:

0%

Estimated time:

Description

Work on poo#18164 has been split into 2 branches:

1) "Basic" monitoring of openQA workers using existing SUSE Infra tools (as is currently done with OSD)
2) "Performance Profiling" of OSD & openQA workers with new instances/tools (grafana, graphite, etc.)

This subtask is created to track only the basic monitoring of openQA workers.

Actions #1

Updated by acarvajal over 6 years ago

Initially adding openqaworker3 to nagios/thruk/icinga with a procedure based on the documentation found on: https://wiki.microfocus.net/index.php?title=SUSE-Development/OPS/Services/Monitoring

Actions #2

Updated by acarvajal over 6 years ago

Merge request on salt-states-openqa to allow TCP connections to ports 6556 and 5666 to the monitoring agents.

https://gitlab.suse.de/openqa/salt-states-openqa/merge_requests/44

Actions #3

Updated by acarvajal over 6 years ago

  • Assignee set to acarvajal
Actions #4

Updated by acarvajal over 6 years ago

Ticket requesting infra@suse.de to add openqaworker3 to openqa-suse host group in thruk/nagios/icinga: https://infra.nue.suse.com/Ticket/Display.html?id=111591

Actions #5

Updated by szarate over 6 years ago

Asked to move the ticket to the openQA-request queue https://infra.nue.suse.com/SelfService/Display.html?id=115255

Actions #6

Updated by okurz over 6 years ago

Because infra tickets by default are not open I suggest to keep this around for tracking. You could set it to blocked though.

Actions #7

Updated by szarate over 6 years ago

  • Status changed from In Progress to Blocked
  • Assignee changed from acarvajal to szarate

Currently blocked by this ticket: https://infra.nue.suse.com/SelfService/Display.html?id=117693 Not adding the ticket to current sprint due to no known ETA from infra

Actions #8

Updated by okurz about 6 years ago

  • Project changed from openQA Project (public) to openQA Infrastructure (public)
  • Subject changed from [tools] Monitoring of openqa worker instances via existing SUSE Infra services to [functional][u][tools] Monitoring of openqa worker instances via existing SUSE Infra services
  • Target version set to Milestone 20

szarate joined qsf-u. Has there been any update? Please ask for an ETA

Actions #9

Updated by szarate about 6 years ago

  • Subject changed from [functional][u][tools] Monitoring of openqa worker instances via existing SUSE Infra services to [tools] Monitoring of openqa worker instances via existing SUSE Infra services
  • Assignee deleted (szarate)

Requested ETA. Unassigning ticket for the time being

Actions #10

Updated by okurz almost 6 years ago

  • Target version deleted (Milestone 20)

removing target version as tools-team does not use milestones

Actions #11

Updated by okurz over 5 years ago

  • Status changed from Blocked to Resolved
  • Assignee set to okurz

https://infra.nue.suse.com/SelfService/Display.html?id=117693 is done, some worker instances are covered by simple monitoring checks. More detailed checks are covered on http://stats.openqa-monitor.qa.suse.de/ , at least a basic ping check is already in place for all workers. This should be enough for this ticket as we have the parent and the "advanced" ticket still.

Actions

Also available in: Atom PDF