Project

General

Profile

Actions

action #90875

closed

powerqaworker-qam-1 is online but https://monitor.qa.suse.de/d/4KkGdvvZk/osd-status-overview shows "No Data"

Added by okurz over 3 years ago. Updated over 3 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Start date:
2021-04-08
Due date:
% Done:

0%

Estimated time:

Description

Observation

https://monitor.qa.suse.de/d/4KkGdvvZk/osd-status-overview?orgId=1 still shows "No Data" for powerqaworker-qam-1 but the "reboot" problem is fixed

Acceptance criteria

Actions #1

Updated by okurz over 3 years ago

seems I am getting closer. In https://monitor.qa.suse.de/d/4KkGdvvZk/osd-status-overview?editPanel=31 if I select explicitly in the query which hosts are there I can see that it is "powerqaworker-qam-1" without any domain whereas for all others it's either .qa.suse.de or .suse.de . In https://monitor.qa.suse.de/d/4KkGdvvZk/osd-status-overview?editview=templating&orgId=1 the variable "workers_ppc64le" has "QA-Power8-5-kvm.qa.suse.de, powerqaworker-qam-1.qa.suse.de, malbec.arch.suse.de, grenache-1.qa.suse.de, QA-Power8-4-kvm.qa.suse.de" configured so with the domain which does not have any data for powerqaworker-qam-1. So I assume we need to find out where the hosts have the domain configured and where it's missing for this specific host

Actions #2

Updated by okurz over 3 years ago

  • Status changed from Workable to In Progress
  • Assignee set to okurz

https://gitlab.suse.de/openqa/salt-states-openqa/-/blob/master/monitoring/telegraf/telegraf-webui.conf#L47 defines the hostnames as they are defined within telegraf and that depends on how the salt minions call themselves in salt keys. that is defined in /etc/salt/minion_id . I don't know if that is maintained manually by us? I changed that manually now on powerqaworker-qam-1.qa.suse.de and accepted the salt key on osd and deleted the other key

Actions #3

Updated by okurz over 3 years ago

  • Status changed from In Progress to Resolved

https://monitor.qa.suse.de/d/4KkGdvvZk/osd-status-overview?orgId=1 shows powerqaworker-qam-1 as online again \o/

Actions

Also available in: Atom PDF