action #90875
closed
powerqaworker-qam-1 is online but https://monitor.qa.suse.de/d/4KkGdvvZk/osd-status-overview shows "No Data"
0%
Description
Observation
https://monitor.qa.suse.de/d/4KkGdvvZk/osd-status-overview?orgId=1 still shows "No Data" for powerqaworker-qam-1 even though the "reboot" problem is fixed
Acceptance criteria
- AC1: https://monitor.qa.suse.de/d/4KkGdvvZk/osd-status-overview?orgId=1 shows a proper "online"/"offline" status for powerqaworker-qam-1
Updated by okurz over 3 years ago
It seems I am getting closer. In https://monitor.qa.suse.de/d/4KkGdvvZk/osd-status-overview?editPanel=31, when I explicitly select the hosts in the query, I can see that this host is reported as "powerqaworker-qam-1" without any domain, whereas all others are reported with either .qa.suse.de or .suse.de. In https://monitor.qa.suse.de/d/4KkGdvvZk/osd-status-overview?editview=templating&orgId=1 the variable "workers_ppc64le" is configured as "QA-Power8-5-kvm.qa.suse.de, powerqaworker-qam-1.qa.suse.de, malbec.arch.suse.de, grenache-1.qa.suse.de, QA-Power8-4-kvm.qa.suse.de", i.e. with the domain, and there is no data under that name for powerqaworker-qam-1. So I assume we need to find out where the hosts get their domain configured and why it is missing for this specific host.
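The mismatch described above can be illustrated with a small Python sketch. It is not part of the monitoring setup; the function name `find_domain_mismatches` is hypothetical, and the `reported` list is an assumption that mirrors the comment (all hosts report a fully qualified name except powerqaworker-qam-1).

```python
# Hostnames as configured in the "workers_ppc64le" dashboard variable
# (copied from the comment above).
configured = [
    "QA-Power8-5-kvm.qa.suse.de",
    "powerqaworker-qam-1.qa.suse.de",
    "malbec.arch.suse.de",
    "grenache-1.qa.suse.de",
    "QA-Power8-4-kvm.qa.suse.de",
]

# Host names as they appear in the collected data. ASSUMPTION for illustration:
# only the short name of powerqaworker-qam-1 is taken from the ticket, the
# other entries are assumed to match the configured names.
reported = [
    "QA-Power8-5-kvm.qa.suse.de",
    "powerqaworker-qam-1",
    "malbec.arch.suse.de",
    "grenache-1.qa.suse.de",
    "QA-Power8-4-kvm.qa.suse.de",
]


def find_domain_mismatches(configured, reported):
    """Return (configured, reported) pairs where data exists only under the short name."""
    reported_set = set(reported)
    mismatches = []
    for fqdn in configured:
        if fqdn in reported_set:
            continue  # data exists under the configured name
        short = fqdn.split(".", 1)[0]  # strip the domain part
        if short in reported_set:
            mismatches.append((fqdn, short))
    return mismatches


for fqdn, short in find_domain_mismatches(configured, reported):
    print(f"dashboard expects {fqdn!r} but data is reported as {short!r}")
```

A host that shows up in this list would yield "No Data" in a panel that filters on the fully qualified name, which matches the observation for powerqaworker-qam-1.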
Updated by okurz over 3 years ago
- Status changed from Workable to In Progress
- Assignee set to okurz
https://gitlab.suse.de/openqa/salt-states-openqa/-/blob/master/monitoring/telegraf/telegraf-webui.conf#L47 defines the hostnames as they are known within telegraf, and that depends on the name under which each salt minion registers itself in the salt keys. That name is defined in /etc/salt/minion_id. I don't know whether we maintain that file manually. I have now changed it manually on powerqaworker-qam-1.qa.suse.de, accepted the new salt key on osd and deleted the old key.
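As a minimal sketch of how such a host could be spotted before it disappears from the dashboard, the following Python snippet checks whether a minion id carries a domain. The helper names `has_domain` and `read_minion_id` are hypothetical, and treating "contains a dot" as "has a domain" is a simplifying assumption; only the path /etc/salt/minion_id comes from the comment above.

```python
from pathlib import Path


def has_domain(minion_id: str) -> bool:
    """Return True if the minion id looks fully qualified (ASSUMPTION: contains a dot)."""
    return "." in minion_id.strip()


def read_minion_id(path: str = "/etc/salt/minion_id") -> str:
    """Read the minion id from the file mentioned in the ticket."""
    return Path(path).read_text().strip()


# Example usage on a worker host (not executed here):
#   minion_id = read_minion_id()
#   if not has_domain(minion_id):
#       print(f"{minion_id!r} has no domain, dashboard queries by FQDN will miss it")
```

With the values from this ticket, `has_domain("powerqaworker-qam-1")` is false while `has_domain("powerqaworker-qam-1.qa.suse.de")` is true.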
Updated by okurz over 3 years ago
- Status changed from In Progress to Resolved
https://monitor.qa.suse.de/d/4KkGdvvZk/osd-status-overview?orgId=1 shows powerqaworker-qam-1 as online again \o/