action #152811
closedada.qe.suse.de is not responding to salt commands
0%
Description
Updated by livdywan about 1 year ago
- Copied from action #152673: [alert] `systemctl status iscsid.socket` failed on `s390zl12.oqa.prg2.suse.org` size:S added
Updated by okurz about 1 year ago
Same as #152813 for ada+openqaw5-xen we need to wait for #132617 . Your observation is correct and I wonder if some gitlab CI pipelines or monitoring shouldn't have failed until we remove those hosts from salt. I wonder, how did you find this issue?
Updated by livdywan about 1 year ago
okurz wrote in #note-3:
Same as #152813 for ada+openqaw5-xen we need to wait for #132617 . Your observation is correct and I wonder if some gitlab CI pipelines or monitoring shouldn't have failed until we remove those hosts from salt. I wonder, how did you find this issue?
I was executing salt commands on all machines and these did not respond. It also surprised me that monitoring didn't fail. If they're not expected to be usable, they surely shouldn't be in salt?
Updated by okurz about 1 year ago
livdywan wrote in #note-4:
okurz wrote in #note-3:
Same as #152813 for ada+openqaw5-xen we need to wait for #132617 . Your observation is correct and I wonder if some gitlab CI pipelines or monitoring shouldn't have failed until we remove those hosts from salt. I wonder, how did you find this issue?
I was executing salt commands on all machines and these did not respond. It also surprised me that monitoring didn't fail.
Found it: #151588
If they're not expected to be usable, they surely shouldn't be in salt?
Correct. For those we should follow https://progress.opensuse.org/projects/openqav3/wiki/#Take-machines-out-of-salt-controlled-production
Updated by okurz about 1 year ago
- Description updated (diff)
- Status changed from New to Blocked
- Assignee set to okurz
- Priority changed from High to Normal
- Target version changed from Ready to Tools - Next
removed salt key and added rollback step in description. Blocking on #132617