action #132500

Updated by okurz 10 months ago

## Observation 
 Jobs on **openqaworker-arm** are [not running](https://openqa.suse.de/tests/overview?state=scheduled&arch=&flavor=&machine=&test=&modules=&module_re=&distri=sle&build=20230709-1&groupid=414). See the [conversation in Slack](https://suse.slack.com/archives/C02CANHLANP/p1688971832970689). 

 ## Acceptance criteria 
 * **AC1:** **openqaworker-arm**, **openqaw5-xen**, etc., are able to run jobs 
 * **AC2:** https://monitor.qa.suse.de is reachable again 
 * **AC3:** No unhandled alerts on https://monitor.qa.suse.de 


 ## Suggestions 
 * Follow up on the situation with SRV2 
 * Once the climate issue is resolved, ensure OSD jobs are running properly again 
 * Ensure that monitor.qa.suse.de is good again 


 ## Rollback steps 
 * Unsilence https://monitor.qa.suse.de/alerting/silence/4a6e0759-792d-4b1b-b885-3a6d9d1928b8/edit?alertmanager=grafana once malbec is powered back on 
 * Add salt-keys back for at least 
   * malbec 
   * openqaworker14.qa.suse.cz 
   * qesapworker-prg4.qa.suse.cz
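
 For the salt-key step, a minimal dry-run sketch follows. It only prints the `salt-key` accept commands for the hosts listed above rather than running them. `print_accept_cmds` is a hypothetical helper, and it assumes that the commands would be run as root on the salt master and that each minion ID matches the name given here; "malbec" in particular may need its full domain name.

```shell
# Dry-run sketch: prints the salt-key commands instead of executing them.
# Assumption: minion IDs equal the host names from the rollback list;
# "malbec" is a short name and may not be the actual minion ID.
print_accept_cmds() {
  for host in "$@"; do
    # -a accepts the pending key for the given minion, -y skips the prompt
    printf 'salt-key -y -a %s\n' "$host"
  done
}

print_accept_cmds malbec openqaworker14.qa.suse.cz qesapworker-prg4.qa.suse.cz
```

 Running `salt-key -L` before and after shows which keys are still pending and which are accepted.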
