Project

General

Profile

action #132812

Updated by okurz 10 months ago

## Observation 
 https://monitor.qa.suse.de/d/EML0bpuGk/monitoring?viewPanel=4&orgId=1&from=now-6h&to=now showing 100% packet loss between qa-power8-4 and openqaw5-xen. 

 ## Acceptance criteria 
 * **AC1:** Alert resolved 
 * **AC2:** Alert about packet loss should only fire if we don't already have a related "host up" alert 

 ## Suggestions 
 * Look into the individual alerts and fix the error source 
 * Crosscheck definitions of "host up" and "packet loss" alerts, do we have a redundant alerting overlap? IIRC (okurz) then packet loss was intended to fire only when we have significant packet loss but not hosts being down completely 
 * Ensure all rollback steps are conducted 

 ## Rollback steps 
 * Remove related silences

Back