Project

General

Profile

action #137600

Updated by livdywan 12 months ago

## Observation 

 We had multiple occurrences of packet loss alert over the weekend 

 ``` 
 alertname            	 Packet loss between worker hosts and other hosts alert 
 grafana_folder            	 Salt 
 rule_uid            	 2Z025iB4km 

 http://stats.openqa-monitor.qa.suse.de/d/EML0bpuGk?orgId=1&viewPanel=4 
 ``` 
 
 http://stats.openqa-monitor.qa.suse.de/d/EML0bpuGk?orgId=1&viewPanel=4 

 Currently, the problematic ones according to the panel are: 

 ``` 
 imagetester - walter1.qe.nue2.suse.org    100% 
 petrol-1 - walter1.qe.nue2.suse.org       100% 
 sapworker1 - walter1.qe.nue2.suse.org     100% 
 ``` 

 That is a little bit weird as I manually checked the first one and it can reach each other well 

 ``` 
 walter1:~ #     ping imagetester.qe.nue2.suse.org 
 PING imagetester.qe.nue2.suse.org (10.168.192.249) 56(84) bytes of data. 
 64 bytes from imagetester.qe.nue2.suse.org (10.168.192.249): icmp_seq=7 ttl=64 time=0.326 ms 

 jbaier@imagetester:~>    ping walter1.qe.nue2.suse.org 
 PING walter1.qe.nue2.suse.org (10.168.192.1) 56(84) bytes of data. 
 64 bytes from walter1.qe.nue2.suse.org (10.168.192.1): icmp_seq=1 ttl=64 time=0.331 ms 
 ``` 

 ## Suggestions 
 * Confirm **when** this started happening or if it's no longer an issue 
 * There's no paused alerts

Back