action #135737
closed[alert] Munin - network eth errors - opensuse.org :: openqa.opensuse.org size:M
0%
Description
Observation¶
At 3.35 UTC and 4:20 UTC:
opensuse.org :: openqa.opensuse.org :: eth2 errors
UNKNOWNs: errors is unknown, errors is unknown.
opensuse.org :: openqa.opensuse.org :: eth3 errors
UNKNOWNs: errors is unknown, errors is unknown.
This seems to be coming from:
Received: from ariel.suse-dmz.opensuse.org (openqa.infra.opensuse.org
[192.168.47.13])
Acceptance criteria¶
- AC1: Ethernet network errors from Munin are understood
Suggestions¶
- Maybe find a way to make the errors less unknown in the notifications?
- Look into the munin graphs (See https://progress.opensuse.org/projects/openqav3/wiki/#Monitoring for how to setup the tunnel)
Updated by livdywan over 1 year ago
- Copied to action #135740: [alert] Munin - minion hook failed - opensuse.org :: openqa.opensuse.org - only "label_known_issues" hook scriptssize:M added
Updated by livdywan over 1 year ago
- Subject changed from [alert] Munin - network eth errors - opensuse.org :: openqa.opensuse.org to [alert] Munin - network eth errors - opensuse.org :: openqa.opensuse.org size:M
- Description updated (diff)
- Status changed from New to Workable
Updated by livdywan about 1 year ago
FYI it doesn't seem like these have come back so far
Updated by tinita about 1 year ago
Files under /etc were copied over from old-ariel, which includes configured munin plugins like /etc/munin/plugins/if_err_eth2
and /etc/munin/plugins/if_err_eth3
(symlinks).
But eth2 and eth3 don't exist on new-ariel.
I deleted if_eth2, if_eth3, if_err_eth2 and if_err_eth3.
But I also created if_tun5
and if_err_tun5
because ip a
is showing tun5
. We now got an alert for this device, so not sure yet what to do about this.
cd /etc/munin/plugins
ln -s /usr/lib/munin/plugins/if_err_ if_tun5
ln -s /usr/lib/munin/plugins/if_err_ if_err_tun5
Date: Thu, 28 Sep 2023 10:35:07 +0000
Subject: Munin - network tun5 errors - opensuse.org :: openqa.opensuse.org
opensuse.org :: openqa.opensuse.org :: tun5 errors
UNKNOWNs: errors is unknown, errors is unknown.
Updated by tinita about 1 year ago
- Status changed from In Progress to Feedback
I'm assuming the tun5 alert was simply because I just configured the plugin, because the traffic plugin starts to show values, so it looks fine:
http://localhost:8080/munin/opensuse.org/openqa.opensuse.org/if_tun5.html
Will monitor a bit.
Updated by tinita about 1 year ago
- Status changed from Feedback to Resolved
I assume it's fine now. If we get a tun5 alert again, we can consider disabling it.