Actions
action #133700
openopenQA Project (public) - coordination #155485: [saga][epic] Efficient openQA worker pool resource handling in datacenters
coordination #158374: [epic] Prevention of inefficient hardware resource use
Network bandwidth graphs per switch, like https://mrtg.suse.de/qanet13nue, for all current top-of-rack switches (TORs) that we are connected to size:M
Start date:
2023-08-02
Due date:
% Done:
0%
Estimated time:
Description
Motivation¶
Sometimes or often enough there are various network related issues. To find out the available bandwidth or bottle necks graphs like https://mrtg.suse.de/qanet13nue/index.html can be quite helpful:
We have those available for NUE1 based switches but I would not know about NUE2 or PRG2 so we should research how something equivalent is possible and ensure everybody within our team would be able to reach the according graphs.
Acceptance criteria¶
- AC1: Graphs like from mrtg.suse.de are available to all current SUSE QE Tools members for common racks, e.g. FC_Basement-B1..5, PRG2-J11+PRG2-J12
- AC2: The team knows how to reach those
Suggestions¶
- Ask Eng-Infra where something like https://mrtg.suse.de/qanet13nue/index.html can be found for all the AC1 mentioned TOR switches
- Ask Eng-Infra if metric collection is already in place
- Figure out if we can just collect these metrics on our own
- Ensure we have ACs covered for both FC Basement and PRG2
- Check e.g. https://www.influxdata.com/integration/jti-openconfig-telemetry/
Files
Actions