action #96795
closed
CPU Load alert and telegraf going between 41%, 98.5% and 115% CPU
Added by livdywan over 3 years ago.
Updated over 3 years ago.
Description
Observation¶
- CPU Load alert triggered
- htop shows that
telegraf
is going up and down
- telegraf still spiking regardless of the alert being OK
I saw the CPU alert trigger in the meantime, but I couldn't confirm a correlation with Telegraf's spikes which just seem to continue.
Switched it off temporarily for testing via sudo systemctl disable --now telegraf
and I see openqa processes maxing out at 54% as the worst offenders now. SWitching it back on via enable
the spikes are back fully.
- Status changed from New to In Progress
- Assignee set to livdywan
- Due date set to 2021-08-28
Setting due date based on mean cycle time of SUSE QE Tools
merged. I did on osd systemctl daemon-reload && systemctl restart telegraf
. I can confirm that telegraf runs with nice-level 10 now. What's next?
- Status changed from In Progress to Feedback
okurz wrote:
merged. I did on osd systemctl daemon-reload && systemctl restart telegraf
. I can confirm that telegraf runs with nice-level 10 now. What's next?
It looks like telegraf is on average using a lot less CPU than before, so I'm inclined to consider this a success.
- Related to action #96807: Web UI is slow and Apache Response Time alert got triggered added
- Status changed from Feedback to Resolved
cdywan wrote:
It looks like telegraf is on average using a lot less CPU than before, so I'm inclined to consider this a success.
Hence resolving.
Also available in: Atom
PDF