Project

General

Profile

Actions

action #125765

closed

Make Telegraf errors visible in alert handling

Added by livdywan almost 2 years ago. Updated over 1 year ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Start date:
2022-12-06
Due date:
% Done:

0%

Estimated time:

Description

Motivation

In the context of #121582 the deployed InfluxDB input wouldn't seem to be picked up by Grafana but we also saw no issues with deployment or alerts to explain that it was broken.

Acceptance criteria

  • AC1: The team is aware of errors in Telegraf inputs

Suggestions

  • Run sudo telegraf --test --config /etc/telegraf/telegraf.d/slo.conf with the according config filename. By default only one config file will be used
  • Use logwarn (c.f. openqa logwarn)
  • Use https://grafana.com/oss/loki/ (maybe overkill?)

Related issues 1 (0 open1 closed)

Copied from QA (public) - action #121582: [tools][metrics] Calculate cycle + lead times for SUSE QE Tools continuously size:MResolvedlivdywan2022-12-062023-03-31

Actions
Actions

Also available in: Atom PDF