Project

General

Profile

Actions

coordination #161414

open

[epic] Improved salt based infrastructure management

Added by okurz 6 months ago. Updated 9 days ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
Feature requests
Target version:
Start date:
2021-06-22
Due date:
2024-11-29 (Due in 5 days)
% Done:

50%

Estimated time:
(Total: 0.00 h)
Tags:

Subtasks 12 (6 open6 closed)

action #94492: Configure retention/downsampling policy for monitoring data stored within InfluxDB size:MResolvedmkittler2021-06-22

Actions
action #103380: Configure retention/downsampling policy for specific monitoring data stored within InfluxDBBlockedokurz2021-12-01

Actions
action #161423: [timeboxed:10h] Incomplete config files on OSD due to salt - Improve salt state application from remotely accessible salt master size:SResolvedokurz2024-06-03

Actions
action #161426: incomplete config files on OSD due to salt - introduce post-deploy monitoring steps like in osd-deployment but in salt-states-openqaNew2024-06-03

Actions
action #161429: incomplete config files on OSD due to salt - create annotations in grafana on the time of the osd deployment as well as salt-states-openqa deploymentsNew2024-06-03

Actions
action #162377: incomplete config files on OSD due to salt - Prevent conflicting state applications on OSD "fstab" size:SResolvedokurz2024-06-03

Actions
action #167051: https://gitlab.suse.de/openqa/salt-pillars-openqa/-/jobs/3109145 failed due to telegraf errors on monitor.qa.suse.de size:SResolvednicksinger2024-09-19

Actions
action #167719: No new data in monitor.qe.nue2.suse.org due to influxdb failing to write with ""error opening new segment file for wal (1): write /var/lib/influxdb/….wal: no space left on device"Resolvedokurz2024-10-02

Actions
action #167722: Efficient use of monitoring data within influxdb on monitor.qe.nue2.suse.org size:MWorkablenicksinger2024-10-022024-11-29

Actions
action #167728: grafana dashboard for monitor.qe.nue2.suse.org size:SResolvedgpathak2024-10-02

Actions
action #168145: implement telegraf health check and adjust according pipelinesNew

Actions
action #168148: hackweek idea: use loki to monitor our log files and explore alerting possibilites based on these size:SIn Progressnicksinger

Actions
Actions

Also available in: Atom PDF