Project

General

Profile

Actions

coordination #161414

open

[epic] Improved salt based infrastructure management

Added by okurz 6 months ago. Updated about 5 hours ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
Feature requests
Target version:
QA (public, currently private due to #173521) - future
Start date:
2021-06-22
Due date:
% Done:

38%

Estimated time:
(Total: 0.00 h)
Tags:

Subtasks 18 (11 open7 closed)

action #94492: Configure retention/downsampling policy for monitoring data stored within InfluxDB size:MResolvedmkittler2021-06-22

Actions
action #103380: Configure retention/downsampling policy for specific monitoring data stored within InfluxDBBlockedokurz2021-12-01

Actions
action #161423: [timeboxed:10h] Incomplete config files on OSD due to salt - Improve salt state application from remotely accessible salt master size:SResolvedokurz2024-06-03

Actions
action #161426: incomplete config files on OSD due to salt - introduce post-deploy monitoring steps like in osd-deployment but in salt-states-openqaNew2024-06-03

Actions
action #161429: incomplete config files on OSD due to salt - create annotations in grafana on the time of the osd deployment as well as salt-states-openqa deploymentsNew2024-06-03

Actions
action #162377: incomplete config files on OSD due to salt - Prevent conflicting state applications on OSD "fstab" size:SResolvedokurz2024-06-03

Actions
action #167051: https://gitlab.suse.de/openqa/salt-pillars-openqa/-/jobs/3109145 failed due to telegraf errors on monitor.qa.suse.de size:SResolvednicksinger2024-09-19

Actions
action #167719: No new data in monitor.qe.nue2.suse.org due to influxdb failing to write with ""error opening new segment file for wal (1): write /var/lib/influxdb/….wal: no space left on device"Resolvedokurz2024-10-02

Actions
action #167722: Efficient use of monitoring data within influxdb on monitor.qe.nue2.suse.org size:MWorkablenicksinger2024-10-02

Actions
action #167728: grafana dashboard for monitor.qe.nue2.suse.org size:SResolvedgpathak2024-10-02

Actions
action #168145: implement telegraf health check and adjust according pipelinesNew

Actions
action #168148: hackweek idea: use loki to monitor our log files and explore alerting possibilites based on these size:SResolvednicksinger

Actions
action #170077: Put more storage into qamaster "to make our lives easier in general" size:MBlockedokurz2024-11-19

Actions
action #173344: Extend iPXE in qe/oqa.*.suse.org to also display on local consoleNew

Actions
action #173347: Ensure we have a current backup of qamaster VMs, VM config, jenkins data, data from backup-vm itself, etc. size:SWorkable

Actions
action #173350: Migrate VMs from qamaster to modern hypervisor solutionNew2024-11-29

Actions
action #173353: physically label slots 10+11 on qamaster size:SWorkableokurz2024-11-29

Actions
action #173674: qamaster-independent backup size:SWorkable2024-12-03

Actions
Actions #1

Updated by okurz 6 months ago

  • Subtask #161423 added
Actions #2

Updated by okurz 6 months ago

  • Subtask #161426 added
Actions #3

Updated by okurz 6 months ago

  • Subtask #161429 added
Actions #4

Updated by okurz 6 months ago

  • Subtask #162377 added
Actions #5

Updated by okurz 2 months ago

  • Subtask #167719 added
Actions #6

Updated by okurz 2 months ago

  • Subtask #167722 added
Actions #7

Updated by okurz 2 months ago

  • Subtask #167728 added
Actions #8

Updated by okurz about 2 months ago

  • Subtask #103380 added
Actions #9

Updated by okurz about 2 months ago

  • Subtask #94492 added
Actions #10

Updated by okurz about 2 months ago

  • Subtask #167051 added
Actions #11

Updated by nicksinger about 2 months ago

  • Subtask #168145 added
Actions #12

Updated by nicksinger about 2 months ago

  • Subtask #168148 added
Actions #13

Updated by okurz 7 days ago

  • Subtask #170077 added
Actions #14

Updated by okurz 4 days ago

  • Subtask #173344 added
Actions #15

Updated by okurz 4 days ago

  • Subtask #173347 added
Actions #16

Updated by okurz 4 days ago

  • Subtask #173350 added
Actions #17

Updated by okurz 4 days ago

  • Subtask #173353 added
Actions #18

Updated by okurz about 5 hours ago

  • Subtask #173674 added
Actions

Also available in: Atom PDF