Project

General

Profile

Actions

action #134852

open

gitlab CI job fails in telegraf check with unsupported option since telegraf package on monitor.qa.suse.de was downgraded due to invalid repo metadata

Added by okurz about 1 year ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
Start date:
2023-08-30
Due date:
% Done:

0%

Estimated time:

Description

Observation

https://gitlab.suse.de/openqa/salt-states-openqa/-/jobs/1792173#L85 shows a failed telegraf check:

Error running agent: Error loading config file /etc/telegraf/telegraf.d/slo.conf: plugin inputs.exec: line 1: configuration specified the fields ["environment"], but they weren't used

This is due to telegraf 1.22 not supporting the option. Current telegraf version is 1.26. Erroneously the host monitor.qa.suse.de downgraded the telegraf package today due to journalctl -u auto-upgrade --since=today:

Aug 30 03:55:38 monitor systemd[1]: Started Automatically upgrade packages..
Aug 30 03:55:40 monitor sh[4129]: Retrieving repository 'devel_openQA' metadata [.....done]
Aug 30 03:55:41 monitor sh[4129]: Building repository 'devel_openQA' cache [...done]
Aug 30 03:55:47 monitor sh[4129]: Retrieving repository 'Update repository of openSUSE Backports' metadata [......error]
Aug 30 03:55:47 monitor sh[4129]: Repository 'Update repository of openSUSE Backports' is invalid.
Aug 30 03:55:47 monitor sh[4129]: [repo-backports-update|http://download.opensuse.org/update/leap/15.4/backports/] Valid metadata not found at specified URL
Aug 30 03:55:47 monitor sh[4129]: History:
Aug 30 03:55:47 monitor sh[4129]:  - File './repodata/74d0b705b4bb64056928ad7f19a7d2424b5bb0e526df9b95dff98b8c901d4b3e-deltainfo.xml.gz' not found on medium 'http://download.opensuse.org/update/leap/>
Aug 30 03:55:47 monitor sh[4129]:  - Can't provide ./repodata/74d0b705b4bb64056928ad7f19a7d2424b5bb0e526df9b95dff98b8c901d4b3e-deltainfo.xml.gz
Aug 30 03:55:47 monitor sh[4129]: Please check if the URIs defined for this repository are pointing to a valid repository.
Aug 30 03:55:47 monitor sh[4129]: Warning: Skipping repository 'Update repository of openSUSE Backports' because of the above error.
Aug 30 03:55:54 monitor sh[4129]: Retrieving repository 'Update repository with updates from SUSE Linux Enterprise 15' metadata [........................................done]
Aug 30 03:56:21 monitor sh[4129]: Building repository 'Update repository with updates from SUSE Linux Enterprise 15' cache [....done]
Aug 30 03:56:22 monitor sh[4129]: Some of the repositories have not been refreshed because of an error.
Aug 30 03:56:22 monitor sh[4129]: Loading repository data...
Aug 30 03:56:23 monitor sh[4129]: Reading installed packages...
Aug 30 03:56:26 monitor sh[4129]: Warning: You are about to do a distribution upgrade with all enabled repositories. Make sure these repositories are compatible before you continue. See 'man zypper' >
Aug 30 03:56:26 monitor sh[4129]: Computing distribution upgrade...
Aug 30 03:56:28 monitor sh[4129]: The following 31 packages are going to be upgraded:
Aug 30 03:56:28 monitor sh[4129]:   bind-utils device-mapper libdevmapper-event1_03 libdevmapper1_03 libdevmapper1_03-32bit libicu-suse65_1 libicu65_1-ledata liblvm2cmd2_03 libopenjp2-7 libparted0 li>
Aug 30 03:56:28 monitor sh[4129]: The following 3 packages are going to be downgraded:
Aug 30 03:56:28 monitor sh[4129]:   libqt5-qtstyleplugins-platformtheme-gtk2 php7-imagick telegraf
Aug 30 03:56:28 monitor sh[4129]: The following 2 NEW packages are going to be installed:
Aug 30 03:56:28 monitor sh[4129]:   libfstrm0 libprotobuf-c1
Aug 30 03:56:28 monitor sh[4129]: 31 packages to upgrade, 3 to downgrade, 2 new.

The problem is Skipping repository 'Update repository of openSUSE Backports' because of the above error. meaning that we continue the upgrade despite an error to refresh a repo causing the downgrade.

Acceptance criteria

  • AC1: Upgrade is only conducted when all configured repositories can be successfully refreshed
  • AC2: We are alerted if the situation of broken repos persists

Suggestions

  • Research how to only continue with the upgrade after the refresh was successful and abort otherwise

Workaround

Apply the upgrade manually when this happens.


Related issues 1 (0 open1 closed)

Related to openQA Infrastructure - action #134042: auto-update on OSD does not install updates due to "Problem: nothing provides 'libwebkit2gtk3 ..." but service does not fail and we do not get an alert size:MResolvedlivdywan2023-08-092023-09-12

Actions
Actions #1

Updated by okurz about 1 year ago

  • Related to action #134042: auto-update on OSD does not install updates due to "Problem: nothing provides 'libwebkit2gtk3 ..." but service does not fail and we do not get an alert size:M added
Actions

Also available in: Atom PDF