Project

General

Profile

Actions

action #175473

closed

openQA Project (public) - coordination #102906: [saga][epic] Increased stability of tests with less "known failures", known incompletes handled automatically within openQA

openQA Project (public) - coordination #175515: [epic] incomplete jobs with "Failed to find an available port: Address already in use"

OpenQA Jobs test - Incomplete jobs (not restarted) of last 24h alert Salt

Added by gpathak 5 months ago. Updated 5 months ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
Regressions/Crashes
Start date:
2024-12-19
Due date:
% Done:

0%

Estimated time:

Description

Observation

Values
B0=370 
Labels
alertname 	Incomplete jobs (not restarted) of last 24h alert
grafana_folder 	Salt
rule_uid 	cXo2cmBVk

https://monitor.qa.suse.de/d/nRDab3Jiz/openqa-jobs-test?orgId=1&from=2025-01-15T07:00:00.000Z&to=2025-01-15T08:59:59.000Z&viewPanel=panel-17

Can be related: #175464

Rollback actions


Related issues 2 (0 open2 closed)

Related to openQA Project (public) - action #175464: jobs incomplete with auto_review:"setup failure: isotovideo can not be started"Resolvedokurz2025-01-15

Actions
Copied from openQA Infrastructure (public) - action #174586: Incomplete jobs (not restarted) of last 24h alert SaltResolvedgpathak2024-12-192025-01-03

Actions
Actions #1

Updated by gpathak 5 months ago

  • Copied from action #174586: Incomplete jobs (not restarted) of last 24h alert Salt added
Actions #2

Updated by gpathak 5 months ago

  • Subject changed from Incomplete jobs (not restarted) of last 24h alert Salt to OpenQA Jobs test - Incomplete jobs (not restarted) of last 24h alert Salt
Actions #3

Updated by gpathak 5 months ago

  • Description updated (diff)
Actions #4

Updated by gpathak 5 months ago

  • Description updated (diff)
Actions #5

Updated by gpathak 5 months ago

  • Related to action #175464: jobs incomplete with auto_review:"setup failure: isotovideo can not be started" added
Actions #6

Updated by gpathak 5 months ago · Edited

  • Status changed from New to Feedback

It was triggered due to #175464 and since the mitigation is done to address the issue this can be closed. The alert is not firing as of now https://monitor.qa.suse.de/d/nRDab3Jiz/openqa-jobs-test?orgId=1&from=now-3h&to=now&viewPanel=panel-17
cc: @livdywan

Actions #7

Updated by okurz 5 months ago

  • Parent task set to #175515
Actions #8

Updated by okurz 5 months ago

  • Category set to Regressions/Crashes
  • Status changed from Feedback to In Progress
  • Assignee set to okurz

Right now https://monitor.qa.suse.de/d/nRDab3Jiz/openqa-jobs-test?orgId=1&from=2025-01-15T11:13:16.681Z&to=2025-01-15T12:17:30.509Z&viewPanel=panel-17 still or again shows many not restarted incompletes. Triggering host=openqa.suse.de ./openqa-advanced-retrigger-jobs once more and checking details.

Actions #9

Updated by okurz 5 months ago

  • Status changed from In Progress to Blocked
Actions #10

Updated by okurz 5 months ago

  • Description updated (diff)
  • Priority changed from High to Normal

silenced

Actions #11

Updated by okurz 5 months ago

  • Due date set to 2025-01-29
  • Status changed from Blocked to Feedback

Reaching out to some test maintainers regarding incompletes, e.g.

Actions #13

Updated by okurz 5 months ago

  • Due date deleted (2025-01-29)
  • Status changed from Feedback to Resolved
Actions

Also available in: Atom PDF