action #63451

Updated by okurz about 1 year ago

## Observation

On Thursday, 13 February 2020 22.35.33 CET Grafana wrote:
> [Alerting] New incompletes alert
>
> | Metric name | Value |
> | --- | --- |
> | New incompletes | 27.000 |

Keep in mind: I bumped the alert threshold so that the alert only triggers if 25 new incompletes occur within one reporting period, which is just 10 seconds (!). I checked 2 out of many jobs and found the same reason in both:
"Reason: associated worker re-connected but abandoned the job"

The good thing is that they have all been automatically cloned.

## Suggestions


I guess we need two changes to the scripts in https://github.com/os-autoinst/scripts/ :
1. Ignore incompletes that have a clone
2. Look for a useful "reason" when no log file is uploaded
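
A rough sketch of how the two changes could fit together, in Python for illustration only (this is not how the existing scripts are written). It assumes a job is available as a dict with `result`, `clone_id` and `reason` keys, similar to what the openQA job API returns; the function name `triage_incomplete` and the `log_text` parameter are hypothetical.

```python
from typing import Optional


def triage_incomplete(job: dict, log_text: Optional[str] = None) -> Optional[str]:
    """Return the text to match against known issues, or None to skip the job.

    `job` is assumed to be a dict with "result", "clone_id" and "reason"
    keys (an assumption about the job JSON, not a confirmed schema).
    """
    if job.get("result") != "incomplete":
        return None
    # Change 1: an incomplete that already has a clone needs no attention.
    if job.get("clone_id") is not None:
        return None
    # Prefer the uploaded log when there is one.
    if log_text:
        return log_text
    # Change 2: no log file was uploaded, so fall back to the job's "reason".
    return job.get("reason") or None
```

With the jobs from the observation above, every job that was automatically cloned would be skipped, and an uncloned one without a log would be matched on "associated worker re-connected but abandoned the job".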


Who wants to give it a shot? :)
