action #104085: openQABot pipeline failed with terminating connection due to administrator command size:S - openQA Infrastructure (public) - openSUSE Project Management Tool

Actions

Copy link

action #104085

closed

openQABot pipeline failed with terminating connection due to administrator command size:S

Added by livdywan over 3 years ago. Updated over 3 years ago.

Status:

Resolved

Priority:

High

Assignee:

jbaier_cz

Category:

Target version:

openQA Project (public) - Ready

Start date:

2021-12-16

Due date:

% Done:

Estimated time:

Description

openQABot pipeline failed like so:

ERROR: Unexpected response: {'errors': [{'message': 'terminating connection due to administrator command\nERROR:  server conn crashed?\nFATAL:  server conn crashed?\nserver closed the connection unexpectedly\n\tThis probably means the server terminated abnormally\n\tbefore or while processing the request.\n', 'locations': [{'line': 1, 'column': 2}], 'path': ['requests']}], 'data': {'requests': None}}
ERROR:root:Something bad happended during reading MR data from SMELT/IBS

Related issues 2 (0 open — 2 closed)

Actions

Copy link

Updated by livdywan over 3 years ago

Re-triggered

Actions

Copy link

Updated by livdywan over 3 years ago

Status changed from New to Feedback

I guess it's fine now

Actions

Copy link

Updated by jbaier_cz over 3 years ago

It is Thursday and the connection to SMELT failed. I am not sure if it is worthy to even file a ticket about this (before the retrigger). We might think about moving the schedule time outside the maintenance window.

Actions

Copy link

Updated by okurz over 3 years ago

Status changed from Feedback to New
Target version set to Ready

Yes, I had already proposed to completely disable all triggering of openQA jobs on OSD in #102716 but no agreement was found so far. I think what we should be able to do here is to retry for long enough so that we can cover outages on non-reachable times of other machines.

For example in openqabot/update/mr.py in _get_mr_requests where we call requests.get use the internal retry feature of requests, similar to how we did in https://github.com/os-autoinst/openqa_review/pull/149/files

Also see https://findwork.dev/blog/advanced-usage-python-requests-timeouts-retries-hooks/

Actions

Copy link

Updated by okurz over 3 years ago

Priority changed from Normal to High

Actions

Copy link

Updated by livdywan over 3 years ago

Subject changed from openQABot pipeline failed with terminating connection due to administrator command to openQABot pipeline failed with terminating connection due to administrator command size:S
Description updated (diff)
Status changed from New to Workable

Actions

Copy link

Updated by jbaier_cz over 3 years ago

Status changed from Workable to In Progress
Assignee set to jbaier_cz

Actions

Copy link

Updated by jbaier_cz over 3 years ago

Status changed from In Progress to Feedback

Basic retry implemented with a simple wrapper call: https://gitlab.suse.de/qa-maintenance/openQABot/-/merge_requests/88

Actions

Copy link

Updated by okurz over 3 years ago

Description updated (diff)
Status changed from Feedback to Resolved

merged. https://gitlab.suse.de/qa-maintenance/openQABot/-/jobs/750202 showed that at least nothing severly broke :) I don't think we need to keep this ticket open until we hit a problem with the network again. Looks good so far. Thanks!

Actions

Copy link

#10

Updated by okurz over 3 years ago

Related to action #105169: Pipeline of openQABot project fails with "urllib.error.HTTPError: HTTP Error 503: Service Unavailable" causing alert/notification added

Actions

Copy link

#11

Updated by nicksinger over 3 years ago

Copied to action #105603: openQABot pipeline failed: "ERROR:root:Something bad happended during reading MR data from SMELT/IBS: Expecting value: line 4 column 1 (char 3)" size:M added

Actions

Copy link

Also available in: Atom PDF

Project

General

Profile

QA (public) » openQA Project (public) » openQA Infrastructure (public)

Tags

Custom queries

action #104085

openQABot pipeline failed with terminating connection due to administrator command size:S

Updated by livdywan over 3 years ago

Updated by livdywan over 3 years ago

Updated by jbaier_cz over 3 years ago

Updated by okurz over 3 years ago

Updated by okurz over 3 years ago

Updated by livdywan over 3 years ago

Updated by jbaier_cz over 3 years ago

Updated by jbaier_cz over 3 years ago

Updated by okurz over 3 years ago

Updated by okurz over 3 years ago

Updated by nicksinger over 3 years ago