Project

General

Profile

Actions

action #158808

closed

coordination #127031: [saga][epic] openQA for SUSE customers

coordination #152955: [epic] Metric-driven project management in SUSE QE Tools team

Prevent HTTP response codes 500 as observed in OSD monitoring size:M

Added by okurz about 1 month ago. Updated 25 days ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
Feature requests
Target version:
Start date:
Due date:
% Done:

0%

Estimated time:

Description

Motivation

See #158550 and #158556. We introduced an alert based on 5xx HTTP responses and found unexpectedly that we have about 120 5xx HTTP responses every hour. We should identify why we have so many hits, fix the problem in either openQA behaviour or the bug in monitoring data and then reduce the alert threshold accordingly.

Acceptance criteria

  • AC1: The number of HTTP 5xx errors is reasonably low
  • AC2: We know how many 500 errors we actually have (so our monitoring doesn't fool us)

Suggestions

  • On OSD grep '" \<500\> ' /var/log/apache2/access_log which right now looks like this
10.149.213.14 - - [10/Apr/2024:03:39:24 +0200] "GET /liveviewhandler/tests/13971445/developer/ws-proxy/status HTTP/1.1" 500 - "-" "Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Firefox/102.0" 363
2a07:de40:b2bf:1b::1117 - - [10/Apr/2024:08:47:18 +0200] "GET /liveviewhandler/tests/13991767/developer/ws-proxy/status HTTP/1.1" 500 - "-" "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/117.0.0.0 Safari/537.36" 5079
10.149.213.14 - - [10/Apr/2024:11:52:14 +0200] "GET /liveviewhandler/tests/13993065/developer/ws-proxy/status HTTP/1.1" 500 - "-" "Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Firefox/102.0" 586
2a07:de40:b203:12:7ec2:55ff:fe24:de70 - - [10/Apr/2024:13:17:33 +0200] "POST /api/v1/mutex/support_server_ready?action=lock HTTP/1.1" 500 860 "-" "Mojolicious (Perl)" 2818

Related issues 1 (0 open1 closed)

Copied from openQA Infrastructure - action #158556: Single-value SLI of OSD HTTP response code successful vs. all size:SResolvedokurz2024-04-07

Actions
Actions

Also available in: Atom PDF