Project

General

Profile

Actions

action #159654

closed

coordination #110833: [saga][epic] Scale up: openQA can handle a schedule of 100k jobs with 1k worker instances

coordination #108209: [epic] Reduce load on OSD

high response times on osd - nginx properly monitored in grafana size:S

Added by okurz 3 months ago. Updated about 1 month ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
Feature requests
Target version:
Start date:
2024-04-26
Due date:
% Done:

0%

Estimated time:
Tags:

Description

Motivation

Apache in prefork mode uses a lot of resources to provide mediocre performance. We have nginx on OSD deployed with #159651. Now let's make sure we have it properly monitored as the web proxy is critical for the overall performance and user experience

Acceptance criteria

  • AC1: Nginx on OSD is properly monitored in grafana
  • AC2: No alerts about apache being down

Suggestions

  • Follow #159651 for the actual nginx deployment
  • Add changes to salt-states-openqa including monitoring: we have multiple panels regarding apache that need to be adapted for nginx as applicable
  • Ensure that we have no alerts regarding "oh no, apache is down" ;)

Out of scope

  • No need for any additional metrics, just feature-parity with what we have regarding apache, e.g. response sizes, response codes, response times

Files

20240611_14h33m35s_grim.png (83.4 KB) 20240611_14h33m35s_grim.png jbaier_cz, 2024-06-11 12:33

Related issues 2 (0 open2 closed)

Related to openQA Project - action #160877: [alert] Scripts CI pipeline failing due to osd yielding 502 size:MResolvedmkittler2024-05-24

Actions
Copied from openQA Project - action #159651: high response times on osd - nginx with enabled rate limiting features size:SRejectedokurz2024-04-262024-06-14

Actions
Actions

Also available in: Atom PDF