Project

General

Profile

action #126212

Updated by livdywan about 1 year ago

## Observation 
 [Tina observed](https://suse.slack.com/archives/C02AJ1E568M/p1679304824699299) very slow responses from the OSD webui at 10:33 CET. Shortly after we got asked in [#eng-testing](https://suse.slack.com/archives/C02CANHLANP/p1679305078007049). 
 The higher load can be well seen in grafana too: https://stats.openqa-monitor.qa.suse.de/d/Webuinew/webui-summary-new?orgId=1&from=1679293281205&to=1679306017314 
 We received no apache response time alerts as far as I can tell. 

 ## Acceptance Criteria 
 * **AC1**: It is known that our alert thresholds are sensible 

 ## Suggestions 
 * Check what caused the high load e.g. by analyzing the apache log in /var/log/apache2 
 * Remediate the offender (e.g. fixing a script, blocking an IP, etc) 
 * Check why the apache response time alert was not firing and check if something needs to be fixed 
   * Apache Response Time should have fired? 
   * Maybe the alert was too relaxed and didn't trigger "yet"? 
   * Should be 10s but even the index page w/o additional ajax took longer? We don't have numbers, though?

Back