Project

General

Profile

Actions

action #70834

closed

[alert] Refine I/O time alerts for OSD

Added by nicksinger over 3 years ago. Updated over 3 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Target version:
Start date:
2020-09-02
Due date:
% Done:

0%

Estimated time:
Tags:

Description

We have several IO time alerts for OSD itself:

They need to be reworked so that:

  1. The right disk is shown for the right purpose (e.g. /dev/vde is not /results any longer)
  2. DONE: The alert thresholds need to be adjusted to not trigger that often
    • Spikes of up to 7s seem to happen from time to time
    • The situation gets critical if these spikes continue for several minutes

All above linked alerts are on pause right now since they don't provide a big benefit being that flaky.


Related issues 3 (0 open3 closed)

Related to openQA Infrastructure - action #69667: missing monitoring data for vde after partitions where reorderedResolvedmkittler2020-08-06

Actions
Related to openQA Infrastructure - action #73165: [osd] Consolidate "expensive+fast" and "cheap+slow" storage after realizing vdc is "cheap+slow" as wellResolvedokurz2020-09-02

Actions
Related to openQA Infrastructure - action #110269: [alert] QA-Power8-4-kvm + QA-Power8-5-kvm: Disk I/O time alert size:MResolvedkraih

Actions
Actions

Also available in: Atom PDF