Project

General

Profile

Actions

action #40196

closed

[monitoring] monitor internal port 9526, port 80, external port 443 accessibility of o3 and response times size:M

Added by okurz over 6 years ago. Updated 14 days ago.

Status:
Resolved
Priority:
Low
Assignee:
Category:
Feature requests
Start date:
2018-08-23
Due date:
% Done:

0%

Estimated time:

Description

Motivation

Outcome of #39743#note-20 , would be nice to have better monitoring

Acceptance criteria

  • AC1: There is active monitoring for at least 443 on o3

Suggestions

  • The current monitoring solution applicable to o3 is https://zabbix.nue.suse.com. Ensure that in there at least port 443 reachable from outside is covered
  • As applicable extend to monitor for port 80 and/or 9526 reachable by localhost

Related issues 4 (2 open2 closed)

Related to openQA Project (public) - action #39743: [o3][tools] o3 unusable, often responds with 504 Gateway Time-outResolvedokurz2018-08-15

Actions
Related to openQA Infrastructure (public) - action #174316: [o3][zabbix][alert] warning about depleting storage space but no email? size:SWorkable2024-12-12

Actions
Related to openQA Infrastructure (public) - action #174313: [o3][zabbix][alert] / and /var/tmp: "Disk space is low and might be full in 7d (used > 85%)" since 2024-12-11 06:50 size:SWorkablemkittler2025-01-24

Actions
Blocked by openQA Infrastructure (public) - action #156322: zabbix-proxy.dmz-prg2.suse.org not reachable from ariel.suse-dmz.opensuse.orgResolvedjbaier_cz2024-02-29

Actions
Actions #1

Updated by okurz over 6 years ago

  • Related to action #39743: [o3][tools] o3 unusable, often responds with 504 Gateway Time-out added
Actions #2

Updated by coolo about 6 years ago

  • Project changed from openQA Project (public) to openQA Infrastructure (public)
  • Category deleted (168)
Actions #3

Updated by nicksinger about 6 years ago

  • Status changed from New to Blocked
  • Assignee set to okurz

Two questions which need to be clarified first:

  1. which should host should do these checks
  2. where should we store/save the metrics

I know that there is already some kind of grafana+influxdb for opensuse projects. But do they also run checks for other hosts? If so, what do they use to collect metrics? Telegraf?

@okurz: since you know many opensuse-people I'd kindly ask you to clarify these two points. I can then help with setting up the rest.

Actions #4

Updated by okurz about 6 years ago

  • Status changed from Blocked to Feedback

nicksinger wrote:

@okurz: since you know many opensuse-people I'd kindly ask you to clarify these two points. I can then help with setting up the rest.

yes, but I do not have a better way then discussing on #opensuse-admin on freenode so I suggest you try that. You can ping me there as well :)

Also, I think you meant to set the ticket to "Blocked" because you are "waiting" for me, right? I do not know of any other ticket reference, "Feedback" therefore :)

Actions #5

Updated by okurz almost 6 years ago

  • Assignee changed from okurz to nicksinger

@nicksinger back to you

Actions #6

Updated by nicksinger almost 5 years ago

  • Assignee deleted (nicksinger)
Actions #7

Updated by okurz almost 5 years ago

  • Status changed from Feedback to New

ok so back to "New" for the tasks to clarify which grafana instance to use for openqa.opensuse.org or where to run an instance.

Actions #8

Updated by okurz over 4 years ago

  • Priority changed from Normal to Low
Actions #9

Updated by okurz over 4 years ago

  • Subject changed from [tools][monitoring] monitor internal port 9526, port 80, external port 443 accessibility of o3 and response times to [monitoring] monitor internal port 9526, port 80, external port 443 accessibility of o3 and response times
  • Target version set to future
Actions #10

Updated by okurz 11 months ago

  • Target version changed from future to Ready
Actions #11

Updated by okurz 10 months ago

  • Subject changed from [monitoring] monitor internal port 9526, port 80, external port 443 accessibility of o3 and response times to [monitoring] monitor internal port 9526, port 80, external port 443 accessibility of o3 and response times size:M
  • Description updated (diff)
  • Status changed from New to Workable
Actions #12

Updated by jbaier_cz 10 months ago

  • Assignee set to jbaier_cz
Actions #13

Updated by jbaier_cz 10 months ago

  • Status changed from Workable to Blocked

Blocked on #156322

Actions #14

Updated by jbaier_cz 10 months ago

  • Blocked by action #156322: zabbix-proxy.dmz-prg2.suse.org not reachable from ariel.suse-dmz.opensuse.org added
Actions #15

Updated by okurz 10 months ago

  • Target version changed from Ready to Tools - Next
Actions #16

Updated by okurz 6 months ago

  • Category set to Feature requests
  • Status changed from Blocked to Workable
  • Assignee deleted (jbaier_cz)
Actions #17

Updated by okurz 14 days ago

  • Assignee set to okurz

will check current state

Actions #18

Updated by okurz 14 days ago ยท Edited

  • Status changed from Workable to Feedback
  • Target version changed from Tools - Next to Ready
  • Parent task set to #162146
Actions #19

Updated by okurz 14 days ago

  • Related to action #174316: [o3][zabbix][alert] warning about depleting storage space but no email? size:S added
Actions #20

Updated by okurz 14 days ago

  • Related to action #174313: [o3][zabbix][alert] / and /var/tmp: "Disk space is low and might be full in 7d (used > 85%)" since 2024-12-11 06:50 size:S added
Actions #21

Updated by okurz 14 days ago

  • Status changed from Feedback to Resolved
Actions

Also available in: Atom PDF