Project

General

Profile

Actions

tickets #51335

closed

Pontifex access logs no longer compressed nor complete

Added by jberry about 5 years ago. Updated about 4 years ago.

Status:
Closed
Priority:
Urgent
Assignee:
Category:
Mirrors
Target version:
-
Start date:
Due date:
% Done:

100%

Estimated time:

Description

Since around 23/24 of April the log files are no longer xz compressed and the
sizes are smaller than compressed versions of previous days. This causes the
metrics [1] to be incomplete since then.

Was a change made to the logging setup and if so can it be reverted or fixed?

[1] https://metrics.opensuse.org/d/osrt_access/osrt-access?
orgId=1&from=now-30d&to=now&var-frequency=day

--
Jimmy

Actions #1

Updated by jberry about 5 years ago

It appears the issue is caused by a full disk.

/dev/vdb                         200G  200G   20K 100% /var/log
Actions #2

Updated by lnussel about 5 years ago

  • Priority changed from Normal to Urgent
Actions #3

Updated by pjessen about 5 years ago

  • Category set to Mirrors
  • Private changed from Yes to No

I've cleared out some old apache logs (2017), that gives us some room for now.

Actions #4

Updated by pjessen about 5 years ago

  • Status changed from New to In Progress

Question - how are pontifex logs moved/copied to langley? We have to coordinate the purging of old logs with the archiving on langley.

Actions #5

Updated by jberry almost 5 years ago

The logs have not been moved to langley since the network split for lack of infra providing a mechanism by which to send the data across network or a suitable place to back them up.

Actions #6

Updated by pjessen almost 5 years ago

jberry wrote:

The logs have not been moved to langley since the network split for lack of infra providing a mechanism by which to send the data across network
or a suitable place to back them up.

Thanks. We don't have the room to archive logs on pontifex, so old stuff has to go. The 2018/2019 apache logs for 'download.o.o' take up about 2/3 of the 200Gb we currently have - I've cleared out the 2017 logs, but I expect I'll have to start on 2018 before long, we're 89% full. I guess we could expand the filesystem, if needed, but that'll only prolong the situation.

Actions #7

Updated by cboltz almost 5 years ago

Unsurprisingly, /var/log/ on pontifex was full again (since two days, if I get the monitoring right).

I just deleted /var/log/nginx/downloadcontent/2017/ which had about 20 GB of logs - but once more, that's only a temporary solution. We really need to decide what to do with all the old logs!

Actions #8

Updated by pjessen almost 5 years ago

cboltz wrote:

Unsurprisingly, /var/log/ on pontifex was full again (since two days, if I get the monitoring right).

Might this have to do with the database being unavailable, since Friday morning at least.
A normal month is about 10Gb, but for June 2019 sofar, we already have 14Gb.

I just deleted /var/log/nginx/downloadcontent/2017/ which had about 20 GB of logs - but once more,
that's only a temporary solution. We really need to decide what to do with all the old logs!

I think whoever has a need for them might want to provide some space of us to offload them?

Actions #9

Updated by pjessen almost 5 years ago

Some of the logs in /var/log/apache2/2019/06 were not compressed, presumably because of the filesystem being full.

Actions #10

Updated by pjessen almost 5 years ago

We are currently at 88% full, leaving about 26Gb available. Considering that we need about 5Gb working space when compressing, that is room for less than 2 months, i.e. for June and July, maybe a bit more. We could use /tmp for working space though. Maybe I'll check that out.

145G download.opensuse.org/
19G ipv6.download.opensuse.org/

Of the above, error_logs take up 4.2Gb.
Of the above, logs older than 1 year take up 30Gb.

If we were to keep logs for 1 year only, we should have a working set of 150Gb, that doesn't seem unreasonable.

Actions #11

Updated by pjessen almost 5 years ago

The apache logs are growing rapidly, I expect due to the database problem (mirrordb1 being full). This afternoon we were at 86% full, now we have hit 92%.
I have cleared out apache2 logs more than 1 year old, now 77% full. That should keep us going until tomorrow morning.

Actions #12

Updated by jberry almost 5 years ago

I think whoever has a need for them might want to provide some space of us to offload them?

This is/was langley.suse.de, but since the network split no solution for migrating data across networks has been made available. As such, clearing the logs older than some date/size to keep a rotation seems like the best solution.

Actions #13

Updated by lrupp about 4 years ago

  • Status changed from In Progress to Closed
  • Assignee set to lrupp
  • % Done changed from 0 to 100

FYI: I manually moved 2019 logs to langley.suse.de - if someone needs them...

Actions

Also available in: Atom PDF