tickets #51335
closedPontifex access logs no longer compressed nor complete
100%
Description
Since around 23/24 of April the log files are no longer xz compressed and the
sizes are smaller than compressed versions of previous days. This causes the
metrics [1] to be incomplete since then.
Was a change made to the logging setup and if so can it be reverted or fixed?
[1] https://metrics.opensuse.org/d/osrt_access/osrt-access?
orgId=1&from=now-30d&to=now&var-frequency=day
--
Jimmy
Updated by jberry about 5 years ago
It appears the issue is caused by a full disk.
/dev/vdb 200G 200G 20K 100% /var/log
Updated by pjessen about 5 years ago
- Category set to Mirrors
- Private changed from Yes to No
I've cleared out some old apache logs (2017), that gives us some room for now.
Updated by pjessen about 5 years ago
- Status changed from New to In Progress
Question - how are pontifex logs moved/copied to langley? We have to coordinate the purging of old logs with the archiving on langley.
Updated by jberry almost 5 years ago
The logs have not been moved to langley since the network split for lack of infra providing a mechanism by which to send the data across network or a suitable place to back them up.
Updated by pjessen almost 5 years ago
jberry wrote:
The logs have not been moved to langley since the network split for lack of infra providing a mechanism by which to send the data across network
or a suitable place to back them up.
Thanks. We don't have the room to archive logs on pontifex, so old stuff has to go. The 2018/2019 apache logs for 'download.o.o' take up about 2/3 of the 200Gb we currently have - I've cleared out the 2017 logs, but I expect I'll have to start on 2018 before long, we're 89% full. I guess we could expand the filesystem, if needed, but that'll only prolong the situation.
Updated by cboltz almost 5 years ago
Unsurprisingly, /var/log/ on pontifex was full again (since two days, if I get the monitoring right).
I just deleted /var/log/nginx/downloadcontent/2017/ which had about 20 GB of logs - but once more, that's only a temporary solution. We really need to decide what to do with all the old logs!
Updated by pjessen almost 5 years ago
cboltz wrote:
Unsurprisingly, /var/log/ on pontifex was full again (since two days, if I get the monitoring right).
Might this have to do with the database being unavailable, since Friday morning at least.
A normal month is about 10Gb, but for June 2019 sofar, we already have 14Gb.
I just deleted /var/log/nginx/downloadcontent/2017/ which had about 20 GB of logs - but once more,
that's only a temporary solution. We really need to decide what to do with all the old logs!
I think whoever has a need for them might want to provide some space of us to offload them?
Updated by pjessen almost 5 years ago
Some of the logs in /var/log/apache2/2019/06 were not compressed, presumably because of the filesystem being full.
Updated by pjessen almost 5 years ago
We are currently at 88% full, leaving about 26Gb available. Considering that we need about 5Gb working space when compressing, that is room for less than 2 months, i.e. for June and July, maybe a bit more. We could use /tmp for working space though. Maybe I'll check that out.
145G download.opensuse.org/
19G ipv6.download.opensuse.org/
Of the above, error_logs take up 4.2Gb.
Of the above, logs older than 1 year take up 30Gb.
If we were to keep logs for 1 year only, we should have a working set of 150Gb, that doesn't seem unreasonable.
Updated by pjessen almost 5 years ago
The apache logs are growing rapidly, I expect due to the database problem (mirrordb1 being full). This afternoon we were at 86% full, now we have hit 92%.
I have cleared out apache2 logs more than 1 year old, now 77% full. That should keep us going until tomorrow morning.
Updated by jberry almost 5 years ago
I think whoever has a need for them might want to provide some space of us to offload them?
This is/was langley.suse.de, but since the network split no solution for migrating data across networks has been made available. As such, clearing the logs older than some date/size to keep a rotation seems like the best solution.
Updated by lrupp about 4 years ago
- Status changed from In Progress to Closed
- Assignee set to lrupp
- % Done changed from 0 to 100
FYI: I manually moved 2019 logs to langley.suse.de - if someone needs them...