tickets #16070

closed

planet: check for inactive URLs and remove

Added by tampakrap over 7 years ago. Updated over 4 years ago.

Status:
Closed
Priority:
Normal
Assignee:
-
Category:
Planet
Target version:
-
Start date:
2017-01-18
Due date:
% Done:
100%

Estimated time:

Description

It would be cool to have the following:

  • a check for URLs that don't return 200
  • log the URL, exit status and date it started
  • if it happens for a week, automatically send a mail (or file a ticket with the user in CC)
  • if no reply for a week, remove the blog automatically
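The escalation steps above could be sketched roughly as follows. This is only an illustration under stated assumptions, not an existing planet tool: `FeedState`, `record_check` and `next_action` are hypothetical names, status `0` stands for "no HTTP response at all", and a reply from the blog owner is assumed to be handled by hand (for example by fixing the feed URL, after which the check returns 200 again).

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import Dict, Optional

# Hypothetical grace period: one week before mailing, one more before removal.
GRACE = timedelta(days=7)


@dataclass
class FeedState:
    first_failure: date                 # date the failures started
    last_status: int                    # last HTTP status seen (0 = no response)
    notified_on: Optional[date] = None  # date the owner was mailed, if any


def next_action(state: FeedState, today: date) -> str:
    """Decide what to do with a feed that is currently failing."""
    if state.notified_on is None:
        # Not mailed yet: mail once the feed has been broken for a week.
        return "notify" if today - state.first_failure >= GRACE else "wait"
    # Already mailed: remove once another week has passed without a fix.
    return "remove" if today - state.notified_on >= GRACE else "wait"


def record_check(states: Dict[str, FeedState], url: str,
                 status: int, today: date) -> str:
    """Log one check result for url and return the resulting action."""
    if status == 200:
        # Feed works (again): forget any earlier failure.
        states.pop(url, None)
        return "ok"
    state = states.get(url)
    if state is None:
        state = FeedState(first_failure=today, last_status=status)
        states[url] = state
    else:
        state.last_status = status
    action = next_action(state, today)
    if action == "notify":
        state.notified_on = today
    return action
```

The caller would run `record_check` once per feed per day and act on "notify" (send the mail or file the ticket) and "remove" itself; the sketch deliberately leaves those side effects out.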

Files

check_feeds.sh (838 Bytes), cboltz, 2017-03-18 18:03
broken-feeds-2017-03-18 (1.71 KB), broken feeds as of 2017-03-18, cboltz, 2017-03-18 18:38
planet_status.txt (1.52 KB), orion_0, 2017-05-07 16:02
Actions #1

Updated by tampakrap over 7 years ago

  • Private changed from Yes to No
Actions #2

Updated by tampakrap over 7 years ago

The first check could be done just by parsing the planet software logs.

Actions #3

Updated by cboltz about 7 years ago

I don't have direct access to the planet.o.o server, so I wrote a little script to do the check. It isn't as fancy as in your dreams ;-) and needs a few minutes to download all feeds, but it should get the job done.

The script and today's report (36 broken feeds) are attached.

Someone should re-check the broken feeds in a few days ("check_feeds.sh broken-feeds-2017-03-18") to filter out temporary outages and then contact the affected users.
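For illustration, the re-check idea (run the check again on only the previously broken feeds to filter out temporary outages) might look like this in Python. This is not the attached check_feeds.sh, whose contents are not shown here; `http_status` and `still_broken` are hypothetical names, and the input is assumed to be a plain list of feed URLs.

```python
import http.client
import urllib.error
import urllib.request
from typing import Callable, Iterable, List


def http_status(url: str, timeout: float = 30.0) -> int:
    """Return the HTTP status for url, or 0 if no HTTP response was received."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status such as 404 or 500.
        return err.code
    except (urllib.error.URLError, http.client.HTTPException,
            OSError, ValueError):
        # DNS failure, refused connection, timeout, malformed URL, ...
        return 0


def still_broken(urls: Iterable[str],
                 check: Callable[[str], int] = http_status) -> List[str]:
    """Keep only the URLs that still do not return 200 on this run."""
    return [url for url in urls if check(url) != 200]
```

Calling `still_broken` a few days later on the earlier list of broken feeds leaves only the feeds that failed both times; those are the ones whose owners are worth contacting.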

Actions #4

Updated by orion_0 almost 7 years ago

This is the update of the last tracking I did of the planet sites. Nothing has changed since the last time we spoke.

Actions #5

Updated by cboltz almost 7 years ago

There were some changes since then:

  • linux301 wordpress com was taken over by a spammer, so I removed it
  • blog.karlitschek.de moved, Frank sent a pull request with the new location

In some cases, it helps to shorten the link to the domain - most domains are still reachable, and maybe you can find the feed this way. See https://github.com/openSUSE/planet.opensuse.org/pull/54 for an example ;-)
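As a small sketch of the "shorten the link to the domain" trick: the helper below strips path and query from a feed URL so that the site root can be opened and searched for a current feed link by hand. `site_root` is a hypothetical helper name, and the URL in the usage note is a made-up example.

```python
from urllib.parse import urlsplit, urlunsplit


def site_root(feed_url: str) -> str:
    """Reduce a feed URL to scheme and host, dropping path, query and fragment."""
    parts = urlsplit(feed_url)
    return urlunsplit((parts.scheme, parts.netloc, "/", "", ""))
```

For example, `site_root("https://blog.example.org/feeds/posts/default?alt=rss")` gives `"https://blog.example.org/"`.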

For everything else - please give it a final test, and if it's still broken, send a pull request to remove the broken feeds. It doesn't make sense to keep them if they are broken.

At the same time, I'd recommend sending a mail to opensuse-project with the to-be-removed feeds - maybe some of the people notice that their old blog is mentioned, and send us an updated feed URL ;-)

Actions #6

Updated by lrupp over 4 years ago

  • Status changed from New to Closed
  • % Done changed from 0 to 100

Looks like nobody worked on this for >3 years.

I am closing this in the hope that something happened in between. If not, please open issues on GitHub
( https://github.com/openSUSE/planet.opensuse.org ), so they can be tracked by those who maintain the repo/service.

Lars

Actions #7

Updated by cboltz over 4 years ago

Well, yes and no ;-)

I still run my check script from time to time, try to fix broken feeds in obvious cases, mail their owners etc.

However, I don't do additional paperwork like updating this ticket ;-) So thanks for finally closing it!
