A survey of some site link checker tools. Do you know of another one, preferably one that could be used in a CI environment?
- I wrote it.
- Written in PHP.
- Not all that mature.
- Has a half-baked start at crawling the site instead of just reading sitemap.xml.
- Will eventually output reports in structured file formats, like maybe junit.xml.
- This list exists so I can find out if I need to finish it or not. :-)
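Since structured reports are what make a link checker useful in CI, here is a rough sketch of what JUnit-style output could look like. It is written in Python rather than PHP and is not taken from the tool above; the function name `build_junit_report`, the `(url, status)` input shape, and the rule that any status of 400 or above counts as a failure are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def build_junit_report(results):
    """Build a JUnit-style XML string from a list of (url, status) pairs.

    Hypothetical helper: a status of 400 or above is treated as a failure,
    which is an assumption of this sketch, not a rule from any real tool.
    """
    failures = [(url, status) for url, status in results if status >= 400]
    suite = ET.Element(
        "testsuite",
        name="link-check",
        tests=str(len(results)),
        failures=str(len(failures)),
    )
    for url, status in results:
        # One <testcase> per checked URL.
        case = ET.SubElement(suite, "testcase", classname="links", name=url)
        if status >= 400:
            # A <failure> child marks the test case as failed.
            failure = ET.SubElement(case, "failure", message=f"HTTP {status}")
            failure.text = f"{url} returned HTTP {status}"
    return ET.tostring(suite, encoding="unicode")
```

Writing the returned string to a file such as junit.xml should let most CI servers show each broken link as a failed test, though the exact attributes a given server expects may differ.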
- Written in Dart, which I'd never seen before.
- Very fast.
- Declares that a <link rel="delete-form"> URL returning a 403 is a fail. (It is not.) I couldn't find how to change that.
- Found this one:
http://mile23.com/taxonomy/term/42 - (223:260) '5 comments' => http://mile23.com/content/about-site#comments (HTTP 200 but missing anchor)
- It's telling me that this is a 200 but the page that comes back doesn't have a #comments anchor.
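The "HTTP 200 but missing anchor" check can be approximated in a few lines. This is a minimal Python sketch of the idea, not how the Dart tool does it: the names `AnchorCollector` and `has_anchor` are made up here, and it assumes that a fragment is satisfied by any element's `id` attribute or by the `name` attribute of an `<a>` tag.

```python
from html.parser import HTMLParser


class AnchorCollector(HTMLParser):
    """Collect every id attribute, plus name attributes on <a> tags."""

    def __init__(self):
        super().__init__()
        self.anchors = set()

    def handle_starttag(self, tag, attrs):
        for key, value in attrs:
            if value is None:
                continue  # attribute present without a value
            if key == "id" or (key == "name" and tag == "a"):
                self.anchors.add(value)


def has_anchor(html, fragment):
    """Return True if the HTML defines a target for the given fragment."""
    collector = AnchorCollector()
    collector.feed(html)
    return fragment in collector.anchors
```

For the link above, the checker would fetch http://mile23.com/content/about-site and call something like `has_anchor(body, "comments")`; a False result on a 200 response is the case being reported.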
- Perl. I hate to be a language snob, but... Next.
Broken Link Checker (blc)
- Written in Node.js.
- It doesn't tell me any errors, so clearly it's the best. (That's a joke.)
- Lets you choose which 'level' of tags to use.
  --filter-level  The types of tags and attributes that are considered links.
      0: clickable links
      1: 0 + media, iframes, meta refreshes
      2: 1 + stylesheets, scripts, forms
      3: 2 + metadata
      Default: 1
- Can include or exclude internal and/or external links. Only want to check external links? This is your tool.
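To make the internal/external distinction concrete, here is a small Python sketch of one way to classify links. It is not blc's implementation; `classify_link` and `external_links` are hypothetical names, and the sketch assumes "internal" simply means "same host as the page the link was found on".

```python
from urllib.parse import urljoin, urlparse


def classify_link(base_url, href):
    """Return "internal" or "external" for href as found on base_url."""
    # Resolve relative links against the page they appeared on.
    absolute = urljoin(base_url, href)
    same_host = urlparse(absolute).netloc == urlparse(base_url).netloc
    return "internal" if same_host else "external"


def external_links(base_url, hrefs):
    """Keep only the links that point away from base_url's host."""
    return [h for h in hrefs if classify_link(base_url, h) == "external"]
```

Comparing hosts only is a deliberate simplification: subdomains and non-HTTP schemes such as mailto: all land in the "external" bucket, which a real tool would probably handle more carefully.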
- Written in Python.
- Has GUI + command line.
- Outputs structured reports.
- Installation instructions are wrong for Mac OS: https://github.com/wummel/linkchecker/blob/master/doc/install.txt
- I couldn't find the Mac build, so I'd have to use Xcode to build it. Will experiment in a Linux container later.