Auto Proxy Filter Test (APFT) automates the testing of safe and unsafe URLs against a content filtering proxy (such as Dansguardian) and helps prevent regressions. APFT is useful to people who are designing filter rules.
Toke is a webmining toolkit for web exploring, indexing and searching for Java. Toke allows to you crawl public or private web sites, in order to create web estatistics, web Pajek graphs, Lucene indexs and word frequency files for data clustering.
Sperowider Website Archiving Suite is a set of Java applications, the primary purpose of which is to spider dynamic websites, and to create static distributable archives with a full text search index usable by an associated Java applet.
InSite is a Web site management tool written in perl. It checks link integrity and does some basic content monitoring of your site's files directly on the local disk, which gives it a huge speed advantage over similar tools.
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
Automatic link management program. Has three functions: List links in database in html format, add links to database using browser and optionaly check for bad links (by cron job). This eliminates the need for the "Report bad link" on too many web sites
Like social boomarking, allows users to share their bookmarks online. Like wiki, anyone can freely edit links. Export / Import boomarks with your browser. Many other features: RSS and Atom feeds, URL check, popular categories, XLIink, ...
The main function of this script is to shorten long website-URL's -- converting long URL's into easy-to-remember, short ones. [htaccess, MOD_REWRITE, XHTML 1.0 strict, CSS 1, JS 1.2, PHP 5X, MySQL 4X]
Programmable web client utilising HttpUnit with input & output files in XML. Eccles includes the ability to create a GUI to control/monitor the processing, and can be used for website testing as well as automating web transactions.
HyperSpider (Java app) collects the link structure of a website. Data import/export from/to database and CSV-files. Export to Graphviz DOT, Resource Description Framework (RDF/DC), XML Topic Maps (XTM), Prolog, HTML. Visualization as hierarchy and map.
W3mon is a tool that can be used to monitor a set of web sites.
By continually polling such web sites (say, once per hour), the
respective webmasters can be notified by e-mail whenever an outage
occurs.
Protect your bandwith and increase traffic to your vBulliten web site.
This project is to keep the location of files a secret that are on FTP of a vBulliten forums website. Avoid using the vB attachments database and hide file locations.
LCat is a php/mysql cataloging system for hyperlinks, intended for use on websites. It is build around a set of objects providing the basic functionality and complemented with a set of scripts and templates to perform communication with the user.
aWebVisit analyses web logfiles for visit information like: entry/transit/exit points, internal links followed, length of visit and time spent.
aWebVisit-Map then allows you to examine the links followed by your visitors to and from each web page...
This Linklist uses a MySQL backend. It can handle mutiple Linklists in only 3 tables. Every Linklist has also Categories and Moderator functions. The Admin page has rich amount of features including easy un-/install. The design is clear but cool.
JoBo is a web site mirroring tool. It has a graphical UI but there is a also command line version. Supports robot exclusion protocol (but this can be disabled)
TightURL takes a very long URL as input, and returns a very short URL.
Example (long URL)
http://service.domain.net/de/cgi/g.fcgi/startpage?site=greetings&CUS-GHTOO=43
Exampe (short URL)
http://www.yourdomain.com/174733788
phpSitemapNG is a free Google Sitemaps generator written in PHP, but also generates RSS-based, txt-based and HTML-based sitemap files. It will spider your website and can also index the filesystem.
Monitoring tool with support to Websites, RSS, Webservices and Databases. Has notifications by email and RSS and you can access metrics like availability, latency and load time by a web-based GUI. Runs standalone with an embedded HTTP server.