A PHP search engine and web analytics tool for your website. Licensed under GNU GPL 3.
ahCrawler is a toolset for implementing your own website search and for analyzing your web content. It can be used on shared hosting.
It consists of
* crawler (spider) and indexer
* search for your website(s)
* search statistics
* website analyzer (http header, short titles and keywords, linkchecker, ...)
You install it on your own server, so all crawled data stays in your environment.
You never know when an external web spider last updated its copy of your content. With ahCrawler you can trigger a rescan whenever you want, so you always control which data was checked and when.
...
Centralized syslog-ng monitoring frontend written in PHP
Web interface to monitor many syslog-ng Linux hosts on a central log server. Powered by SphinxSE for very fast full-text search queries.
Tested with a huge number of entries (over 80 000 000) with very good performance.
Easy to set up.
A system to perform analysis of large documents for the purpose of cataloging similar documents. Similarity is based upon contextual analysis of these documents done by identifying common words and proper nouns.
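As a rough illustration of similarity based on common words, here is a minimal sketch. It is not this project's actual method: the function names `word_set` and `similarity`, the minimum word length of four characters, and the use of the Jaccard index are all assumptions made for the example, and proper-noun detection is left out.

```python
import re


def word_set(text):
    """Return the set of lowercased words longer than three characters.

    The length cutoff is an assumed, crude stand-in for stop-word removal.
    """
    return {w for w in re.findall(r"[a-z]+", text.lower()) if len(w) > 3}


def similarity(doc_a, doc_b):
    """Jaccard index of the two word sets: shared words / all distinct words."""
    words_a, words_b = word_set(doc_a), word_set(doc_b)
    if not words_a or not words_b:
        return 0.0
    return len(words_a & words_b) / len(words_a | words_b)
```

Documents could then be cataloged together when their score exceeds a chosen threshold; the right threshold depends on the corpus and would need tuning.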
WEB-PA is a spider that indexes a set of web sites to collect statistics. It is currently run on the sites of Italian public administrations to study and report on their compliance with WWW standards, but it can serve a number of other purposes.
phpCMS is a highly flexible flat-file (no SQL) web CMS with complete separation of content and logic. Features include a powerful menu and template system, plug-in capability, scripting (even non-PHP), a search engine, statistics, e-mail address cloaking, and a fast cache.