A distributed web-crawling system written in PHP featuring a web-based control interface.
NetFAQ is a program for manage a F.A.Q. (Frequently Asked Questions). NetFAQ is at the present time in development status. It's written in PHP/MySQL.
PHP Wrapper Class For ht://Dig is a class I developed while desperately searching for something with similar capabilites. This class is intended to be much more thorough allowing for easily changing headers, footers, and templates. htdig + PHP = htPHP
A PHP extension to Swish-e
This project is for a webpage content monitor called PagePing. It is designed to actively keep track of website content (especially message boards) by repeatedly downloading and diff'ing against the last cached copy.
The "Netiquette abolishment project" ! Replace content RATING by content POSITIONNING. This project's main goal is to create an on line 'real place': It will work like a 3D visualisation software: you select your interests by geting close to them,
QuickReviewer is a mobile (J2ME) client which allows users to query various databases (Amazon) about books, films and music and receive reviews in a central place and in a convenient display.
An RDF-based post content information system with associated APIs, used to provide intelligent information about the status and accessibility of a web document. Functionality: page re-directs, intelligent 404 handling, Threadneedle connectivity and other
It's a robot exclusion file generator that provides an easy way to generate robots.txt
ResumeGazette is a client that obtains email addresses from searches on popular job search sites, then at the direction of the user, sends emails to all of the addresses found along with an attached resume.
New SearchEngine Very young and raring to go. Got some useful features..
SINP uses SlackBuild scripts found at slackbuilds.org and the native Slackware package manager to create "source" packages for Slackware.
SYRAH si propone di far emergere e rappresentare i concetti espressi per mezzo di un linguaggio naturale. SYRAH aims to discover and represent concepts expressed in natural languages. NLP, lemma, lemmario, italiano, rete, semantica, clustering, semantic
(Project is participated in the Zend PHP5 Contest. Project information will be released after the event, Oct 11, 2004)
The Somewhat Intelligent Proxy [SIP] is an effort at an open-source, natural language, web accessible instrument which utilizes Internet sources to return answers to your questions.
Sprawler is the first Open Source internet search engine software and service - built by the community, for the community. It will address the various reasons most search engines today still are far from being where they need to be.
A News Aggregator - not a news reader - to collect news from subscribed RSS channels.
Switchboard is a conceptual-level interface to many web and network related functions (SOAP, REST, XML parsing, screen-scraping, FTP, network sniffing), designed for the Processing environment.
Syndicateme.net ... Ajax Atom 1.0 Syndication Engine Tell your story ... Especially if you are a business along Queen St. in Toronto Canada or King Street Waterloo Canada. Syndication can be from a pop mailbox, and can use XInclude.
Configurable wiki parser and XML based engine for easy generation of documentation.
An alternative to the Ebay API using old school HTML parsing and libcurl.
UTYP is a visual search service for pictures and an alternative challenge-response test to ensure that the response is not generated by a computer. Its framework is based on outsourcing visual recognition of images and picture to human playing games.
Values-based Document Analysis: I want to take some rudimentary Document Analysis work that I have done and make it more sophisticated and to use it to analyze (at least) all of the docuemnts of the web for (human) values priorities. The project woul
A limited-scope fork and extension of the popular 'cvsweb' work originated by Bill Fenner and heavily extended by Henner Zeller. Hoped-for extensions include getting this to also work with Subversion, and adding repository manage functionality.