A web data extraction (web data mining, web scraping) tool. It leverages well-proven XML and text processing technologies to easily extract useful data from arbitrary web pages.
Andy's PHP Knowledgebase using MySQL is a database-driven web application for storing, searching and updating article content for a knowledgebase. Andy's PHP Knowledgebase is easily customized and has potential for a variety of creative uses.
PHP Search is a search engine script that searches a MySQL database for links and descriptions, much like Google. Data is currently added manually; a crawler is coming soon! Demo at http://www.jhosting.tk/admin/search/search.php
webspider provides a mechanism for retrieving content from the web. With the extended classes, you can do the following things: 1. grab URLs from a specified base URL 2. analyze the contents of a list of URLs 3. get specific files from the web 4. and more
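As a rough illustration of the first step in the webspider entry (grabbing URLs relative to a base URL), here is a minimal Python sketch. It is not webspider's own code or API: the `LinkCollector` class and `extract_links` function are hypothetical names, and the sketch works on an HTML string that has already been fetched, using only the standard library.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Hypothetical helper: collects absolute URLs from <a href="..."> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the base URL.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return every link found in the HTML text, in document order."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    return parser.links
```

Fetching the page itself is left out on purpose, so the sketch can be tried on any HTML string.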
FTPdb is a PHP-based tool that makes searching and keeping inventory of an FTP server as easy as possible. It is completely modular, so adapting it to your needs is a snap.
A VB web crawler, currently under construction, whose goal is to crawl and index the net, most likely through distributed computing (via network).
A fat-client price-checking tool. Similar in spirit to pricerunner and others, except that it checks prices at the source on demand. It is meant to save you from entering the same search criteria on multiple sites and then tabbing through the results to compare them.
A Python app used to download (torrent) files from various RSS feeds. Designed for use with the Transmission client...
OpenAnonymity consists of a module for the Apache 2.0 web server and a framework that enables you to control search engine spider indexing at the word level, rather than at the file level as in Robots exclusion. OA could force spiders to follow these rules.
Our aim is to enable Web applications to consume linked data from the Web. With SQUIN (Semantic Web Query Interface) we will provide a Web data query service as an addition to the LAMP technology stack. This service executes queries over the whole We
The project consists of two parts. One is a J2ME app that gets information such as photo, position, speed and course from GPS and transfers it to the web server. The other is a web app that lets you manage and display the received data using GoogleMap
This project aims to create a searchable archive (for several OSes) for the popular webcomic College Roomies From Hell!!!, located at http://www.crfh.net. The final code can hopefully be modified to help other webcomics and similar projects.
An extensible framework and user interface for combining various structured search and document clustering techniques.
JavaMatch is an engine that can search inside runtime Java data structures and look for objects that best match the criteria that you specify. The extensive query mechanism allows for highly customizable tuning of your match queries.
A system to perform analysis of large documents for the purpose of cataloging similar documents. Similarity is based upon contextual analysis of these documents done by identifying common words and proper nouns.
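To make the idea of similarity through common words concrete, here is a small Python sketch. It uses Jaccard overlap between word sets, a simple stand-in chosen for illustration. The project's actual contextual analysis (including its handling of proper nouns) is not described here, and the function names are hypothetical.

```python
import re


def word_set(text):
    """Return the set of lowercased alphabetic words in the text."""
    return set(re.findall(r"[a-z]+", text.lower()))


def jaccard_similarity(doc_a, doc_b):
    """Fraction of distinct words shared by both documents (0.0 to 1.0)."""
    a, b = word_set(doc_a), word_set(doc_b)
    if not a and not b:
        # Two empty documents: treat as having no measurable similarity.
        return 0.0
    return len(a & b) / len(a | b)
```

For large documents, a real system would likely also weight words by rarity and drop very common words; this sketch skips both for brevity.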
PHP-XML is a class written in PHP to create, edit, modify and read XML documents.
SPFM aims to create a simple and elegant user-based environment to organize and index files on servers running Apache.
With Zip2Map, you can find the map of any ZIP code (currently U.S. only). Looking up a ZIP code returns a map of the location along with its name and state name. The Google Maps API is used with PHP/MySQL and lots of Ajax to make it a real Web 2.0 application.
A web crawler and indexer project for university.
"girtools" is an implementation of Grid Information Retrieval (GIR). GIR is an emerging open standard for IR on the grid designed to allow dynamic, secure creation and searching of distributed information systems.
A drop-in framework for adding tagging (folksonomy) capabilities to existing applications
QZARCH - Quick free-text search. The project aims to deliver a lightweight, file-based free-text search engine that Java-based websites can adopt easily. The features include: - Search for one or more keywords in the content of one or more files -
ASPLinks is a free, open-source and lightweight framework for producing online links directories, written in ASP and ASP.NET for use on MySQL and SQL Server.
This is another open-source search engine, which can be used for educational purposes.