A Perl administration interface for ht://Dig, an open source content indexing and searching system. Includes web-based GUI.
=DOES NOT WORK ANYMORE AS DSA HAS PUT CAPTCHA= DSA Practical Driving Test Monitor helps you find any available practical driving test slot within specified date range. Runs on Linux/Mac/Windows and automates your manual task of finding the test slot.
An extensible framework for the automated harvesting, indexing and collation of publicly available data from internet resources for example RSS feeds and webservices.
Deathwatch WebCrawler Personal search engine that runs on any Windows machine with the .NET Framework installed.
Dias is a standalone, recursive directory indexing server. It is based on Apache Lucene and supports currently all Postscript-like, HTML-like and Text-like file formats. It is small, multi-threaded and easy to use. Works with FTP,SMB,WebDAV,eMail...
Advanced fully customizable and extensible desktop search application with ability to index and search documents, pictures, music and video files. It has an ability to extract the elements like abstract, title, keywords and literature from documents.
A simple search engine for LANs. Indexes files in shares over FTP and SMB protocols and provides the ability to search for certain files in this index.
FlixFinder: Tivo & Netflix marriage. Automatically find and schedule upcoming movies in cable/satellite listings based on your netflix queue. Now Greasemonkey script. (Original project deprecated since the tv listings are no longer available).
GImageSpider is an Image Spider that has two abilities. GIS can search web by image search engines to find images. GIS can act as an image spider that crawls your arbitrary site by your constraints and find images.
This application is a google desktop search utility that use google soap web api to enable the end users search google directly from there desktop and also enable them to check spellings on the fly.
An application used to search various web-based genealogy sites simultaneously and review and analyse the data gathered.
Glue is a WSMO compliant discovery engine that aims at developing an efficient system for the management of semantically described Web Services and their discovery.
"Gobble" is a GUI based interface for accessing search results from www.Google.com and allowing the user to download files of a selected type. Functionality for multiple advanced functions is included.
HooDoo is designed to provide most of the same functionality of Google, but available to all for their websites
Command line HTML Parser to be used in scripts to extract data from HTML/webpage according to supplied path and options. Usefull for systematic periodic parsing pages with known structures where information keeps changing - like looking for item on ebay
HttpFinder is web content searching tool. It enables look for text content that matches given regular expression in html pages/scripts etc. All navigation is performed with use of other regexp which describes links to visit.
IGLU is a Java class library designed to facilitate sharing of code among Artificial Intelligence/Information Retrieval researchers to illustrate how various problems can be solved in Java. It is developed and maintained by the IGLU Research Group.
Infomation extraction and indexing modal with Association Rules
Peer To Peer software with an high integration with the Windows operating system, that allow you to execute full text search. The software is based on a peer to peer innovative network technology named DANTE (Digital Autopoietic Network Tree Environment),
Bible study and Christian library management multilingual software.
J-DAWN project is a Job-Directed Automated Web Navigator. It can retrieve network tasks, and schedule and execute them. Part of its power lies in the ability to define tasks using a graphical programming language based on an underlying XML foundation.
A web crawler which uses regular expressions on text downloaded from a site.
Krakatoa is search engine for your desktop with simple and advanced search capabilities. It will search on any key word, exact phrases or files. Search within a domain or site. Fast search engine switching for better results.
The LEADERS toolkit is a generic toolset that enables the creation of an online environment which integrates EAD finding aids and EAC authority records with TEI transcripts and digitised images of archival material suitable to a wide variety of archives.