Unlock Google's potential. Use this application to find infomation that is more relevant to your search... This application allows enhanced searching on Google without the need for long modifiers etc..
Desk.Now is a cross-platform Java client for the WhereIsNow WebService which allows you to know where is the latest version of a document, with just two clicks.
Produce alphabetical index for document repository using SWISH-E. Index files are analysed with WordNet to produce a theme list, which is used for searches to find documents. Theme words in documents are automatically hyperlinked to a list of references.
Caused by new releases and/or activities of similiar tools like swish++ and swish-e this project has been closed.
Cheshire3 is a fast Z39.50, SRW, XML search engine, written in Python for extensability and using C libraries for speed. Next generation of the Cheshire system (http://cheshire.berkeley.edu) and designed around a distributable, object oriented model.
Coherence is an advanced Content Management System build on top of Zope. Coherence has site-, user- and filemanagement. Some of the special features are a WYSIWYG page-editor with a drag and drop interface, versioncontrol, workflow and linkmanagement.
A Perl administration interface for ht://Dig, an open source content indexing and searching system. Includes web-based GUI.
GImageSpider is an Image Spider that has two abilities. GIS can search web by image search engines to find images. GIS can act as an image spider that crawls your arbitrary site by your constraints and find images.
An application used to search various web-based genealogy sites simultaneously and review and analyse the data gathered.
HTTP Directory Index consiste en un script PHP que actúa como interfaz gráfica amigable para indexar directorios Web.
HooDoo is designed to provide most of the same functionality of Google, but available to all for their websites
IGLU is a Java class library designed to facilitate sharing of code among Artificial Intelligence/Information Retrieval researchers to illustrate how various problems can be solved in Java. It is developed and maintained by the IGLU Research Group.
Bible study and Christian library management multilingual software.
A web crawler which uses regular expressions on text downloaded from a site.
Written in PHP and designed to maintain a personal database of bookmarks, Linkerdoodle is a simple link organizer.
My Community Portal is a all in one internet portal that offers, forum, groups, chat, your own e-mail, search engine, internet directory, your own home page, poll's, dating services, buddy list, MP3 and file sharing, and many more.
OMax is set of projects including real estate crawler and management system.
A multi-platform information extraction/ontology population library from HTML documents, written in C++
This is a new rebranching of the Metadot Portal Server to the Open Source environment, metadot corp has closed their development of the project so openmetadot is a way to keep development working.
Open-site PHP code.
OpenSiteSearch is the new Open Source version of OCLC's original java-based web application for building Z39.50 portals (i.e. virtual union catalogues). This project is specifically aimed at the library community.
SlinkE is a highly elastic distributed cloud computing environment. All source code is included in all of the products. Our goal in making it open source is to allow others to contribute to the project.
Web Textual eXtraction Tools C++ Parallel web crawler, noun phrase idenification, Multi-lingual Part of Speech Tagging, Tarjan's Algorithm, Co-RelationShip Mappings...
contentix - open source content management system contentix is a cms and a framework to develop any personalized browser based application. It use xml to store data in media nutral way and xsl to generate output. Check the demowebsite from downloads.
eXhaustive is a search software that crawls the Internet to answer a specific query. It has to work during 1hour - 1day and this way gives the user really pertinent results plus an analysis of all the data downloaded (tonality / related words / ... )