Search engine and data mining applications and ClueWeb datasets.
The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning to rank library, ClueWeb09 and ClueWeb12 datasets and the Sifaka data mining application.
OpenSiteSearch is the new Open Source version of OCLC's original java-based web application for building Z39.50 portals (i.e. virtual union catalogues). This project is specifically aimed at the library community.
It will show of the files that you have uploaded though it or by ftp to a certain directory. THIS SCRIPT MAY NOT BE LISTED ON ANY OTHER WEBSITE EXCEPT FOR CREAMERSREALM, SOURCEFORGE, HOTSCRIPTS. IF IT IS IT MUST BE REMOVED OR LEGAL ACTION WILL BE TAKEN.
Cheshire3 is a fast Z39.50, SRW, XML search engine, written in Python for extensability and using C libraries for speed. Next generation of the Cheshire system (http://cheshire.berkeley.edu) and designed around a distributable, object oriented model.
Cicerone is a multi-platform, multi-server, multi-database, web-based corporate information system like no other. Completely web-driven and accessible through any 4.x web browser, Cicerone allows your company to create and maintain information on the fly
HTTP Directory Index consiste en un script PHP que actúa como interfaz gráfica amigable para indexar directorios Web.
IGLU is a Java class library designed to facilitate sharing of code among Artificial Intelligence/Information Retrieval researchers to illustrate how various problems can be solved in Java. It is developed and maintained by the IGLU Research Group.
Written in PHP and designed to maintain a personal database of bookmarks, Linkerdoodle is a simple link organizer.
Pansophica is an intelligent web search agent that presents results in a dynamic and interactive virtual reality. Twist, fly and play the net.
A hypertext-browser written in Java which filters links (emails, docs or pics for e.g.) out of .html-documents and paints them on screen in hierarchical order. Users get a quick overview of how a website is put together.