Anarchivist is the name of the rewrite of the AustLII software (www.austlii.edu.au). The project seeks to produce, among other things, a full-text indexing search engine (for remote and local documents) and an XML/XSLT-based document repository.
The goal of this project is to develop a protocol and a server built on top of TCP/IP. I want to manage bookmarks over the network. The protocol will be based on XBEL, the XML Bookmark Exchange Language.
Because of new releases and ongoing activity in similar tools such as swish++ and swish-e, this project has been closed.
AVD is a continuation of the swim project. The goal is to create a suitable SQL server from swim's not-installed DB, and to maintain the swim client. AVD will be used as a gBootRoot method.
BTR Wizard quickly replaces multiple occurrences of text across multiple files. This unique program scans folders for files matching filter criteria, then searches those files for any occurrences of a text string and replaces them all. This is an ideal tool fo
BullFrog is a search engine ranking program, written as a Mozilla Firefox extension. Simply enter one or more URLs and their corresponding keywords or key phrases, and BullFrog will check at what position the URLs appear in Google.
arachne is a C++ library for HTTP crawling and for link, text, and metadata extraction, designed to run in a distributed environment.
Cortez aims to create a new news service model for RSS and blogging. Cortez will simply offer an environment to create posts, read news through RSS (Atom), and syndicate across multiple blogs.
FWebSpider is a web crawler application written in Perl. It crawls a chosen site and features a response cache, URL storage, URL exclusion rules, and more. It is developed to function as the core of a local/global site search engine.
This is a wxPython-based browser/navigator for the FedoraForum (http://forum.fedoraforum.org/).
The Free Knowledge Project aims to build a fully qualified platform for active knowledge exchange.
The FreeMoz project is working on creating MySQL-based directory software similar to that used by the Open Directory Project. The project aims to eventually implement all the features of the ODP, and in most cases to exceed them.
Funnel is a project for use on intranets, or on selected sites on the Internet, that gathers and indexes information from several different sources and makes it available through a sane, usable interface.
An application used to search various web-based genealogy sites simultaneously and review and analyse the data gathered.
High Availability Distributed Search Engine
HADSE is server software for storing indexes in a cluster of servers. The goal is to provide high availability by storing indexes on several nodes. HADSE provides RESTful APIs to easily populate and query the data in your index. Using the cluster APIs, you can retrieve the data regardless of which node hosts it. To avoid any single point of failure, a request can be sent to any node of the cluster; there is no master node. HADSE is in active development. A first running version should be available in a few weeks.
Looking for members who want to participate in the development of a PHP/MySQL application whose sole purpose is to make it possible to deploy a website containing a list of Internet Service Providers ranked by several criteria.
Indir is a network application designed for server scanning. It can search files with conventional names (used by programmers) for data that may be dangerous. The database presently contains over 2000 records and is constantly growing.
A collection of software to implement search engine technology. The overall search technology is built on the individual components of this project; each component is released under the BSD License and is written in the language most suited to its task.
Multilingual software for Bible study and Christian library management.
A web crawler which uses regular expressions on text downloaded from a site.
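As a rough illustration of the idea (not this project's actual code, which is not shown here), a regex-driven crawler can be sketched in Python as follows. The pattern `HREF_RE`, the functions `extract_links` and `crawl`, and the `fetch` callable are all hypothetical names chosen for this sketch; a real crawler would also need politeness delays, robots.txt handling, and a more robust HTML parser.

```python
import re
from urllib.parse import urljoin

# Hypothetical pattern: captures the href value of anchor tags.
HREF_RE = re.compile(r'<a\s[^>]*?href=["\']([^"\']+)["\']', re.IGNORECASE)


def extract_links(base_url, html):
    """Return absolute URLs for each href the pattern finds, in order, without duplicates."""
    links = []
    for href in HREF_RE.findall(html):
        absolute = urljoin(base_url, href)
        if absolute not in links:
            links.append(absolute)
    return links


def crawl(start_url, fetch, max_pages=10):
    """Breadth-first crawl. `fetch` is any callable mapping a URL to page text."""
    queue, visited = [start_url], []
    while queue and len(visited) < max_pages:
        url = queue.pop(0)
        if url in visited:
            continue
        visited.append(url)
        for link in extract_links(url, fetch(url)):
            if link not in visited:
                queue.append(link)
    return visited
```

Passing `fetch` in as a parameter keeps the sketch independent of any particular HTTP library; it could wrap `urllib.request.urlopen` or simply read from a local cache.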
The Jobcrawler search engine is a research project that aims to index the available applications on the Internet. Our mission is to help people who seek a job or an employee on a one-to-one basis and to rule out mediators (job agencies).
jukebx is an MP3 file indexing system with MySQL as the backend database.
Kassandra is an SQL-based Latent Semantic Indexing and search engine written mostly in PHP. Supported formats will be at least HTML, PostScript and PDF.
LANbyrinth is a bot that indexes a LAN and organizes its files. It is initially focused on MP3 file indexing. Features: fully configurable; fast and smart searching; recognizes duplicated files; organizes songs by artist/album, etc.
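How LANbyrinth recognizes duplicated files is not stated. One common approach, sketched below in Python purely as an assumption, is to group files by a hash of their contents. The function names `file_digest` and `find_duplicates` are hypothetical and do not come from the project.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root):
    """Walk `root` and return lists of paths whose contents are byte-identical."""
    by_digest = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            by_digest[file_digest(path)].append(path)
    # Keep only the groups that contain more than one file.
    return [sorted(paths) for paths in by_digest.values() if len(paths) > 1]
```

Content hashing only catches exact copies; two encodings of the same song would need a different technique, such as comparing tags or audio fingerprints.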