OpenAnonymity consists of a module for apache 2.0 Webserver and a framework that enables you to control search engine spider indexing on a word level, contrary to on file level as in Robots exclusion. OA could force Spiders to follow this rules.
Emine is a python script that parses an email file, separates all the email elements, including words and phrases, and populates a database with file offsets for retrieval from the original file.
Lyfind is a little suite of components for easily searching, modifying and storing song lyrics from a variety of sources (mainly lyrics web sites).
Open-site PHP code.
Easy to use set of shell-scripts to search on tv-websites for programs with your favourite actors, directors etc. Output will be in csv for further use or html for reading and printing.
DNS 'MX' Validator 'MX' Server Validator Forward/Reverse DNS Validator SMTP (ie: 'HELO' or 'EHLO') Banner Validator We intend to provide a website, running a script, that will validate a given domain name's entire SMTP configuration.
"girtools" is an implementation of Grid Information Retrieval (GIR). GIR is an emerging open standard for IR on the grid designed to allow dynamic, secure creation and searching of distributed information systems.
Yet Another Open Search Engine
SemanticDoc is a documentation search engine that provides context specific listing of docbook xml books. Its goal is to provide accurate searches of web documentation that use semantic tags.
Caissfind is expected to be an independent web searching application based on Google API in Java.
This is an web search engine core, this follow links on sites to do 'thinks'. Este é um mecanismo de busca central, que segue links em sites para fazer 'coisas'. By AJSouza at kserv.com.br ( www.kserv.com.br )
Degu is a distributed, linguistic indexing- and search engine based on J2EE.
With phpdefob you can use and compose web objects. While html forms are usually static, phpdefob allows you to provide a simple definition (consisting of elements, subobjects) from which a php class (methods: init, input, check, form, store) is generated
Sprawler is the first Open Source internet search engine software and service - built by the community, for the community. It will address the various reasons most search engines today still are far from being where they need to be.
Open Source Application for databasing your Music Collection(s). iChoons will utilize other open source products such as MySQL, Apache Webserver and PHP as well as Python / wxPython and SQL Lite. We will also be including tools written in Python for Win3
GRM is an Modular Homepage System. Automatical installation and update function on a Webserver with PHP und MYSQL. Written in php. Configuration, installation und Update Assistent.
A collection of software to implement search engine technology. The overall search technology is built on the individual components of this project, each component is released under the BSD License, and is written in the language most suited to its task.
A C++ library for processing Internet Archive ARC, CDX, and DAT files.
An application used to search various web-based genealogy sites simultaneously and review and analyse the data gathered.
This is a simple command line tool, which will solve the problem of full mailboxes with stuff you don't want to lose. It fetches all the mail from any POP3 mailbox account and generates a searchable HTML archive on your local harddrive. OS: Unix/Linux
My Community Portal is a all in one internet portal that offers, forum, groups, chat, your own e-mail, search engine, internet directory, your own home page, poll's, dating services, buddy list, MP3 and file sharing, and many more.
The KB (knowledge base) is a PHP/MySQL solution to having an easily updatable, workflow managed, searchable structure within which anyone with permission, can view entries, create new entries, vote on entries, moderate/approve, and submit files.
"I don't know about other people, but I have a problem with my favorite links. In most cases, I lost them." This project is to create a link repository that will allow for storage of private "favourite links" and sharing of a "public links" storage.
This project aims to create a searchable archive (for several OSes) for the popular webcomic College Roomies From Hell!!!, located at http://www.crfh.net. The final code can hopefully be modified to help other webcomics and similar projects.
A new Web Crawler including sophisticated searching process especialized by language !