webplay creates a web-based jukebox from a collection of MP3 and Ogg Vorbis files (directory or database), with support for multiple independent streams. It also maintains control over each stream and can change the codec or bit rate, skip or go to a track, etc.
"Swish-e is a fast, flexible, and free open source system for indexing collections of Web pages or other files" (http://swish-e.org/). This module provides a Python API for Swish-e.
ALTSE is an alternative search engine technology. It can index up to a couple million Web pages.
An Apache2 DSO search engine module based on the Swish-e C API that returns results by replacing tags in a user-supplied HTML template. Anyone who knows Swish-e and can generate a Swish-e index file should find the searchm interface familiar.
A simple command-line tool that solves the problem of full mailboxes containing mail you don't want to lose. It fetches all the mail from any POP3 mailbox account and generates a searchable HTML archive on your local hard drive. OS: Unix/Linux
BLySP is in development as a means to research and test new and improved P2P protocols. BLySP will, in the near future, make use of BestLyrics to collect the necessary resources (i.e. computers) to accomplish the difficult task of testing and tuning protoco
BeeSeek is a project to build a free, open-source search engine based on peer-to-peer technology. Code and bug reports are available at https://launchpad.net/beeseek-project
Digital comics and picture viewer with database support. GIF, BMP, JPG, PNG, and JPEG 2000 support; Direct3D with zoom and pan; Zip, Rar, and StuffitX formats; database utilities; XML embedding support; web search on comic database pages.
This project will implement DAV Searching & Locating (DASL), an application of HTTP/1.1 that forms a lightweight search protocol for transporting queries and result sets and allows clients to make use of server-side search facilities.
lease-parser is a simple daemon that records the lease state changes of an ISC DHCP server to a database for historical reference. The data can be searched via a web search form that is provided with the tool.
A distributed, highly customizable web search system designed to accept custom parsers that add searchable metadata from the content of a site as well as from the URLs of both the site and the referrer.
DocTaur is a Web-based searchable directory of reference manuals. You can freely download, install, and administer it on your local Linux intranet server. It is powered by the ht://Dig search engine and contains reference manuals for developers.
Estraier is a personal full-text search system for web sites, local file systems, mailboxes, and so on. Estraier has a flexible interface and can handle multilingual documents and various file formats with external plug-ins.
When released, FilmSearch will save you a huge amount of time: you'll no longer have to scan the schedules of dozens of TV channels every day just to find, once a month, something interesting enough to turn on the TV. Each user will be able t
Fleming File Sharing System is a networked file sharing system. It is intended to be much more reliable and user-friendly than FTP or NetBIOS-based sharing.
A project to develop specifications and software for a backwards-compatible Gnutella protocol for real-time searches for anything on the Internet, a.k.a. 'The Universal Search Protocol', intended to join the family of established Internet protocols.
PAD stands for Portable Application Description. PAD is an XML-based open format for describing downloadable applications. By using the PAD system, developers save time because they need to describe their software packages only once.
High Availability Distributed Search Engine
HADSE is server software for storing indexes in a cluster of servers. The goal is to provide high availability by storing indexes on several nodes. HADSE provides RESTful APIs to easily populate and query your index. Using the cluster APIs, you can retrieve data regardless of which node hosts it. To avoid any single point of failure, a request can be sent to any node of the cluster; there is no master node. HADSE is in active development. A first running version should be available in a few weeks.
XPath HTML parser
HXPath is a command-line tool for extracting data from HTML documents. HXPath can select subtrees, like the standard xpath tool, but can also read contents and attributes and output them in a Bash-friendly format. HTML Tidy and HTTP/HTTPS GET are built in too.
Harvest is a web indexing package. Originally designed for distributed indexing, it can form a powerful system for indexing both large and small web sites. It now also includes Harvest-NG, a highly efficient, modular, Perl-based web crawler.
A collection of software to implement search engine technology. The overall search technology is built on the individual components of this project. Each component is released under the BSD License and is written in the language best suited to its task.
Irudiko is a library written in C++ for generating Locality Sensitive Hashing sketches from any text or web document. Designed mainly to work with HTML pages, it also offers optimizations for English and Italian documents.
J-DAWN is a Job-Directed Automated Web Navigator. It can retrieve network tasks, then schedule and execute them. Part of its power lies in the ability to define tasks using a graphical programming language built on an underlying XML foundation.
jukebx is an MP3 file indexing system with MySQL as the backend database.