Harvest is a distributed search engine framework. It collects data using various methods such as HTTP, FTP, News, and local files, extracts relevant information, creates indexes, and makes them searchable through a Web interface. All of the collecting, extracti
OpenFTS (Open Source Full Text Search engine) is an advanced PostgreSQL-based search engine that provides online indexing of data and relevance ranking for database searching. Close integration with the database allows the use of metadata to restrict search re
TinyURL PHP script that shortens a long URL into a short one.
Xyzse implements the essential functions of a general web search engine. It is developed for students and anyone else interested in search engines. More features will be added in future releases.
A multi-threaded web spider that finds free porn thumbnail galleries by visiting a list of known TGPs (Thumbnail Gallery Posts). It optionally downloads the located pictures and movies. A TGP list is included. Public-domain Perl script that runs on Linux.
This is an ***old archive*** of tools developed for facilitating the use of Creative Commons licenses and metadata. --- For the most up to date representation of any of the projects listed here, please see: http://creativecommons.org/project/Developer.
Desk.Now is a cross-platform Java client for the WhereIsNow WebService, which lets you find where the latest version of a document is with just two clicks.
DocTaur is a Web-based searchable directory of reference manuals. You can freely download, install, and administer it on your local Linux intranet server. It is powered by the ht://Dig search engine and contains reference manuals for developers.
RIG is a web-based JPEG image album viewer, especially useful for digital camera albums; provides automatic image resizing, preview & thumbnail caching, user authentication; composed of a PHP web interface and a C++ thumbnail engine.
Narrows the search results produced by popular Internet search engines by letting you add extra filtering conditions, such as requiring certain words to be present, certain words to be absent, and so on.
The Swishd cluster system is an application that allows swish-e to scale out to multiple machines.
An easy-to-use set of shell scripts that search TV websites for programs featuring your favourite actors, directors, etc. Output is in CSV for further processing or HTML for reading and printing.
Group file share with advanced text parsing capability for easy search
Originally created as a church resource sharing system, phpShare&Search allows users to create accounts, share documents, search documents, and like or report documents. Its power comes from its advanced document parser, which extracts text from .PDF, .TXT, .DOC, and .DOCX files, and from its community features for liking resources and reporting them as inappropriate or spam. Users can also subscribe to weekly updates of new content. Users may download and host, install, configure, modify, and manage this code themselves, or contract the code writer to do so for them; for a quote on a turnkey solution, email eedrew@users.sourceforge.net. To support future revisions or contribute based on the value you found in this code, check out the External Link drop-down in the menu.
Quack is a daemon-mode gnutella server. It allows file indexing so that searches do not depend solely on the filename, as is required by other gnutella servers.
The purpose of this project is to build a searchable database out of a directory structure of INI files (for album info) and the ID3v1 and ID3v2 tags of MP3s, using PHP, MySQL, and Apache.
Digital Learning Sciences (DLS) is a mission-centered, not-for-profit organization dedicated to improving learning through the use of digital content and tools.
bee-rain is a web crawler that harvests and indexes files over the network. You can see the results on the bee-rain website: http://bee-rain.internetcollaboratif.info/
ASPseek is a full-featured medium-to-large scale SQL-based Internet search engine. It consists of an indexing robot, search daemon and search frontend (CGI program). These programs are written in C++ using the STL library.
A new web crawler with a sophisticated search process specialized by language.
Discontinued lightweight Desktop-Files/SMB/FTP crawler and search engine.
DCTViewer is a robust web-based solution, sponsored by Document Conversions Technology (http://docconversions.com), for digital document searching and viewing in an intranet environment. Features include document storage, indexing, searching and viewing,
A domain name lookup script written in PHP, intended to run on Unix/Linux-based web servers.
Fast File Search is a crawler of FTP servers and SMB shares (Windows shares and UNIX systems running Samba). A WWW interface is provided for searching files. FFS is similar to FemFind but optimized for speed.
Frosttie (FROnt-end SchemaTron Text Internet Engine) takes XHTML pages and processes them with various user-definable filters such as W3C's WAI, Section 508 (US) web usability compliance, ad removal, etc. It can be used with zKnowMan.
This project aims to create an easy-to-use, free eBay hidden counter system in PHP that reports more accurate, more informative, and simply more data than commercial auction counter tools. It links your auctions to a PHP script that logs connections.