Indir is a network application designed for server scanning. It can search files with convenentional names (used by programmers) for data that may be dangerous. The database presently contains over 2000 records and is constantly growing.
A collection of software to implement search engine technology. The overall search technology is built on the individual components of this project, each component is released under the BSD License, and is written in the language most suited to its task.
High-performance software for information retrieval research. Emphasis on semi-structured text retrieval, especially for HTML and XML. The goal is to facilitate information retrieval research by providing an interchangable toolkit of functions.
A web content management system with special emphasis on multimedia content. Designed as part of the TITAN grant at Manhattan College. Special thanks to Mike Mucciardi (Project team leader), Matt Joyce (Me), Vlad Panov (Design Layout), and Ananda Das (
JoBo is a web site mirroring tool. It has a graphical UI but there is a also command line version. Supports robot exclusion protocol (but this can be disabled)
The Jobcrawler search engine is a research project in order to index the available applications on the internet. Our mission is to really help people who seek a job or employee on a one to one basis and rule mediators (job agencies) out.
Kassandra is an SQL-based Latent Semantic Indexing and search engine written mostly in PHP. Supported formats will be at least HTML, Postscript and PDF.
Spider that recollects data from MySpace Social Network. At now, it is only designed to extract information from native american people because it is used for a social science study in the UNAM (Universidad Nacional Autónoma de México).
Lazysearch is a quick & dirty search proxy script designed for use with Mac OS X.
Written in PHP and designed to maintain a personal database of bookmarks, Linkerdoodle is a simple link organizer.
Lucene has moved to Jakarta. Please visit http://lucene.apache.org/
MWIP (Mean What I Play) is a clone of PWIM written in Lua. It is faster than the original PWIM (which was in Python), and also contains extra features and better documentation. It is meant to be a complete replacement to PWIM.
Mac GoogleSeach is an OpenSource effort to implement the Google SOAP APIs on Mac OS X.
"MLBibFile" is a tool for Managing Library Bibliography Files, based on PHP and MySQL, to provides a web interface for organizing bibliographic lists that can be text-files, OpenOffice, msOffice, HTML or PDF. It will support English and Hebrew.
The Medlane project is an attempt to create a set of tools that will enable librarians to move from the standard MARC (MAchine Readable Cataloging) format to a new library/museum XML format. This move will ensure traditional library/museum data remains
Site visitor support for any web site. Give a more up to date feel, support returning visitors re-finding what they looked at before. Instantly add news page, upload page, personal history, featured articles, email to friends, etc.
This is a search engine that really works!Written in AJAX, PHP and MySQL,easy installation(only edit the database settings,ex. localhost). Easy to Search Your Requirement Just like, Documents, images, Video, Music and Much More
Convertor of MusicMoz data from XML format into SQL database. Directory and content management system ( CMS ) for MusicMoz data, written on Perl with SQL backend. Integration with end-user music-related content.
This was a terrible idea and is equally terribly implemented.
Not A Blog is a collection of modules based on a common user authentication and sessions module that can separate form from content without sacrificing design control over the content. This is achieved by having a virtual site hierarchy in a MySQL databas
Nucular Archiving System for creating full text indices for fielded data. Python API, web, and command line interfaces. Fast. Very light weight. Concurrent read/writes with no possible locking issues. No server process. Proximity. Facets. Funny name.
OMax is set of projects including real estate crawler and management system.
Omseek has been renamed to Xapian. Xapian is a Search Engine Library, written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C# and Ruby. It allows you to easily add advanced indexing and search facilities to your applications. See xapian.org
Open-site PHP code.
Perl-based web application designed to allow web sites to trivially add a dynamic, free, ever-evolving photo-gallery application to their sites.