HAE is a php-based file system explorer. It provides a user-friendly interface to browse the content of a HTTP server, close to desktop environments.
A SOAP-based Document/File-Sharing solution written in Java. It includes a basic web-interface but other clients are possible. You can share and download all common office document formats like MS Word, Excel, OpenOffice and PDF.
Harvest is a distributed search engine framework. It collects data using various methods like HTTP, FTP, News, local files etc., extracts relevant information, creates indexes and make them searchable using a Web interface. All of the collecting, extracti
This software project aims to create an easy to use eBay hidden counter system in php that reports more accurate, more informative, free, and just more data than commercial auction counter tools. Links your auctions to a php script that logs connections.
Web mining, crawling and indexation. It may do a predefined set of tasks and save you a lot of time (automation) or implement learning capability and decision making (artificial intelligence). CLI PHP daemon, based on TYPO3 framework.
HostingFeed its a small script to know the size and amount of files of a specific folder and tracking throught standard RSS feed readers
Command line HTML Parser to be used in scripts to extract data from HTML/webpage according to supplied path and options. Usefull for systematic periodic parsing pages with known structures where information keeps changing - like looking for item on ebay
HttpFinder is web content searching tool. It enables look for text content that matches given regular expression in html pages/scripts etc. All navigation is performed with use of other regexp which describes links to visit.
TagHybrida is a French hybrid syntactic parser. TagHybrida is a four stage parser combining hand-writen and corpus based information.
The application will be able to provide further information about the location of a host by analyzing the senders IP address. It works like other localizer software and provides different types of visualisation (map, text).
Simple create link index script, easy create category and easy add google adsense can help you make money.Admin function can delete and edit url post
No-hassle file indexing for the web. No database. No external files. One PHP file does it all! a) Web-based filesystem indexing b) Flexible user-by-user permissions c) Attractive interface d) MP3 Streaming Lots more!
A collection of software to implement search engine technology. The overall search technology is built on the individual components of this project, each component is released under the BSD License, and is written in the language most suited to its task.
Data migration/conversion library based on STX and XSLT transformation
Infofuze is a Java library and server application that can be used to transform and combine data from various sources into a specific XML or other text output format that can be stored or indexed.
The Infomap NLP software performs automatic indexing of words and documents from free-text corpora, using a variant of LSA to enable information retrieval and other applications. It was developed by the Infomap Project at Stanford University's CSLI.
High-performance software for information retrieval research. Emphasis on semi-structured text retrieval, especially for HTML and XML. The goal is to facilitate information retrieval research by providing an interchangable toolkit of functions.
Irudiko is a library written in C++ for generating Locality Sensitive Hashing sketches from any textual and web document. Mainly designed to work with HTML pages, it has also an optimization support for English or Italian documents.
A web content management system with special emphasis on multimedia content. Designed as part of the TITAN grant at Manhattan College. Special thanks to Mike Mucciardi (Project team leader), Matt Joyce (Me), Vlad Panov (Design Layout), and Ananda Das (
Job publish and search engine based on Java2EE, Hibernate, PostgreSQL and Jersey with Web interface based on JQuery
Jake is a console based app written in python and qt4. Plugins will let you do almost anything, for example, search in google, translate, view images, talk with it (aka AI bot). Also, skining system will let you choose how should jake look.
JamDB (Just another music DataBase) is a fast PHP/DB based mp3 collection management software with many interesting features.
This is a simple java based interface to the Open Directory Project. (www.dmoz.org) The java class supplied can retrieve data from dmoz on a request per request basis to give your site access to dmoz data.
The Java-Sitemapper is a Java API for building sitemap files to improve search indexing on Google, Yahoo!, MSN, and Ask.com. This project strives to implement the latest in search technology for use on the Java platform.