XML documents To Generated dynamic web application supporting CRUD actions. Credits to Ministry of Culture and Communication, France; UNESCO; Ecole Nationale des Chartes, France; PASS-TECH, France.
open source SOA middleware
open source SOA middleware
A Java library for complying with the standard Web Robot Exclusion Protocol, robots.txt.
InfraRed is a Information Retrieval system. Its purpose is to allow you to find the information you need from a collection of documents, ignoring the unnecessary details, exactly as if you were taking an infrared picture.
S3B - Social Semantic Search and Browsing - is a middleware that delivers a set of search and browsing components that can be used in J2EE web applications to deliver user-oriented features based on semantic descriptions and social networking
Поисковая система Nero
In this project we like to implement an interface for the open yahoo search service which is called BOSS. Then we will develop multi new search futures based on this API.
XmlTvProducer for PHP is extendable engine to grab tv/radio listings from websites and produce XMLTV output. Data distribution for TV-Browser is included. Primary focus is on Slovak and Czech channels, but the development is open to anybody.
Timedex is a project that uses Wikipedia data to display an indexed and searchable time line of events in an easy-to-use web app.
The aim of OpenLogbooks is to provide "Unified Logbook Management" by providing data backing and a "pluggable", searchable datastore using Hibernate, Spring, SPring Security (formerly ACEGI Security Framework), dojo(toolkit) and other appropriate fra
This is a collection of REST specifications, and implementations of those specs, for very low-level information sharing and workflow operations using REST actions over HTTP. Implementations are in various languages, mainly Java, Python, and Ruby.
This UNIX shell utility reads a the HTML of a webpage and creates an output-stream of it.
PyEsp - Enhanced/Evolving/Extensible Semantic Profiling. This Python program will sort and filter search results by applying semantic profiling on web pages. The program will learn the user preferences and profiling will be done on the client computer.
Directorio comercial de empresas. Version PHP
Massively parallel computing using p2p netowrk.
open-search is a framework to build a p2p web search engine, whereby people mutually form a search engine without the intervention of central servers or a central actor.
MyLinkExchange creates a search engine friendly directory listing of websites posted by users, categorized according to the website type.
KeyWi is the Web-enable Key Words Indexing engine, which allows Site Operator / Owner to forward site visitors with a keyword, instead of going thru the search engines. Written in PHP and MySQL backend. Similar to AOL's Keyword concept.
Web-as-corpus tools in Java. * Simple Crawler (and also integration with Nutch and Heritrix) * HTML cleaner to remove boiler plate code * Language recognition * Corpus builder
This is search engine written in JAVA
Visualizer to facilitate browsing of data provided by Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)-Data Providers
Funnel is a project for use on intranets, or selected sites on the Internet to gather together and index information from several different sources and make it available through a sane, usable interface.
This is a browser/navigator for the FedoraForum,which is based on wxPython. (http://forum.fedoraforum.org/)
Tool for getting data from the Internet. Download all or part of a website to your computer, enabling you to browse the site directly from your hard disk at much greater speeds than if you were to browse the site online
c++ implemented lucene information retrieval index search query distributed