Regain is a Java search engine based on Jakarta Lucene. It provides indexing and searching files for plenty of formats (HTML,XML,doc(x),xls(x),ppt(x),oo,PDF,RTF,mp3,mp4,Java). A TagLibrary eases integrating search results in your JSP based web page.
e-DSG Descoberta de Serviço Eletrônico Governamental
GUDDI é uma solução livre desenvolvida com o Framework Demoiselle que implementa o conceito de e-DSG (Descoberta de Serviço Eletrônico Governamental) e segue os padrões do e-PING para auxiliar Entidades Públicas a divulgarem seus serviços.
Data migration/conversion library based on STX and XSLT transformation
Infofuze is a Java library and server application that can be used to transform and combine data from various sources into a specific XML or other text output format that can be stored or indexed.
JuniCoder is a Java project that uses unicode as a base for decoding and encoding formats that invented workarounds to express characters not covered by ASCII. Decoders translate those inventions to unicode. Encoders encode to these inventions.
Java GUI that connects to content providers API such as Google, Bing, Wikipedia and implements a local search engine powered by Lucene, to search different contents: images, videos, articles, files and display them in an ergonomic OpenGL component.
Suzzy Project - Solr Dismax Fuzzy
TestEl is a Java-based learning analyzer for HTML (and possibly other) structured documents. It can be trained to detect structures in such documents and renders hits in XML.