CLucene is a C++ port of Lucene, the high-performance, full-featured text search engine written in Java. CLucene is faster than Lucene because it is written in C++.
zfind is a fast file search utility/API for searching within archives.
Developers, and ordinary users too, often need to find a resource that is inside a zip file. Worse, that resource may be inside a zip that is itself inside another zip (a nested archive). Perhaps you need to verify whether a particular resource is present in an EAR, or you know part of a file name but not which zip/jar on your hard disk contains that class file. zfind solves this problem. It is very fast and works on most common archive and compressed file formats. Besides a GUI and a command-line interface, it provides an API that lets you use it in your Java code as a third-party library. Thanks for using it; if you find a bug, please post a ticket or a discussion topic. I can't fix anything if I don't know the problem :) Check the Wiki for usage details.
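The nested-archive search described above can be illustrated with a minimal sketch. This is not zfind's code or API (zfind is a Java tool); it is a Python illustration of the general technique using the standard zipfile module. The function name find_in_archive, the "!/" path separator, and the suffix list are assumptions made for this example.

```python
import io
import zipfile

# Assumed set of suffixes to treat as nested archives (illustrative only).
ARCHIVE_SUFFIXES = (".zip", ".jar", ".war", ".ear")


def find_in_archive(source, pattern, prefix=""):
    """Yield entry paths whose name contains `pattern` (case-insensitive).

    `source` is a file path or a binary file-like object. Nested archives
    are read into memory and searched recursively; their entries are
    reported as "outer.zip-entry!/inner-entry".
    """
    with zipfile.ZipFile(source) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            path = prefix + info.filename
            if pattern.lower() in info.filename.lower():
                yield path
            if info.filename.lower().endswith(ARCHIVE_SUFFIXES):
                nested = io.BytesIO(archive.read(info))
                # Skip entries that carry an archive suffix but are not zip data.
                if zipfile.is_zipfile(nested):
                    nested.seek(0)
                    yield from find_in_archive(nested, pattern, path + "!/")
```

Calling list(find_in_archive("app.ear", "MyClass.class")) would list every matching entry, including those inside jars packed within the EAR. Reading nested archives fully into memory keeps the sketch short but is a poor fit for very large archives.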
SCAN (Smart Content Aggregation and Navigation) is a universal semantic content aggregator. It combines search, text analysis, tagging, and metadata functions to provide a new user experience of desktop navigation and document management.
Datafari is an advanced open source enterprise search solution!
TiTLi: A Google-like, full-text search tool for relational databases. It is very fast and independent of vendor-specific RDBMSs such as MySQL and Oracle. It uses Apache Lucene for indexing and provides some advanced features such as similarity search.
BeanQuery is a Java solution for querying arbitrary collections of arbitrary object types through a criteria-like API in a declarative and type-safe manner.
A search system that aims to automatically provide summaries on requested topics.
Dessy is a DEsktop Search and SYnchronization framework for mobile devices. It allows file search using content, metadata, and custom tags, as well as synchronization of results and of entire file system subtrees. It works on MIDP 2.0 handsets and Java 2 SE.
Looks at the file names in a directory and finds the parts they have in common, in order to spot similar and repeated files. Useful when you have multiple files that differ in checksum and slightly in name. You can collect them and decide what to do with them afterwards.
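One way to find common parts in file names is to compare names pairwise and keep pairs whose longest shared substring is long enough. The sketch below is an assumption about how such a tool could work, not this project's actual algorithm; the names common_part and similar_names and the min_length threshold are hypothetical.

```python
from difflib import SequenceMatcher
from itertools import combinations
from pathlib import Path


def common_part(a, b):
    """Return the longest substring shared by strings a and b."""
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    match = matcher.find_longest_match(0, len(a), 0, len(b))
    return a[match.a:match.a + match.size]


def similar_names(names, min_length=5):
    """Return (name_a, name_b, shared) for each pair of file names whose
    stems (names without the final extension) share at least `min_length`
    consecutive characters, ignoring case."""
    results = []
    for a, b in combinations(sorted(names), 2):
        shared = common_part(Path(a).stem.lower(), Path(b).stem.lower())
        if len(shared) >= min_length:
            results.append((a, b, shared))
    return results
```

Passing the names from os.listdir() of a directory gives the candidate pairs to review. The pairwise comparison is quadratic in the number of files, which is acceptable for a single directory but not for a whole disk.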
Hyper-M is a Bluetooth-based DHT peer-to-peer infrastructure for J2ME (CLDC 1.1/MIDP 2.0) enabled handphones. Hyper-M allows the user to create a peer-to-peer network and to share and retrieve files on this network. It has been tested mainly on Nokia handphones.
The Java API search engine will allow users to search for Java classes that may be useful in the development of their current project.
Lucino is a C library with some support for reading and writing Apache Lucene indices. It currently comes with PHP and Python bindings.
Open Enterprise Search Platform. The project aims to deliver a complete Open Enterprise Search Platform by packaging several freely available components together. The core component is Apache Solr, which is based on Lucene.
Paluxy is an application that makes file metadata available for very fast searching. It's intended to be something like Google Desktop Search, but it also allows you to store custom fields (for example, the author or ISBN of a PDF/CHM book).
This project is abandoned. Download these files at your own risk. For the latest version please visit http://decafbad.net/projects/piddlepodder/ This application is a podcatching client written in Perl. It is modeled on the simplicity of bashpodder, with a few added features. It downloads podcasts from a list of XML feeds and puts them into a download directory by date.
The goal of this project is to provide an RSS feed Content Provider plugin for the iQser GIN Semantic Middleware (www.iqser.com). It is an easy way to integrate news sources for smart mashups.
A Solr-based Semantic MediaWiki store.
The SSAF ("Secure Search And Forwards") is a dirt-simple standalone web app for inexpensive and secure information sharing. Any uploaded record may be forwarded to an intended destination, and may also be stashed in a searchable repository.
(Project is discontinued.) Seekafile Server is a Windows Service that indexes DOC, PDF, XLS, PPT, RTF, HTML, TXT, XML, and other file formats in the background. It includes a sample desktop search application and a sample ASP.NET search application.
Lucene/Solr-based search engine and workflow system.
Important: This project has been moved to https://github.com/statsbiblioteket/summa/ Summa is a Lucene (and Solr) based search engine with a very flexible setup and workflow system. It supports incremental updates, hierarchical faceting, and index lookup with low memory overhead. Note: Although Summa is open source, the focus is on features used at Statsbiblioteket. No explicit resources have been allocated for support of external users.
Clucened is a project to build a daemon around CLucene, which is a C++ implementation of the Lucene search engine. This is *not* the CLucene project, but is a separate project to write a generic daemon based on CLucene.
Find-extension tool: lists all file extensions found in a directory.
fsearch is a desktop search tool for Linux.
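Listing the extensions in a directory amounts to counting file suffixes. A minimal sketch follows, assuming only the last suffix counts (so "b.tar.gz" is reported as ".gz"); the function names are hypothetical and not taken from this tool.

```python
from collections import Counter
from pathlib import Path


def count_extensions(names):
    """Count lower-cased final suffixes; names without one count under ''."""
    return Counter(Path(name).suffix.lower() for name in names)


def list_extensions(directory, recursive=False):
    """Count the extensions of the regular files in `directory`."""
    pattern = "**/*" if recursive else "*"
    files = (p.name for p in Path(directory).glob(pattern) if p.is_file())
    return count_extensions(files)
```

For example, sorted(list_extensions(".")) gives the distinct extensions in the current directory, and the counts show how many files carry each one.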
iMeMex is a dataspace management system. iMeMex is a research prototype. The package also provides several useful components for research such as external sorting, B+-trees, inverted indexes, content converters, query operators, and graph indexes.