Hunspell is a spell checker and morphological analyzer library and program designed for languages with rich morphology and complex compounding or character encoding. Hunspell interfaces: Curses, Ispell compatible pipe interface, OpenOffice.org UNO module
Open Source Intelligence Automation.
SpiderFoot is an open source intelligence automation tool. Its goal is to automate the process of gathering intelligence about a given target, which may be an IP address, domain name, hostname or network subnet. SpiderFoot can be used offensively, i.e. as part of a black-box penetration test to gather information about the target, or defensively, to identify what information your organisation is freely providing for attackers to use against you.
Virtuoso is a scalable cross-platform server that combines Relational, Graph, and Document Data Management with Web Application Server and Web Services Platform functionality.
The stuff here has no documentation and some of it may never be completed. This is my playground; use at your own risk.
A search application to watch and download movies and TV shows
A federated search desktop application to read about, preview, watch, and download any movie or television title that is being shared online.
Open Source Semantic Web Search Engine Software: If two machines anywhere on the web can agree on the same definition of a digital service or digital good, then machine-to-machine transactions can use this lingua franca to transact on the user's behalf.
The Batino Browser is the next generation rich web browser platform. It is based on Eclipse technology.
Indir is a network application designed for server scanning. It can search files with conventional names (used by programmers) for data that may be dangerous. The database presently contains over 2000 records and is constantly growing.
A utility to extract meta-information (properties/comments) out of various file-types; e.g. HTML, PDF, RTF & various Office documents; OGG/MP3 files and JPEG/PNG/GIF images, which can be presented in various output formats (HTML, XML, LaTeX & plain t
Irudiko is a library written in C++ for generating Locality Sensitive Hashing sketches from any text or web document. Mainly designed to work with HTML pages, it also offers optimization support for English or Italian documents.
OpenCLAS is an open source implementation of ICTCLAS (Institute of Computing Tech., Chinese Lexical Analysis System), with three language branches: C++, Java and C#. The library can be used to segment Chinese sentences into words with part-of-speech (POS) tags.
MOVED: http://code.google.com/p/rssfilter/ Selects and/or rejects articles from RSS feeds based on keywords you supply. Runs in your taskbar! Originally intended to filter new content from scientific journals such as PRL or the arXiv (cond-mat, etc.)
Xapian is a Search Engine Library, written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C# and Ruby. Xapian allows you to easily add advanced indexing and search facilities to your applications. See www.xapian.org for more information.
'Yfittu Finds It Thanks To yoU' is a peer-to-peer (p2p) web search engine. It distributes both the searching and the indexing between multiple machines in the network.