BeeSeek is a project to build a free, open-source search engine based on a peer to peer technology. Code and bug reports are available on https://launchpad.net/beeseek-project
FirteX is a high performance,full-featured text indexing and retrieval platform.It provides a flexible and feasible experiment platform for researchers,as well as a scalable platform for Web search development.It is very fast,and well support for Chi
Export google search result links to file.
Google Mass Search is a small script written in python to get large number(as you need) of urls from google search results of a specified string. It is really simple to use but fast & powerful. You can specify a search string, no. of results filename, and some optional fields. GMS retrieves all the required links in a few seconds and save it to the file. It also eliminates the redundant links. You can also apply filters like links containing a given string or not containing a string. If you know a bit of python programming, you can even customize GMS as you wish.
High Availability Distributed Search Engine
HADSE is a server software for storing indexes in a cluster of server. The goal is to handle high availability by storing indexes on several nodes. HADSE provides RESTFUL APIs to easily populate and request data on your index. Using the powerful cluster APIs you can retrieve the data whatever the node that hosts it. To avoid any single point of failure, it is possible to apply a request to any node of the cluster, there is no master node. HADSE is in active development. A first running version should be available in few weeks.
Data migration/conversion library based on STX and XSLT transformation
Infofuze is a Java library and server application that can be used to transform and combine data from various sources into a specific XML or other text output format that can be stored or indexed.
JuniCoder is a Java project that uses unicode as a base for decoding and encoding formats that invented workarounds to express characters not covered by ASCII. Decoders translate those inventions to unicode. Encoders encode to these inventions.
Java GUI that connects to content providers API such as Google, Bing, Wikipedia and implements a local search engine powered by Lucene, to search different contents: images, videos, articles, files and display them in an ergonomic OpenGL component.
Search Comparator is a web-based platform to compare results of different popular search engines like Google, Bing, Yahoo etc. Visit the project web for details.
Simple, small and fast dictionary lib on C
The library does quick search of dictionary words in arbitrary input strings. Known problems are known. :) See appropriate section in documentation. Also, only ASCII words and strings for now.
Suzzy Project - Solr Dismax Fuzzy
TestEl is a Java-based learning analyzer for HTML (and possibly other) structured documents. It can be trained to detect structures in such documents and renders hits in XML.
Regain is a Java search engine based on Jakarta Lucene. It provides indexing and searching files for plenty of formats (HTML,XML,doc(x),xls(x),ppt(x),oo,PDF,RTF,mp3,mp4,Java). A TagLibrary eases integrating search results in your JSP based web page.
Программа ведения истории принадлежности игроков к кланам в игровом проекте World of Tanks.