Indexing/Search
Showing page 1 of 7.
-
Enterprise
Hibernate Hibernate - Relational Persistence for Idiomatic Java
15,989 weekly downloads -
Hunspell Hunspell is a spell checker and morphological analyzer library and program designed for languages with rich morphology and complex compounding or character encoding. Hunspell interfaces: Curses, Ispell compatible pipe interface, OpenOffice.org UNO module
1,056 weekly downloads -
Enterprise
LogicalDOC Document Management - DMS LogicalDOC is a modern document management system with a nice interface, easy to use and very fast. It uses open source Java technologies such as GWT, Spring, Lucene in order to provide a flexible and scalable DMS solution. http://www.logicaldoc.com
681 weekly downloads -
CLucene - a C++ search engine CLucene is a C++ port of Lucene: the high-performance, full-featured text search engine written in Java. CLucene is faster than lucene as it is written in C++.
341 weekly downloads -
JSpider A Java implementation of a flexible and extensible web spider engine. Optional modules allow functionality to be added (searching dead links, testing the performance and scalability of a site, creating a sitemap, etc ..
261 weekly downloads -
Infinispan High performance distributed in-memory key/value store
16 weekly downloads -
ht://Dig The ht://Dig system is a complete indexing and searching system for a domain or intranet. This system is not meant to replace the need for powerful internet-wide search systems like Lycos, Infoseek, Google and AltaVista.
13 weekly downloads -
Google Search Client Retrieve Google Search results, cached web pages and other services using this Java client.
19 weekly downloads -
Bibliophile Bibliophile is a loose grouping of independent OS or GPL bibliographic systems and aims at promoting discussion, standards and the development of common utilities.
10 weekly downloads -
Contineo Contineo is a Web-based Document Management System (DMS). Features: Folder organization, document Versioning, Bulk import, import from mailbox. NOTE: this project has been DISMISSED in favor of LogicalDOC http://sourceforge.net/projects/logicaldoc
17 weekly downloads -
LIUS (Lucene Index Update and Search) The development of this project has ended. Please take a look to Constellio Enterprise Search. Constellio is based on Apache Solr, Apache Tika, and google search appliance connectors. http://www.constellio.com
17 weekly downloads -
WebSPHINX WebSPHINX is a web crawler (robot, spider) Java class library, originally developed by Robert Miller of Carnegie Mellon University. Multithreaded, tollerant HTML parsing, URL filtering and page classification, pattern matching, mirroring, and more.
3 weekly downloads -
Google Sitemaps Toolbox Google Sitemaps Toolbox (GSToolbox) is a toolbox designed for webmaster to generate, manage and view Google sitemaps files. It is composed of Google Sitemaps Stylesheet (GSStylesheet) and Google Sitemaps Director (GSDirector).
13 weekly downloads -
News Aggregation Library for Java The Informa library provides a convenient Java API for handling news channels and metadata about them. Different syntax formats (RSS 0.91, 1.0, 2.0 and Atom 0.3, 1.0) for feeds are supported. Also support for channel information descriptions (OPML) avail
13 weekly downloads -
Hyper Estraier Hyper Estraier is a full-text search system. It works as with Google, but based on peer-to-peer architecture. Using Hyper Estraier, we can construct a large-scaled search engine with cheap computers.
4 weekly downloads -
Crawl-By-Example (Heritrix plugin) Crawl-By-Example runs a crawl, which classifies the processed pages by subjects and finds the best pages according to examples provided by the operator. Crawl-By-Example is a plugin to the Heritrix crawler, and was done as a part of GSoC06 program.
11 weekly downloads -
Triplify Triplify provides a building block for the semantification of Web applications. Triplify is a small plugin for Web applications, which converts database content into RDF or JSON feeds and provides a Linked Data interface.
11 weekly downloads -
Scientific Searcher Sciense Searcher is a system that lets you search, organize and share bibliographic cites of research articles, books, booklets, collections, manuals, thesis, proceedings, technical reports, unpublished publications and misc.
9 weekly downloads -
Qualipso-A3-A4-XFSearch This project provides cross-forge semantic search for the Qualipso Forge. It integrates A4 AdvDoc prototype (semantic search GUI and engine) with A3 homogeneous and heterogeneous cross-forge semantic search capabilities. See Qualipso.org for details
8 weekly downloads -
contentix - open source cms contentix - open source content management system contentix is a cms and a framework to develop any personalized browser based application. It use xml to store data in media nutral way and xsl to generate output. Check the demowebsite from downloads.
8 weekly downloads -
GoldSeeker data extraction tool GoldSeeker is a small formatted data extraction application. It can parse informations from a text, html or other file, and export it in a database.
7 weekly downloads -
NGramJ Provide a robust and efficient implementation of n-gram based classifiers to Java. N-Gram algorithms have shown to be surprisingly good at tasks like guessing the language/encoding from an arbitrary text file. And there are many more applications.
7 weekly downloads -
iVia iVia is an Internet subject portal or virtual library system. As a hybrid expert and machine built collection creation and management system, resources can be crawled and metadata and selected full-text can be automatically generated/extracted.
7 weekly downloads -
Anywhere Location Search The Anywhere Location Search allows for location searches using a wide range of inputs (address, city/state, zip code, search string, IP address, landmark name, etc).
6 weekly downloads -
Car Show Classifieds Classfieds for cars with mootools, php and mysql, totally in ajax.
6 weekly downloads