Indexing/Search
Showing page 1 of 81.
-
Enterprise
Hibernate Hibernate - Relational Persistence for Idiomatic Java
16,411 weekly downloads -
TouchGraph TouchGraph provides a set of interfaces for graph visualization using force-based layout and focus+context techniques. For now only older code is available, but we are planning to release new versions as well.
36 weekly downloads -
3store 3store is an RDF "triple store", written in C and backed by MySQL and Berkeley DB. It is an optimisation and port of an older triple store (WebKBC). It provides access to the RDF data via RDQL or SPARQL over HTTP, on the command line or via a C API.
5 weekly downloads -
CLucene - a C++ search engine CLucene is a C++ port of Lucene: the high-performance, full-featured text search engine written in Java. CLucene is faster than lucene as it is written in C++.
338 weekly downloads -
OpenSearchServer Powerful search engine and crawler with REST API, PHP/ASP client
189 weekly downloads -
SWISH-Enhanced Search Engine SWISH-Enhanced is a fast, powerful, *flexible*, free, and easy to use system for indexing collections of Web pages or other files. Key features include the ability to limit searches to certain HTML tags (META, TITLE, comments, etc.).
0 weekly downloads -
PyGoogle A Python wrapper for the Google web API. Allows you to do Google searches, retrieve pages from the Google cache, and ask Google for spelling suggestions.
8 weekly downloads -
The Lemur Project The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine and ClueWeb09 dataset.
200 weekly downloads -
LXR Cross Referencer A general purpose source code indexer and cross-referencer that provides web-based browsing of source code with links to the definition and usage of any identifier. Supports multiple languages. Up-to-date information in http://lxr.sourceforge.net
192 weekly downloads -
JavaScript Offline Search An easy but fast search engine based on JavaScript. Ideal for offline documents (e.g. on CDROM or for offline-readable documentations).
0 weekly downloads -
dirLIST - PHP Directory Lister dirLIST displays files and folders in a given HTTP/FTP directory. It has a wonderful interface with choice of Thumbnail or List view along with gorgeous icons for different file types. Includes a sleek gallery, web based mp3 player, file admin + more
104 weekly downloads -
Torrent Search Torrent Search is a cross-platform application, allowing to search for torrent files on different websites. Supported websites are integrated through plugins, which allows to easily extend the number of websites supported.
809 weekly downloads -
Geoportal Server Geoportal Server is a standards-based, open source product that enables discovery and use of geospatial resources including data and services.
84 weekly downloads -
Greenstone Greenstone is a complete digital library creation, management and distribution package created and distributed by the New Zealand Digital Library Project. Click "Browse all files" for the Source versions and the Binaries for other operating systems
923 weekly downloads -
Hunspell Hunspell is a spell checker and morphological analyzer library and program designed for languages with rich morphology and complex compounding or character encoding. Hunspell interfaces: Curses, Ispell compatible pipe interface, OpenOffice.org UNO module
941 weekly downloads -
OpenLink Virtuoso (Open-Source Edition) Virtuoso is a scalable cross-platform server that combines Relational, Graph, and Document Data Management with Web Application Server and Web Services Platform functionality.
190 weekly downloads -
CyberNeko HTML Parser NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
250 weekly downloads -
regain Regain is a Java search engine based on Jakarta Lucene. It provides indexing and searching files for plenty of formats (HTML,XML,doc(x),xls(x),ppt(x),oo,PDF,RTF,mp3,mp4,Java). A TagLibrary eases integrating search results in your JSP based web page.
107 weekly downloads -
Enterprise
LogicalDOC Document Management - DMS LogicalDOC is a modern document management system with a nice interface, easy to use and very fast. It uses open source Java technologies such as GWT, Spring, Lucene in order to provide a flexible and scalable DMS solution. http://www.logicaldoc.com
672 weekly downloads -
WebHarvest - web data extraction tool Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
182 weekly downloads -
DocMGR A full-featured document management system
129 weekly downloads -
PDFBox PDFBox is a Java PDF Library. This project will allow access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. This ships with a utility to take a PDF document and output a text file.
302 weekly downloads -
Large Knowledge Collider This is the official collaborative development environment of the Large Knowledge Collider (LarKC), a platform for massive distributed reasoning that aims to remove the scalability barriers of currently existing reasoning systems for the Semantic Web
17 weekly downloads -
SeerSuite Digital Library Search Engine
22 weekly downloads -
Simple Directory Listing A php application that provides a web-based graphical interface similar to apache directory listing. Functions:copy, move, delete, rename files, etc. For more detail, please go to the official site.
40 weekly downloads