DBPrism is a framework for generating dynamic XML from a database; it provides a high-performance DBGenerator for Cocoon2. It is also a J2EE replacement for Oracle mod_plsql. This project also includes a Restlet-Oracle connector exam. and Lucene Domain In
Ex-Crawler is divided into three subprojects (crawler daemon, distributed GUI client, and web search engine), which together provide a flexible and powerful search engine supporting distributed computing. More information: http://ex-crawler.sourceforge.net
A PHP library/framework for the development of websites. The main features are: database independence, template-driven content, theme-able content generation, integrated WML generation, user content management, Lucene server integration.
Web-as-corpus tools in Java.
* Simple Crawler (and also integration with Nutch and Heritrix)
* HTML cleaner to remove boilerplate code
* Language recognition
* Corpus builder
jBingAPI is a Java library for querying the Microsoft search engine Bing (http://www.bing.com/) through its public API. jBingAPI just makes it a lot easier to communicate with this API.
JeCARS (Java Extendable Contents And Rights System) is a RESTful webservice which delivers pluggable output formats, e.g. Atom feeds or HTML.
Third-party applications can be plugged in.
A JCR (JSR-170) repository (Jackrabbit) is used for storage.
The Javen library is a framework for developing C++ applications simply, with an API similar to the Java library. The Hawk search engine is a software platform used to build vertical search products more easily for mid-sized companies or end users.
A complete suggestion framework for Java, supporting single- and multi-field suggest, a Java suggest box, client/server with Hessian or JSON-RPC, a GWT AJAX suggest box, and phonetic plugins. Proven high performance for data sets > 1 million.
Visualization of the contact network and user data from the popular business network XING.com. The web-based software can be used by any registered XING user.
The search aggregator allows users to initiate searches across multiple applications and receive aggregated results. This project is based on Lucene, written in Java, exposes web and plugin interfaces, and supports the OpenSearch and JSON standards.
Open Source Semantic Web Search Engine Software: If two machines anywhere on the web can agree on the same definition of a digital service or digital good, then machine-to-machine transactions can use this lingua franca to transact on the user's behalf.
OpenMKS is a search and navigation tool for large multimedia collections. With pluggable functionality and a core subsystem supporting the Z39.50 ZING Community SRW search & retrieval specification, it can be run either as a Servlet or as a Web Service.
The Retrieval Component Integrator Project (RECOIN) intends to provide an extensible framework of Java classes to build a meta-search and information retrieval (IR) system based on heterogeneous IR components as part of a modular retrieval process. The so
This forum software is a Java-based discussion forum that uses JDBC to store data in a database. It is available in different languages and has features for easy integration into a site and easy administration of the forum.
The development of this project has ended. Please take a look at Constellio Enterprise Search. Constellio is based on Apache Solr, Apache Tika, and Google Search Appliance connectors. http://www.constellio.com
Aracnis is a Java-based framework for building distributed web spiders. These spiders can be used to accomplish a variety of tasks, such as screen scraping and link integrity checking.
The Informa library provides a convenient Java API for handling news channels and metadata about them. Different syntax formats (RSS 0.91, 1.0, 2.0 and Atom 0.3, 1.0) for feeds are supported. Also support for channel information descriptions (OPML) avail
LIMO stands for Lucene Index Monitor. It is a web application that gives basic information about indexes used by the Lucene search engine (http://lucene.apache.org). It allows you to browse and search the index, and reconstruct stored fields.
This project intends to create an indexing search engine for knowledge management. The primary objective is to apply an information retrieval core and to implement knowledge data discovery theory, such as data mining algorithms and text mining.
SCAM is a development environment for building metadata stores for RDF and the Semantic Web. SCAM is built upon international technology and metadata standards, such as RDF, Dublin Core, IEEE/LOM, and IMS.
VDC has been superseded by DVN: https://sourceforge.net/projects/dvn/ ---- The Virtual Data Center project is building an operational, open-source, digital library to enable the sharing of quantitative research data, and the development of distribute
list2db reads digested email files generated by the mailman mailing list software and converts them into SQL for a relational database. The project also includes a PHP frontend for users to search and browse archived list emails.