Greenstone is a complete digital library creation, management and distribution package created and distributed by the New Zealand Digital Library Project.
There are two major versions of the software. Greenstone 3 is under active development, and is recommended for download. We also provide maintenance releases for its forerunner, Greenstone 2.
Featured download not what you're looking for? Click "Browse all files" to access binaries and source releases of both versions.
Virtuoso is a scalable cross-platform server that combines Relational, Graph, and Document Data Management with Web Application Server and Web Services Platform functionality.
icfs provides a way for mapping multiple URLs, referencing various web-enabled artifacts, into a single file system presentation under Linux. URLs can reference static pages, REST calls, or web objects in an object store.
Syndicateme.net is an Ajax Atom 1.0 syndication engine. Tell your story, especially if you are a business along Queen St. in Toronto, Canada, or King Street in Waterloo, Canada. Content can be syndicated from a POP mailbox and can use XInclude.
Hypermail is a program that takes a file of mail messages in UNIX mailbox format and generates a set of cross-referenced HTML documents.
Development of hypermail now continues on GitHub: https://github.com/hypermail-project/hypermail
This project aims to build a suite of Natural Language Processing tools. Modules will include corpus indexing and access tools, a part-of-speech tagger, tokenisers, text classification software, etc.
The library quickly searches arbitrary input strings for dictionary words.
Known problems are listed in the relevant section of the documentation. Only ASCII words and strings are supported for now.
HXPath is a command-line tool for extracting data from HTML documents. HXPath can select subtrees, like the standard xpath tool, but can also read contents and attributes and output them in a Bash-friendly format. HTML Tidy and HTTP/HTTPS GET are built in too.
ht://Check is more than a link checker. It's particularly suitable for checking broken links, anchors and web accessibility barriers, but retrieved data can also be used for Web structure mining. Uses a MySQL backend. Derived from ht://Dig.
A function-testing, performance-measuring, site-mirroring web spider that is widely portable and can use scenarios to process a wide range of web transactions, including SSL and forms.
OpenFTS (Open Source Full Text Search engine) is an advanced PostgreSQL-based search engine that provides online indexing of data and relevance ranking for database searching. Close integration with database allows use of metadata to restrict search re...
Sgrep (sorted grep) is a much faster alternative to traditional Unix grep when searching large files, because sgrep searches sorted input files using a fast binary search to find matching lines.
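The idea behind searching sorted input can be sketched in a few lines of Python. This is only an illustration of binary search over sorted lines, not sgrep's actual implementation: the function name `sorted_prefix_search` is hypothetical, and the sketch assumes the lines are already loaded into a sorted in-memory list and that matching means "line starts with the key".

```python
from bisect import bisect_left


def sorted_prefix_search(lines, prefix):
    """Return every line in the sorted list `lines` that starts with `prefix`.

    Assumption: `lines` is sorted in plain code-point order (what sorted() gives).
    """
    # Binary search for the first line that is >= prefix: O(log n) comparisons
    # instead of the O(n) scan that an ordinary grep performs.
    start = bisect_left(lines, prefix)

    matches = []
    # In sorted input, all lines sharing a prefix are contiguous,
    # so we can stop at the first line that no longer matches.
    for line in lines[start:]:
        if not line.startswith(prefix):
            break
        matches.append(line)
    return matches


if __name__ == "__main__":
    words = sorted(["apple pie", "apricot", "banana", "band", "bandana", "cherry"])
    print(sorted_prefix_search(words, "ban"))
```

The speedup comes from the fact that only a logarithmic number of lines is inspected before the first match is located; the trade-off is that the input must be sorted and the search key must be anchored at the start of the line.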
A threaded C application that searches torrent trackers/indexers for .torrent files and sorts the results according to user-defined criteria. Uses glib2.0 and libcurl4.
Remora provides local document search capabilities to the iPhone and iTouch. The project uses the open source search engine Hyper Estraier together with a live search powered by Yahoo!
SYRAH aims to discover and represent concepts expressed in natural languages. Keywords: NLP, lemma, lemma dictionary, Italian, network, semantics, clustering.