Sciense Searcher is a system that lets you search, organize and share bibliographic citations of research articles, books, booklets, collections, manuals, theses, proceedings, technical reports, unpublished works and miscellaneous items.
PHPDynaSite is a free Content Management System written in PHP and MySQL, with Java applets.
It provides many features, such as image resizing, spreadsheet and rich-text editors, logs, and more.
Oxyus is an open-source search engine written in 100% Java that aims to make it easy to add a search button to your website.
Oxyus uses Apache Lucene for indexing, Quartz for scheduling and other interesting software products.
Elvis Digital Library - e-Library with semantics - is a virtual library system based on the J2EE platform, an XML database and, most importantly, semantics. It is a complete solution for storing, presenting and SEARCHING. It is based i.a. on the RDF/DublinC
Lucene Server is a Java server application for easily creating and managing Jakarta Lucene indexes. It is designed to help you integrate Lucene into distributed environments.
A hypertext browser written in Java that filters links (e.g. emails, documents or pictures) out of HTML documents and displays them on screen in hierarchical order.
Users get a quick overview of how a website is put together.
JavaMatch is an engine that can search inside runtime Java data structures and look for the objects that best match the criteria you specify. The extensive query mechanism allows highly customizable tuning of your match queries.
Polish Flexion Engine provides a ready-to-use Polish inflection dictionary with an inflection engine for inflection-aware full-text search, easily integrated into portals, web search and database search engines. The primary target is Polish inflection (pl. polska fleksja). A demo is available on the home page.
The Zaval File Search solution is a local area network tool designed for fast file search on SMB shares and FTP servers.
It supports many features, such as regular expressions and searching by custom or predefined file extensions.
Group-CCS development: components, templates, tools, accessories, tutorials, modules, translations, documentation, code, scripts, and anything else that can improve the work of those who use the powerful development tool CCS (CodeCharge Studio).
Swing-Search tool to effectively search among a list of strings and open a corresponding webpage in a browser.
It was originally designed to quickly search all titles of pages that are stored in a Wiki.
The "Universal Content Evaluation and Categorisation Software" is a program for analysing the content of a website or, more generally, of a text. The text is sorted into dozens of categories, permitting more efficient web searches and information processing.
The goal of this project is to develop a fast, simple, robust and fully JCR (JSR-170) compliant Content Repository on top of a number of RDBMS.
A dual-licensed CMS, Mosaïka-CMS, will be developed on top of this repository by Logyka Technologies.
This project implements a complete enterprise solution based on Lucene. It is a smart engine built to index numerous file formats (pdf, ps, xls, doc, ppt). The engine can index file systems (filtering), databases, mailing folders, web sites and
The LEADERS toolkit is a generic toolset for creating an online environment that integrates EAD finding aids and EAC authority records with TEI transcripts and digitised images of archival material, suitable for a wide variety of archives.
Buzzsearch is a Perl- and MySQL-based SMB/FTP search engine that originated at Georgia Tech. It should run on any UNIX machine with Samba; however, I have only tested it on Linux.
Provides efficient, effective implementations of 32- and 64-bit hash functions based on Rabin fingerprints / irreducible polynomials, in Java. Also provides integration with java.security.MessageDigest API.
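As a rough illustration of the idea behind a Rabin fingerprint, the sketch below treats the input bits as a polynomial over GF(2) and reduces it modulo a fixed irreducible polynomial of degree 64. It is not this project's API: the class name, method name and the choice of polynomial (x^64 + x^4 + x^3 + x + 1) are assumptions made for the example, and a production implementation would normally process whole bytes through precomputed tables rather than one bit at a time.

```java
/** Illustrative sketch only; hypothetical names, not the API of the library described above. */
final class RabinFingerprintSketch {

    // Low-order coefficients of the assumed modulus P(x) = x^64 + x^4 + x^3 + x + 1.
    // The x^64 term is implicit because it does not fit in a 64-bit long.
    static final long POLY_LOW_BITS = 0x1BL;

    /** Returns the message polynomial reduced modulo P(x), one input bit at a time. */
    static long fingerprint(byte[] data) {
        long f = 0L;
        for (byte b : data) {
            for (int i = 7; i >= 0; i--) {
                // If the top bit is set, the shift below produces an x^64 term.
                boolean overflow = (f >>> 63) != 0;
                // Multiply by x and add the next message bit (most significant bit first).
                f = (f << 1) | ((b >>> i) & 1);
                // Replace x^64 with its remainder modulo P(x).
                if (overflow) {
                    f ^= POLY_LOW_BITS;
                }
            }
        }
        return f;
    }
}
```

A call such as `RabinFingerprintSketch.fingerprint(bytes)` returns the same 64-bit value for identical input. Note that leading zero bytes do not change the result in this simplified form, which is one reason practical variants often prepend a fixed non-zero bit or include the length.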
juNK (java useful Net Kollektor) is a J2EE web-container-based application capable of searching and indexing SMB/CIFS shares in local networks. Every SMB/CIFS share (Linux, Windows, ...) is indexed in a database and can be searched via a web frontend.
"girtools" is an implementation of Grid Information Retrieval (GIR). GIR is an emerging open standard for IR on the grid designed to allow dynamic, secure creation and searching of distributed information systems.