Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Dominate AI Search Results
Generative Al is shaping brand discovery. AthenaHQ ensures your brand leads the conversation.
AthenaHQ is a cutting-edge platform for Generative Engine Optimization (GEO), designed to help brands optimize their visibility and performance across AI-driven search platforms like ChatGPT, Google AI, and more.
The Nogis Webserver is a small server with PHP and CGI support. It is very portable and we are aiming to make it in a Class structre for others to use.
Napsack is a specialized multi-threaded client for broadcasting Napster queries across multiple servers; the list of target servers is retrieved from www.napigator.com, and is user-filterable (based on the number of users, files, or gigs indexed).
RealClient tries to provide an extensible way of programming applets using an XML file as the basis of Layout and transferring xml back and forth to a back-end.
Dun and Bradstreet Connect simplifies the complex burden of data management
Our self-service data management platform enables your organization to gain a complete and accurate view of your accounts and contacts.
The amount, speed, and types of data created in today’s world can be overwhelming. With D&B Connect, you can instantly benchmark, enrich, and monitor your data against the Dun & Bradstreet Data Cloud to help ensure your systems of record have trusted data to fuel growth.
Fed up with editing those cryptic XML files? The main goal of the TIDE project is to develop a graphical application wrapping Tomcat, supporting JSP webbuilders.
This project is a Dmoz RDF parser and utilities to allow you to manipulate, display, and navigate the Dmoz RDF data on your web site. It will make use of software at jakarta.apache.org and xml.apache.org to display the data and will attempt to tightly int
Voambolana (pronouce VOO-BOO-LUH-NUH) is an on-line dictionary that converts foreign languages to a native language. Voambolana uses SAX parser and XSLT transformer. The tools used includes Ant, Xerces, Xalan (XNI) and Apache from the Apache Group.
Secure and customizable compute service that lets you create and run virtual machines.
Computing infrastructure in predefined or custom machine sizes to accelerate your cloud transformation. General purpose (E2, N1, N2, N2D) machines provide a good balance of price and performance. Compute optimized (C2) machines offer high-end vCPU performance for compute-intensive workloads. Memory optimized (M2) machines offer the highest memory and are great for in-memory databases. Accelerator optimized (A2) machines are based on the A100 GPU, for very demanding applications.
JATISAT, this project was originaly an academic work, the project has evolve and the main objective has changed, right now the project main objective is to build enterprise tools based on java technology. The project wants to build Entreprise frameworks
A modular, database-backed system for 5 dimensional (5D) analytical biological microscopy and cell-based screening. Please note - we have moved. Please come visit us and download from our new site at http://cvs.openmicroscopy.org.uk
AccessChecker is an open source java application which will check web sites for accessibility problems. Batch checking is supported and reports can be saved or printed for each file.
Firestorm is a WSAP Web Server. The WSAP protocol is an extension of HTTP wich supports file management, RPC, and server events. Firestorm provides a framework for the Java Web Objects components and publishes them on the Internet.
Update: The code from this project has been contributed to the GNU Crypto project.
The Cryptix SASL Library is an implementation of the Java SASL bindings and a number of SASL mechanisms.
This J2EE/Java-based Portal Development Kit is designed to be a tool for web developers to quickly and easily integrate several heterogeneous applications and news sources together into a multi-page JSP portal layout.
The Motosoto Community Portal Server (CoPS). CoPS supports virtual communities through discussions, chat, news, broadcast, library, search, yellow pages, profiles, etc. CoPS runs on Linux using Apache, JBoss, JServ, Jabber and XMl/XSL.
This is a generator that creates PHP 4 code based on a PostgreSQL RDBMS for data entry into any database table or tables with 1-n associations or n-n associations.
The generator now manages projects of more tables and has a complete graphical layout (CSS
Open Story is a news broadcast plus forum system. It has the following functions: submit and manage stories; manage discussion on a story; manage user, and many other features arround news broadcasting such headlines, hot news, break news, etc.
Oroena is a fully Java 2 Enterprise Edition powered Open Source Reverse Proxy engine. It provides unique session, security or server access management for a multi-server architectured portal.
A GUI front-end to the EPP RTK written in Java. Scripts which use the EPP RTK can be written in several scripting languages and executed within the interpreter.
The JSearch Project wants to provide the internet with a Java based generic interface for search engines. It consists of a core interface, search engine adaptors, a sort/merge module and a JSP based GUI.
The PoolMan library and JDBC2.0 Driver and DataSource provide a JMX-based, XML-configurable means of pooling and caching Java objects, as well as extensions for caching SQL queries and results across multiple databases.