Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Teradata VantageCloud Enterprise is a data analytics platform for performing advanced analytics on AWS, Azure, and Google Cloud.
Power faster innovation with Teradata VantageCloud
VantageCloud is the complete cloud analytics and data platform, delivering harmonized data and Trusted AI for all. Built for performance, flexibility, and openness, VantageCloud enables organizations to unify diverse data sources, run complex analytics, and deploy AI models—all within a single, scalable platform.
unique.timestamp provides a universally unique identifier which is also a human-readable timestamp. It includes a command line executable, a set of C routines for embedding into your own code, and some utilities for manipulating unique timestamps.
yadra - yet another distributed rendering application
A Network-Render-Tool for Blender. A Java based, platform-independent tool, for rendering animations of blender in a network. Easy setup (no SMB/CIFS-Connections). Web-Interface.
The Instrument Element is a Grid/Cloud component that provides the computational/data Grid with an abstraction of real instruments, and Users with a more interactive interface to control them. More information: http://instrumentelem.sourceforge.net/
BSPonMPI is an implementation of the BSPlib standard on top of MPI. Both MPI and BSPlib are API's of communication routines meant for parallel computing, but BSPlib is easier to learn and its performance easier to predict.
The SPASE Registry Services is a metadata sharing system for Virtual Observatories. It is a collection of servlets that utilize SPASE compliant resource descriptions to provide access to both the metadata and data.
OpenECP is an open source fork of the Enomaly Elastic Computing Platform (ECP) following its commercialisation. It is a web-based management platform for hypervisors which can be used to create "public" and "private" cloud computing environments.
Tpl makes it easy to serialize your C data using just a handful of API functions. The data is stored in its native binary form for maximum efficiency. C, Perl and XML supported. Data is portable across CPU types and OS's from Unix to Mac to Windows.
Rezku is an all-inclusive ordering platform and management solution for all types of restaurant and bar concepts. You can now get a fully custom branded downloadable smartphone ordering app for your restaurant exclusively from Rezku.
A very lightweight Python program that executes commands on multiple Linux hosts in a user-designated number of threads via SSH. Intended for management for a large number of Linux/Unix nodes
MLN (Manage Large Networks) is a perl program used to manage a set of User-Mode-Linux instances or Xen instances. From a configuration file, it is possible to specify a large complex network of e.g. Xen instances, which are each configured as specified.
NetWorkSpaces, developed by Scientific Computing Associates Inc., provides a framework to coordinate programs written in R. It allows users to write parallel programs in R easily.
NetWorkSpaces, developed by Scientific Computing Associates Inc., provides a framework to coordinate programs written in Python, making it easier to write distributed applications.
The Unified Search Framework is an HPC tool for conducting search optimization and experiment design on High Performance Computing platforms. USF has been designed for platforms containing thousands of processors, and tested to 10,000 CPU cores.
XAMM is an abstraction layer for X11/*nix systems that hides the nature of underlying servers, using SSH trusts to present all applications on the network as if they were local. Provides application grouping, load balancing, logging, and access control.
Fura is a self-contained grid middleware that allows the grid enablement and distribution of applications on heterogeneous computational resources. Fura features a web-based GUI, wizard-guided installation and configuration, and Web Services compliance.
NChord is a C# implementation of the Chord distributed hash table. The project provides a library containing the routing, lookup, and maintenance routines specified in the MIT Chord paper, and is quite stable including under heavy load and churn.
C Library to manage a pool of event/task in a persistent way to assure that your events/tasks won't be deleted because of a failure. Events/tasks are saved on a FS. if FS is NFS, NFS availability checks are made. (comes with a python binding)
Damon is a Distributed AOP Middleware on top of a p2p Overlay Network. Its main aim is to offer the necessary abstractions and services to develop distributed aspects, and to provide scalability to existent or new applications transparently.
A command line utility to read a text file containing lines of data, clean up any CR/LF anomalies, and output the lines of text with clean CR/LF terminators to standard output. The binary is a Windows 32 bit console app.
Acelet-Scheduler with SuperScheduler and SuperWatchdog is a Java task scheduler with daemon and GUI for system and application job scheduling, can schedule tasks in a distributed environment. Holiday adjustable. Works out-box. See http://www.acelet.com
Light Distribution tool is a java applet able to remotely install and deploy software on windows operating systems. Based on a XML file, you can copy, install, register applications and add paths into the window registry.
Jgroup/ARM is a replication framework that supports group method invocations, hides replication from the clients, and partitioning in distributed systems. In addition, the replication management facility supports autonomous replica deployment and recovery
Light workflow engine for developers built in Java. It is designed to be simple, versatile and focused on the developer. It includes a Petri Nets graphical process designer integrated to Eclipse and a query language similar to OQL