Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Pest Control Management Software
Pocomos is a cloud-based field service solution that caters to businesses
Built for the pest control industry, but also works great for Mosquito Control, Bin Cleaning, Window Washing, Solar Panel Cleaning, and other Home Service Businesses in need of an easy-to-use software that helps you simplify routing, scheduling, communications, payment processing, truck tracking, time tracking, and reporting.
Light workflow engine for developers built in Java. It is designed to be simple, versatile and focused on the developer. It includes a Petri Nets graphical process designer integrated to Eclipse and a query language similar to OQL
Uniwiki is an efficient P2P system for storing distributed wikis, with a typical user interface, but extended to large-scale scenarios transparently. In this project, we provide the customized web server and the Uniwiki web application.
A Workflow Management System for scientific environments written in C++ based on the Petri Nets formalism. It uses the Grid Workflow Description Language (GWorkflowDL) as reference language.
The Simple, Scalable, Script-based Science Processor for Measurements (S4PM) is a system for highly automated processing of science data, scalable up to large processing systems and scalable down to small, special-purpose processing strings.
cids (component integration toolkit for distributed systems) is a toolkit to build and maintain distributed information systems. with cids one can easily build complex distributed metainformation or EAI systems. cids is fully platform-independent.
JaneySpaces is a network abstraction library based on a space of distributed objects. It is designed to fit with traditional Object Oriented designs and principles, and to be as unobtrusive as possible when it comes to the structure of your application.
toksta Server Monitoring is a web-basend frontend to consolidate monitoring tools like Monit etc. and management functions. It contains monitoring for replicated MySQL, Apache and WowzaMedia. It can be configured in a single configuration file.
Cloud data warehouse to power your data-driven innovation
BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.
BigQuery Studio provides a single, unified interface for all data practitioners of various coding skills to simplify analytics workflows from data ingestion and preparation to data exploration and visualization to ML model creation and use. It also allows you to use simple SQL to access Vertex AI foundational models directly inside BigQuery for text processing tasks, such as sentiment analysis, entity extraction, and many more without having to deal with specialized models.
WBEM-based framework for managing providers to produce information according to the OGF GLUE 2.0 information model and to render them in different formats (e.g., XML, LDAP).
NGrid is a transparent C# .Net/mono grid computing model. NGrid abstracts the burden of the grid into a simple multithread and garbage collected programming model. NGrid is designed to carry computationally intensive tasks over multiple machines.
growler is a C++-based distributed object and event architecture.
Its primary application has been in support of interactive, distributed visualization, "concurrent visualization", and computational steering.
Soapod provides an open-source, open-standards Java(TM) application server. It embraces and implements an integrated peer-to-peer, service-oriented architecture based on open standards such as the Web service stack and JSR standards.
A novel Grid System which is Python based and Cell powered. By extending Namespace into GridSpace, any objects are accesable throughout the Grid. And the codes are distributed executed and be JIT compiled into Cell SPE instructions automatically.
Cognitive agent based social simulation toolkit (RBOT+MRS) / production system based on ACT-R (http://act-r.psy.cmu.edu) allows for modelling single actor cognitive experiments (RBOT) and multiple actors in a simulated (semiotic) world (RBOT + MRS).
Peruser is a open-source lightweight knowledge application framework. Peruser speaks XML and RDF and is implemented in Java. Peruser code runs within Cocoon, as a standalone webapp, as console programs invoked through ant, or as embeddable java code.
OSN is an open source open protocol distributed social network. Public key cryptography makes the network resilient to spam. User profiles are based on FOAF XML and users can migrate their profile from one site of the federation to another.
FedStage OpenDSP is an open implementation of SOAP Web Service multi-user access and policy-based job control using OGF DRMAA routines supported by distributed resource management systems like Sun Grid Engine (SGE), LSF, PBSPro, Torque or Condor.
IDEAIS is a enteprise service bus integration plataform for software development tools and activities. It uses Web Services (SOAP/HTTP) to integrate best of the breed software development tools (Eclipse, Subversion, Bugzilla, dotProject, vTiger).
Open Grid Manager (OGM) is a lightweight open source grid management framework that provides a cohesive solution for monitoring and managing arrays of heterogeneous grid resources deployed within live production grids.
pxe bootable, diskless openmosix based cluster node.
tools to manage these nodes, along with skeleton installs for nodes with disk, as well as workstation nodes for starting processes to be migrated
A peer-to-peer object sharing system, based on ICE, which can be applied to several P2P applications. An initial proof of concept will be a distributed web cache for LANs with no central proxy server.
Cube Platform is a decentralized grid computing system that uses P2P Pastry protocol for communication between nodes. It's a big data storage written in Java.