Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
AI-based, Comprehensive Service Management for Businesses and IT Providers
Modular solutions for change management, asset management and more
ChangeGear provides IT staff with the functions required to manage everything from ticketing to incident, change and asset management and more. ChangeGear includes a virtual agent, self-service portals and AI-based features to support analyst and end user productivity.
MACOW is a formal and scalable mandatory access implementation suitable on open worlds such as the provided on Semantic Web, Autonomic Computing and Coaltions and Federations scenarios. It is able to access control on distributed systems.
Self-organizing availability-monitoring p2p overlay for large-scale distributed applications. It addresses selfish and colluding nodes. Monitoring relationships are consistent, random, and verifiable. Discovery of monitors is fast and load-balanced.
Open SOA Repository is a JEE application enabling lifecycle management and contract management of your SOA services (SOAP WebServices, BPEL...). It can interact with UDDI and ebXML to publish your services.
Cryopid2 is a development of the excellent Cyropid process freezer for Linux developed initially by Bernard Blackham. Cryopid2 adds a host of functionality to the original package.
Pocomos is a cloud-based field service solution that caters to businesses
Built for the pest control industry, but also works great for Mosquito Control, Bin Cleaning, Window Washing, Solar Panel Cleaning, and other Home Service Businesses in need of an easy-to-use software that helps you simplify routing, scheduling, communications, payment processing, truck tracking, time tracking, and reporting.
The sysinfo-snapshot command generates a compressed HTML file with the detailed hardware and software information for maintenance and troubleshooting, with a simple navigation index. It can be used also as CGI script to generate the report on demand.
PPIV is a parallel software application used in Particle Image Velocimetry (PIV) analysis. The software will compute the vector field for one image pair or multiple image pairs. PPIV can run on a single desktop machine or in a cluster environment.
C Library to manage a pool of event/task in a persistent way to assure that your events/tasks won't be deleted because of a failure. Events/tasks are saved on a FS. if FS is NFS, NFS availability checks are made. (comes with a python binding)
Webclient is a novel client which provides web interface
to the resources provided through UNICORE 6. The Webclient takes advantage
of the UNICORE client API and technologies such as AJAX, JSP,
and JavaScript frameworks.
The CodeTime platform covers every aspect of parallel software from authoring, through distribution, to run-time. Its goals are: high programmer productivity; write once, run high performance anywhere; and wide acceptance.
open-fdc is short for "Open Framework for Distributed Computing". This framework allows developers to create "tasks" which are executed by the engine. A task is similar to a work unit type in the SETI project.
Damon is a Distributed AOP Middleware on top of a p2p Overlay Network. Its main aim is to offer the necessary abstractions and services to develop distributed aspects, and to provide scalability to existent or new applications transparently.
The program `pexec` executes parallely the given command or shell script on the local host or on remote hosts, while some of the execution parameters, namely the redirected standard input, output or error and environmental variables can be varied.
Because we have huge customer demand that we are working hard to support, we have not had the time to make contributions to the open source community in recent months. But this is only a temporary situation – we will be back!
A PHP Client Library for beanstalkd. BeanStalk allows PHP developers to make use of the beanstalkd in-memory workqueue server (http://xph.us/software/beanstalkd).
Intended as a tool for distributing build processes. Automatically forms clusters with other hosts and distributes command line processes. Files are transferred to the remote host on demand. Behaves like distcc, but for almost any type of process.
abiquo platform allows to create intelligent distributed networks interconnecting all kind of devices. We ease the interaction between gadgets, users and organizations through empowering technologies which add intelligence to actual networks.
lysis is working on a features rich home automation system that ranges from CAN-bus, 1-wire and RF up to a HTPC; both hardware and software are addressed to make the really smart home; the domotics will provide energy saving, comfort, flexibility, safety
Acelet-Scheduler with SuperScheduler and SuperWatchdog is a Java task scheduler with daemon and GUI for system and application job scheduling, can schedule tasks in a distributed environment. Holiday adjustable. Works out-box. See http://www.acelet.com
A Java library that simplifies thread development. It has support for continuations, fork/merge, signals, thread and process serialization, event picker (similar to BPEL's pick element), among other features.
A highly modular client remote/web services library written in Python supporting multiple protocols and transports through a unified interface. All modules are as independent as possible from each other to ensure high re-usability.
JOST is high performance JSON Parser and Generator written in C++ and relying on the BOOST Library. The current version is also being extended with a high performance caching service for JSON objects.