Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.
Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
Explore 10,000+ tools
AI-based, Comprehensive Service Management for Businesses and IT Providers
Modular solutions for change management, asset management and more
ChangeGear provides IT staff with the functions required to manage everything from ticketing to incident, change and asset management and more. ChangeGear includes a virtual agent, self-service portals and AI-based features to support analyst and end user productivity.
The Citizen Privacy Service is an asynchronous component using artificial intelligence capabilities including DL decidability and first order logic provenance that provide policy decision and policy enforcement points based on the US Privacy Act of 1974.
CarrierClass.net is the Free Software, large-scale, highly-available communications system for the enterprise, higher-education, non-profit, and governmental entities.
Self Organization of an adaptive learning framework for pervasive graph structures from grids, utilities to semantic governance. Business, performance and provisioning assurance are converged with EGA, ITIL, eTOM,SOX, HIPPA, CobiT and AC
Java Hibernation Package is an experimental implementation of Java thread serialization mechanism based on the Java Platform Debugger Architecture (JPDA). It is designed to support serialization of synchronizing multiple threads.
DAT Freight and Analytics operates DAT One truckload freight marketplace
DAT Freight & Analytics operates DAT One, North America’s largest truckload freight marketplace; DAT iQ, the industry’s leading freight data analytics service; and Trucker Tools, the leader in load visibility. Shippers, transportation brokers, carriers, news organizations, and industry analysts rely on DAT for market trends and data insights, informed by nearly 700,000 daily load posts and a database exceeding $1 trillion in freight market transactions. Founded in 1978, DAT is a business unit of Roper Technologies (Nasdaq: ROP), a constituent of the Nasdaq 100, S&P 500, and Fortune 1000. Headquartered in Beaverton, Ore., DAT continues to set the standard for innovation in the trucking and logistics industry.
A framework for building, deploying and managing well-described REST-ful Web services, including REST-ful Web Services realizations for RSS, XML Topic Maps, Structured Arguments, and Workflow.
Programming libraries for writing an XMLDC (XML Distributed Computing) applications in C++. XMLDC is a XML-based protocol for remote procedure calls (RPC) between different systems. XMLDC is more powerful than XML-RPC and much simpler than SOAP or CORBA.
InterMap is an Internet mapping application that allows the user to combine interactive maps from distributed Internet Map Servers in a browser. InterMap supports OpenGIS WMS and ESRI-ArcIMS and can be fully integrated with the GeoNetwork Metadata portal
For companies looking to automate their consolidation and financial statement function
The software is cloud based and automates complexities around consolidating and reporting for groups with multiple year ends, currencies and ERP systems with a slice and dice approach to reporting. While retaining the structure, control and validation needed in a financial reporting tool, we’ve managed to keep things flexible.
SimParEx executes a program(command) on many computers (farm) in parallel and collects the results (task farming). Major features: minimal requirements (TCP, SSH, Perl), flexible task definition, web interface.
GridBlocks builds a grid application framework via easy-to-use building blocks in distributed environment. The framework offers components for Grid security, distributed storage, computing, and Portlet web interfaces.
Gridster - a lightweight grid computing framework written in Java. Utilizes JMS and should work nicely in any standards-based Enterprise Java environment.
The CommGen platform is intended to be a simple, scalable integration platform
for small to medium problem sets. The architecture is simple, consisting of a
distributed kernel, distributed O/S, agent and application layer.
Raindrop is a Java based framework based on SEDA (Staged Event Driven Architecture) principles. It focuses on providing highly concurrent server side throughput by managing and controlling Thread usage and concurrency primitives.
OpenCDN aims to hierarchically chain a set of Streaming Servers, for deliver of Live Streaming contents to millions of viewers. Development is based on the Apple Darwin Streaming Server (DSS) and Linux, but porting to (Helix, WM) is possible.
soapgateway is a software element for the integration of clients and servers in a SOA environment. Roughly, it redirectes SOAP requests targeted to it, to the real webservers that offer the services, applying load-balance and authentication policies
The CredentialCompiler suite compiles the site credentials in grid environments from several LDAP databases. It is possible to feed the credentials in both UNICORE- and Globus-based systems, based on site-specific rules.
Aracnis is a Java based framework for building distributed web spiders. These spiders can be used to accomplish a variety of tasks, for example, screen-scraping and link integrity checking.
FAHWebMon is a web based log analyzer for Folding @ Home Diskless Folding Farms (F@HF). It allows an administrator of such a system to visually see the status of individual work nodes in a given farm.
It is an implementation of Tuple-Space by using MPI library. Tuple-space is a logically shared memory in many parallel computing. It provides the appearance of a shared memory but do not require on underlying physical shared memory.
A project to create a Distributed Relational Database File System for the GNU/Linux platform. The architecture will allow for multiple encryption and/or compression layers as well as providing the ability to complete live backups.
openCP is a fork of vhcs. Short term goals are: feature freeze, bug fix's and a more open developement cycle. Long term aims are: to be a scalable, distributed system to manage components of a HOSTING environment from a central management server.
iROS is a meta-operating system for technology-rich "interactive rooms". The core components (Event Heap, DataHeap, iCrafter) provide communication, data storage, and service management for an iRoom.