Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Skillfully - The future of skills-based hiring
Realistic Workplace Simulations that Show Applicant Skills in Action
Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic, job-specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.
The Hybrid Ad Hoc Grid is a self-organizing computational grid that offers simplified installation, reduced administrative effort, and fault recovery.
StudiCloud provides a web frontend GUI for an OpenNebula-driven cloud environment. It also adds a backend proxy for using Shibboleth as a user authorization method.
DASM is a new, simple platform that helps programmers implement and evaluate any distributed algorithm. A programmer can implement a distributed algorithm using DASM primitives and monitor the results in diagrams during execution.
Fight skyrocketing paid media costs by turning your customers into a primary vehicle for acquisition, awareness, and activation with Extole.
The platform's advanced capabilities ensure companies get the most out of their referral programs. Leverage custom events, profiles, and attributes to enable dynamic, audience-specific referral experiences. Use first-party data to tailor customer segment messaging, rewards, and engagement strategies. Use our flexible APIs to build management capabilities and consumer experiences, headless or hybrid. We have all the tools you need to build scalable, secure, and high-performing referral programs.
A universal platform for resource discovery and description that shares XML meta-data over existing peer-to-peer (P2P) networks such as Gnutella and JXTA.
CloudBase is a data warehouse system for terabyte- and petabyte-scale analytics. It is built on top of the MapReduce architecture and lets you query flat log files using ANSI SQL. Visit the CloudBase home page for details: http://cloudbase.sourceforge.net
FpgaC compiles a subset of the C language to netlists that can be imported into an FPGA vendor's tool chain. C provides an excellent alternative to VHDL/Verilog for algorithmic expression of FPGA reconfigurable computing tasks. More info in the wiki.
SnowFlock enables high performance computing on virtual machine (VM)-based cloud environments. In SnowFlock, a VM transparently becomes a cluster of VMs by cloning in sub-second time into multiple copies executing on different physical hosts.
Rezku is an all-inclusive ordering platform and management solution for all types of restaurant and bar concepts. You can now get a fully custom branded downloadable smartphone ordering app for your restaurant exclusively from Rezku.
Soaplab is a generator of Web Services that provide programmatic access to command-line (and other) applications on remote computers (an example of its usage is available at http://www.ebi.ac.uk/soaplab/).
A multilingual, PHP-based internet portal that does not use a MySQL database.
It shows the current statistics of your distributed.net team, taken from the log file of your dnet-proxy, along with additional information, including a prognosis.
An application that can connect to and collect data from vPro machines over the network. It uses web services for communication and works with machines that have AMT version >= 3.0.
This project is to build a distributed Key-Value Store (KVS).
* Can survive software or hardware failures.
* Can tolerate failures of entire nodes.
* Can recognize when new storage nodes join the system and make use of them.
Equalizer is the standard middleware to create parallel OpenGL-based applications. Please visit https://github.com/Eyescale for current development information.
Soutei is a trust-management system for access control in distributed systems. Soutei policies and credentials are written in a declarative logic-based language. Soutei policies are modular, concise, and readable, and they support conditional delegation.
Context-awareness is a key concept in ubiquitous computing. The Java Context-Awareness Framework (JCAF) is a Java-based context-awareness infrastructure and programming API for creating context-aware applications.
J2eeBuilder automates JSP<->POJO<->EJB service calls via Component Definitions (XML). It includes utilities for merging (ejb-jar.xml) & packaging component/ear. It is certified as J2EE-Compatible by Sun Microsystems and is the kernel of Ohioedge CRM Se
The BIS-Grid engine is an extension of the UNICORE 6 middleware (www.unicore.eu) that implements WSRF-compliant services to support WS-BPEL-based workflow execution in Service Grids. This project is partly based on the ActiveBPEL engine.
An identity and service provisioning software component that allows service developers to automate the creation, modification, and deletion of users' identity and service subscription information across multiple disparate identity stores.
A project for my thesis at Nanjing University, designed for distributed vision computing. It also provides virtual reality features, including scene construction, editing of animation objects and cameras, and video sampling from them.
*UPDATE*
myHadoop is no longer being updated on SourceForge. For the latest, check it out on GitHub at:
https://github.com/glennklockwood/myhadoop/
myHadoop is a simple system for end-users to provision Hadoop instances on traditional supercomputing resources, without requiring any root privileges. Users may use myHadoop to configure and instantiate Hadoop on the fly via regular batch scripts.
This is a multi-threaded Java program that tries to maximise CPU utilization by running the computational part of the work on separate threads from the file reading and writing part.