Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
Calenco is a Web collaborative platform that enable remote teams of writers, proofreader, graphic designers, translators, etc. to produce together XML documents like user guides, security procedures, etc.
Polyester is a tuple space framework based on the lighTS project, with extensions to support asynchronous and decoupled interaction of agents. The provided example application is a (toy) library of XML documents that can be queried using XPath.
Foxon is a FO emitter/indenter to be used with Saxon. It can indent and prettify XSL-FO output, making it suitable for human inspection and editing, without introducing artefacts that change the layout of the PDF file.
Rapid development of portal-based user interfaces for submitting compute jobs to remote compute resources. It supports the mainstream compute engine vendors Sun Grid Engine, PBS and Condor. Get a job-submission portal up and running in a few hours.
For companies looking to automate their consolidation and financial statement function
The software is cloud based and automates complexities around consolidating and reporting for groups with multiple year ends, currencies and ERP systems with a slice and dice approach to reporting. While retaining the structure, control and validation needed in a financial reporting tool, we’ve managed to keep things flexible.
This is a Java API binding of the IEEE LTSC LOM (Learning Object Metadata) specification. It defines an implementation neutral Java interface for the LOM, as well as providing a default implementation of that interface, and XML marshalling functionality
Metadata.net is a collection of website components and tools for creating and using metadata produced by the eResearch Group at ITEE, University of Queensland.
A C/C++ based client and server implementation of the OGSA Basic Execution Service, used to provide a Web Services interface to distributed resource managers such as Platform LSF and PBS/PBS Pro. The SOAP stack is provided by the gSOAP toolkit.
XML documents To Generated dynamic web application supporting CRUD actions. Credits to Ministry of Culture and Communication, France; UNESCO; Ecole Nationale des Chartes, France; PASS-TECH, France.
VNC for use with the BrowserMob Selenium JavaScript Validator. This tool is made available for users of BrowserMob FREE Website Monitoring and Load Testing. The BrowserMob Local Validation Service can be downloaded from https://browsermob.com/tools.
Spreadsheet-to-RDF wrapper capable of transforming spreadsheets to arbitrary RDF graphs; supports mapping language with powerful formula expressions as you know from Excel, local/remote Excel/ODF/CSV, SPARQL service - source available at https://github.com/theandyl/xlwrap
SAADA transforms a set of heterogeneous astronomical data FITS files in a powerfull database deployed on the Web and including VO features. The database is located on your host and keeps under your full control. This job doesn't require to write cod
The csvdatamix project aims to randomize CSV input data files in order to conceal the original state of the data. Similar to data masking or data transformation. Also has mapping abilities to translate back to the original state of the data.
This is an advanced web scraper with user friendly GUI which let the user define rules and web addresses to extract data from one time or periodically and a target database filed that the data should be saved in.
The goal of this project is to provide a Content Provider plugin for the iQser GIN Platform (www.iqser.com) as a semantic middleware to synchronize and transform data objects of any Social Community supporting the Open Social standard.
Text and basic html renderer. Useful to read text/html files in mp3/mp4 devices using its photo/image viewer . Comes with default settings for a sansa fuze mp4 player. It needs a Perl interpreter, but is provided one in-pack for windows.
JACo XML is a handy, tiny component designed to help you with your work with XMLs in Java. The idea is to see every XML as an object instance and then work with it.