Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Automated RMM Tools | RMM Software
Proactively monitor, manage, and support client networks with ConnectWise Automate
Out-of-the-box scripts. Around-the-clock monitoring. Unmatched automation capabilities. Start doing more with less and exceed service delivery expectations.
Methanol is a scriptable multi-purpose web crawling system with an extensible configuration system and speed-optimized architectural design. Methabot is the web crawler of Methanol.
The project consist in an opensource implementation of the W3C Recommendation of the XML Key Management Specification 2.0 XKMS 2.0. It is compound of a XKMS Server and a Client API to access to the Server via Web Service.
The XNGR XML Browser is an XML browser framework, that provides the functionality to associate Markers, Services and Actions with directories, files, XML documents, and nodes in an XML document. Services included are a simple XML Editor and XML Viewer.
XMPP Web Services for Java (XWS4J) is an implementation of machine to machine communication over XMPP. The communicated content is encoded in XML, according to customized definitions of input and output in W3C XML Schemata.
Say goodbye to broken revenue funnels and poor customer experiences
Connect and coordinate your data, signals, tools, and people at every step of the customer journey.
LeanData is a Demand Management solution that supports all go-to-market strategies such as account-based sales development, geo-based territories, and more. LeanData features a visual, intuitive workflow native to Salesforce that enables users to view their entire lead flow in one interface. LeanData allows users to access the drag-and-drop feature to route their leads. LeanData also features an algorithms match that uses multiple fields in Salesforce.
Content adaptation platform that adapts internet content to any mobile device in real time. Includes features such as device detection, image transcoding, DOM manipulation, XML/RSS feed support and much more.
TexOut convert Latex document to Output documents. TexOut use LateXml to convert document from Latex to xml and then use xslt transformations to create output documents.
Meta-Messager: A project that aims to develop a Semantic Web based framework for semantic annotation of B2B message schemas (e.g. XML, EDIFACT, ASN.1) and translation of B2B messages to and from business ontologies.
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
FCOLLADA based COLLADA plugins for Maya and 3DSMax. Newer plugins with 1.5 support and support the files mentioned in "Papervision3D Essentials" by Paul Tondeur are available at http://opencollada.org . Please use this project for FCOLLADA related topi
A servlet (or a bunch of servlets) to access a database, perform ddlutils tasks (retrieve, create and update from or to a description). Uses the apache.org ddlutils (and some other libraries).
phpBib is PHP-MySQL-only-based literature management for scientific literature. Organise your papers with phpBib and share information with your whole research group. Help yourself with easy import/export, ranking, keywords and comments.
A command line utility to display statistics about a text file consisting of lines of data. The statistics include counts of line terminator pairs (CR, LF, CR+LF) and line counts. Also shows if there is an unterminated trailing line.
The mtconverter convert files from the MT940 format to the OFX format so that bank transactions can be imported in the GnuCash banking software. Successfully tested with ABNAMRO and GnuCash version 2.2.6
A C#/Mono.net 2.0 sealed class that, given most .net objects, will generate a valid XML PropertyList (plist) suitable for easy consumption by a Mac OS X/iPhone application.
A Drupal module that enables Drupal to view and manage objects stored in a Fedora Digital Repository. The Islandora project is no longer maintained on Sourceforge. For the latest download, please visit: http://islandora.ca/
Flexoodb stands for Flexible Object Oriented Database API. Using your choice of RDBMS, Flexoodb represents objects as XML, allows proprietary Serializable objects to be embedded within the XML representation and manipulation using extended classes.
Pheeder is a PHP5 RSS feed generator. It implements the complete RSS 2.0 specification and is extremely easy to use. It also features extensive documentation.
qJerry: write less, do more - now in PHP. qJerry is the PHP class that provides a way to work with XML documents in jQuery-like style. It has methods for creating, modifying, querying XML documents and more. qJerry is jQuery for server side.
The DITA Open Platform is a free, open-source project which goal is to provide an enterprise platform for the edition, management and processing of DITA documents.
Water: a fast, secure, dynamic OO language and database. Water is an all-purpose language (and meta-language) that runs Web applications in the browser or server-side. Water is compatible with .NET, Java, and C on Windows, Linux and Mac OS X system.
Semantic Web pipes is powerful data level mashup tools based on RDF.Semantic Web pipes are defined in XML and when executed they fetch RDF graphs on the Web, operate on them, and produce an RDF output which is itself accessible via a stable URL.