Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
XML Simple Parser is a parser and generator for xml files which is kept as easy and lightweight as possible. Its intended for use in games or other applications to store data in xml files and for sure also for reading it from there.
AutoSuggest for MooTools provides a simple AJAX JavaScript Auto-Suggest framework for use with MooTools 1.2+ What makes this framework different from the others is that it is built from the ground-up to work with key-value pairs, not just strings.
Blunder is an automated tool for analyzing chained exceptions in Java. It's usefull for classify, generate a customized error message and a list for possible solutions.
The CampusSource Engine Integration Platform (CSE-IP) supports the composition of an integrated information managment in heterogeneous IT environments of educational institutions. It realises a process-oriented link-up of campus information systems.
This project allows PHP programmers to export data into Excel workbook. You can create multiple sheets, use fonts, colors and borders. Exported data is in XML format, recognized by Microsoft Office Excel
Axe Credit Portal - ACP- is axefinance’s future-proof AI-driven solution to digitalize the loan process from KYC to servicing, available as a locally hosted or cloud-based software.
Banks, lending institutions
Founded in 2004, axefinance is a global market-leading software provider focused on credit risk automation for lenders looking to provide an efficient, competitive, and seamless omnichannel financing journey for all client segments (FI, Retail, Commercial, and Corporate.)
MessageReader is a java software that, given a Mail folder of a Symbian phone, takes all the sms messages (NOT (still) mms or anything else) and make a file in the unix mailbox format (suitable for e.g. kmail).
BASIS FOR AN OPENSOURCE PROJECT to convert NoteWorthy Composer files from/to MusicXML. Two Python 3 programs for NoteWorthy Composer files in nwctxt format. 1. nwctxt2xml.py - Converts nwctxt into XML 2. xml2nwctxt.py - Converts XML into nwctxt
Omni is a Java package for reading and writing a variety of data file formats. Currently, Omni supports formats such as CSV, tab-delimited, pipe-delimited, and fixed-width files. There are future plans for more formats.
LaTeX Letterizer Project is a robust opensource PDF document generator application for desktop environments. It uses the dinbrief class written by K.D. Braune and R. Gussmann to produce high quality letters. Visit our enhanced Website below.
DB2RDF is a a software tool that will convert data from relational data model to semantic data model (in the form of RDF and RDFS). A SPARQL endpoint for querying the converted data. For querying the semantic data, SPARQL query language is used.
tkBacktracer is a cross-platform interactive visualization tool to explore the tool motions in RS-274D "G-code" CNC files used in computer controlled milling and manufacturing.
This is an online html form generator written in PHP that can be used for creating forms online. You can add and modify html element attributes. On the left is a tree for the order, in the middle a preview, on the right fields for modifying elements
MIDlet showing status of GPS device connected to cellphone. Show availability of GPS device/signal, current position, speed, course and satellites' position and signal power. Log tracks in GPX format.
This web-application allows business users to design, in a collaborative working envirorment, a Business Process in Business Process Modelling Notation (BPMN 1.0) language and to serialize it in XML Process Definition Language (XPDL 2.0 and 1.0).
The Java Sitemap Parser can parse a website's Sitemap (http://www.sitemaps.org/). This is useful for web crawlers that want to discover URLs from a website that is using the Sitemap Protocol.
This project has been incorporated into crawler-commons (https://github.com/crawler-commons/crawler-commons) and is no longer being maintained.
With OSM Parser it is possible to process OpenStreetMap geodata for various purposes. For example, it can be used to extract streets or other OSM elements for user defined regions to generate a street directory.
Cairo (Complex Archive Ingest for Repository Objects) is a tool for processing digital archives prior to submitting them to archival storage for long-term preservation; among other features, this includes format identification and metadata extraction.
Open XML format for analytical chemistry and biology data.
This project supports ASTM E13.15 "Analytical Data Magement" in its work defining AnIML, an XML standard for representing, managing and interchanging analytical chemistry and biology data.