Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
XTR (XML Transformations) is project that allows you to transform, merge and summarize XML files basing on XSD files and mappings. Generates XSD file form XML (XML2XSD). XML editor with formatting capability, XSD visualization using tex, XML Validation.
Basic tools for the MPEG-21 standard. The MPEG-21 standard is a complex framework for exchange of content. This project aims to ease the burden of implementation of basic aspects such as parsing, validation and database mapping.
XMLfacade is a Java library for XML manipulation. This library builds on top of JAXP and DOM APIs available in Java platform and provides thin layer (facade) for easier XML construction and querying. It is inspired by LINQ and functional programming.
amorph is an any-to-XML-to-any data transformation library. Use amorph to read possibly ANY kind of data format required for further processing (csv,fixed length, xml, electronic bills, custom formats, ...) within your application.
Trial Criteria Online Data Entry (trialCODE): a Java-based user interface that codifies eligibility requirements used to automate the screening of potential subjects to clinical trials. Used for caMATCH screening engine on BreastCancerTrials.org site.
The gateway is an open source JavaEE application developed by the Vermont Dept of Taxes. It provides a web services framework for accepting Streamlined Sales Tax registrations and returns. It also includes a web interface for submitting transmissions.
A GUI-based text annotation tool for creating and visualizing annotations. It uses a flexible stand-off XML data format, and has advanced and customizable methods for information and relation visualization.
XTRACT4J V2 is a stand-alone, pure-Java program which creates XML file by dependent or independent SQL queries. It is designed as a drop-in replacement for Oracle Report to generate XML file. It also incorporates BI Publisher to create PDF reports.
It's a modern take on desktop management that can be scaled as per organizational needs.
Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.
Tpl makes it easy to serialize your C data using just a handful of API functions. The data is stored in its native binary form for maximum efficiency. C, Perl and XML supported. Data is portable across CPU types and OS's from Unix to Mac to Windows.
ServingXML is an open source, Apache 2.0 licensed, framework for flat/XML data transformations. It defines an extensible markup vocabulary for expressing flat-XML, XML-flat, flat-flat, and XML-XML processing in pipelines.
Shared Questionnaire System(SQS) is a full-functional Optical Mark Reader(OMR) form processing system implemented in Java-Swing, XSL-FO and AJAX with straightforward GUIs. It is aimed at developing social platform to share knowledge about questionnaire.
Fydji WS is a Web Services creation framework allowing to build services mapped to database operations, Java methods or other sources. It produces a Web application for services publishing and Java/JavaScript/PHP client classes for their invocation.
Small footprint, fast Java XML parser. Parses to an easily manipulable nested class structure which can be converted back to formatted or unformatted XML with a single call. Uses and creates straight XML, no DTDs necessary or used.
The ProM Import Framework allows to extract process enactment event logs from a set of information systems. These can be exported in the MXML format, which is the standard event log data format for Process Mining analysis techniques.
The XNGR XML Browser is an XML browser framework, that provides the functionality to associate Markers, Services and Actions with directories, files, XML documents, and nodes in an XML document. Services included are a simple XML Editor and XML Viewer.
COLLADA Refinery is a GUI-based application for creating content pipelines for 3D assets using modular components called conditioners. Contact a project admin to become a developer!
VINE is a vocabulary Integration Environment Tool, developed at MMI. It allows to create fast mappings and alignments from ontologies in OWL. VOC2OWL is a tool to create ontologies from ASCII files. They are both JAVA based and are delivered as a Rich Cl
The main purpose of this project is to develop a Graphic policy definition editor implemented in Java language and based on the XACML standard which manages access control policies
CookXml is a powerful dynamic XML data binding tool for Java. It is designed to be easily extensible and yet simple to use, with advanced exception handling. CookSwing and CookSwt are tag libraries for CookXml to do XML->GUI (XUL) for Swing and SWT.
ChainBuilder ESB is Java Business Integration (JBI) compliant open source solution for use in Service Oriented Architecture (SOA) environments. ChainBuilder ESB's graphical integration enables new and older applications to easily enter into SOA.
Wow Log Parser is a combat log parser for the game World of Warcraft. The purpose of the program is to parse the files generated with the /combatlog command. The source code can be found on: http://www.gurre.eu/wowlogparser/forum