Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
The Xerlin project is a Java based XML editor that can run on any Java 2 virtual machine. The application is extensible via custom editor interfaces. Xerlin can be used to provide simple, intuitive interfaces for users who know nothing about XML.
Zarbi is a tool to create slideshows. Using a simple XML based input format, it can generates several outputs: HTML (single and splitted), PDF, LaTeX, TXT. Themes are available.
Research Description Language (RDL) is an XML application for describing and publishing scientific research efforts. Research Editor (REd) is a tool for editing RDL documents, and exporting them to LaTeX, PDF, etc.
Piccolo is the fastest SAX parser for Java, supporting SAX1, SAX2, and JAXP (SAX only). Piccolo is different from other parsers in that it was developed using parser generators. It weighs 160K including XML APIs. See http://piccolo.sf.net for more info.
Total Network Visibility for Network Engineers and IT Managers
Network monitoring and troubleshooting is hard. TotalView makes it easy.
This means every device on your network, and every interface on every device is automatically analyzed for performance, errors, QoS, and configuration.
GenDiapo is renamed GenDoc. GenDoc is a XML Editor written in Java2. It is based on a existing project (MerlotXML now called Xerlin), and can use two kinds of plugins (DTD and/or action) for editing and publishing
...The tool suite will provide integrated support for writing screenplays and prose, and developing plot and character. The backend will support many target formats, including XML and PDF.
Project to create a unified FAQ XML format with all applicable software to convert it to various formats, such as multiple forms of HTML, TeX, PDF, text files, etc. Useful for most of "FAQ keepers" on various forums and discussion lists.
XML::Template is an X(HT)ML template processing framework written in Perl designed for constructing web sites and web applications. It provides XML document parsing and caching; scalar, array, nested, and XPath variables; and numerous plugin modules.
NunniMJAX is a minimal java library for parsing XML. The API and functioning recall SAX and is sequential and event-driven. The parser strives to verify that the XML is well-formed, but no validation. NunniMJAX's FSM has been generated using NunniFSMGen
subtex is an XML version of LaTeX, the popular typesetting system. It contains a subset of the latex commands (hence SUBtex) but also contains many improvements, including a fabulous built-in bibliography system. LaTeX tags in XML! Yeah!
The tool FTS2DB is a translator from FTS (pure text with some structuring, and format tags, see bellow) text format to Docbook
(SGML/XML) documents. It was created to provide an easy way to those users who
(temporally) doesn't have a Docbook text forma
It's a tools generating some graphics interfaces for applications in Java language. It's to gain a lot of time while building some windows. The Swings classes are very difficult to use! (especially the Layouts)
We describe the windows content in XML!
Chaperon is a LALR(1) parser, which parse structured text documents and
generate XML documents as output. It includes a parser generator like yacc
and a regex scaner like lex. As input use Chaperon a grammar written in XML.
Splitting an XML file into components of a pure TextArray and the XML markup information allows to examine the text further with simple PCRE matching and easy memorization of results in the XML tree, done here on the basis of glib2 and libpcre.
XML C Parser Generator (xmlcpg) is a xml processor coupled with a flex/bison C parser generator. A DTD can be processed to build a specialized parser for the grammar.