Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)
Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
Set of tools and libs for
managing structured data
in a very flexible way:
Imp./Exp. ASCII, XML, SQL,
PS, Tex/LaTex, RTF
GUI: X-Windows, MS-Windows
Interface to C++, DBs, Perl,
PHP, Java, TCP/IP
LISP-like interpreter
written in C++ using C-LIB
A C# Object Persistent Framework.
Allows the users of this framework to write concise code for Business Applications without the need to manually map objects to RDBMs. The Framework will create all the Data Access methods automatically
Orient is 100% Pure ODBMS. It works with classes and objects instead of tables and records. Orient supports C++ language using ODMG 3.0 interface and Java using JDO 1.0.
It's available as Just Edition, a small library, or as Enterprise Edition with dis
POPsearch is a desktop search engine that's designed to help you find
information on your computer. This information can then be accessed remotely with RSS feeds, email feeds, or from any computer that has a web browser.
Chronus is an user-friendly object persistance layer for Java. It supplants the need for any database setup or SQL statements. Futhermore Chronus is designed to work without the need for any user configuration or XML descriptor files. Zero setup time
PureSilk Web System, a set of packages to harness the power of Python for building web applications. Packages include PWS HTTP Server, weblet base class modules, and the PureSilk XML Beans (pxb) package for processing XML in a more Python-esque way.
Tracks project details using a variety of data options. Primarily designed to contain and organize software requirements in a multi-project, multi-user, environment using a networked SQL compliant database management system while tracking changes.
A cross-browser WYSIWYG XML editor for all your (XML) editing needs. Use it for your CMS, your guest-book entry page, or to input your corporate financial records!
The FormattedDataSet Java API is the easiest way to generate dynamic text (HTML, XML, WML, JavaScript,...). The dynamic text is generated by using tabular data as input (SQL statements, ResultSets, 2 dimensional arrays,...).
It's a modern take on desktop management that can be scaled as per organizational needs.
Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.
PyPergamo is a python (and wxPython) application for the management of archives
of parchments. It includes possibility to search via keywords and to show results in various formats (HTML, XML, plain text).
It uses MySQL and includes support for pics.
Object Relational Persistence and Object-Based SQL Query API. Similar to that of Hibernate or JDO but much better structured design. This API has several layers of abstraction to be useful for nearly any situation.
Cheshire3 is a fast Z39.50, SRW, XML search engine, written in Python for extensability and using C libraries for speed. Next generation of the Cheshire system (http://cheshire.berkeley.edu) and designed around a distributable, object oriented model.
Xinity BASE (Business Application Standard Environment) is a Java-based framework that allows easy development and integration of databased rich-applications using xml. a set of modules already ship with the framework like a CMS, HRM & some eGov-funct.
A connector to integrate XMLBeans with Hibernate, providing a model to generate Java data objects and database tables purely from database schemas. This allows for truly XML-centric projects, with support for huge (>100M), complex XML files.
XQDB is a XML-XQuery-Database Management System. Based on PHP and a relational database (SQL). The treestructur is saved by the nestedset-technology. XQDBMS is supporting XPath- and XQuery-expressions.
Tool Suite of GUI's to manipulate data. The first tool is a GUI for XML schemas and manages/stores the xml documents. The other tool is a GUI for Hibernate. It allows users to navigate an ObjectStore and to insert, update, delete, view the objects store
Catalogs your media files (HD & CD-R). Lets you browse, sort, search and organize them once indexed. Features user-friendly GUI, hierarchical categories, ADC (.cat) & XML catalog support, filtered search & statistics, installer. Reads CD labels & mp3ID3
From the XML definition of your database, DewDrop will generate clean and human-readable SQL DDL code for any supported target database. This means you can define your database just once in XML, and have it work for any database automatically.
reppy is a PDF-Report Generator for databases (MySQL, Postgres, CSV) written in Python. The report definition is based on an XML-template, which can be edited with the included program XTRed. It needs the python library reportlab for pdf-creation.
JDBF is a JDBC-based Object-Relational mapping system.It needs a xml configurator file called "repository" where objects are mapped.
It generates SQL for retreiving, saving, and deleting objects.