Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
The Most Powerful Software Platform for EHSQ and ESG Management
Addresses the needs of small businesses and large global organizations with thousands of users in multiple locations.
Choose from a complete set of software solutions across EHSQ that address all aspects of top performing Environmental, Health and Safety, and Quality management programs.
Free and Easy to use tool for scientists to manage the references of their work. Automated Fulltext retrieval from the Open Repositories. Flexible Metadata Model and tranformation of metadata to different standard models. nice print formats of the metada
A backup/transfer files program to move and process files between computers in a production (industrial) process. The processing of file begins when it are created or copied to directory and they can be redirected to other machine to be used.
Process OpenOffice.org Writer Files and transform them to PDF without installing OpenOffice.org
What is PyOpenOffice?
* It is a class library, written in the Python Language.
* It is a platform-independent command-line utility (many abilitie
A XHTML to PDF converter: with this library, you can transform simple XHTML pages to nice and printable PDF files.
This project is based on the excellent webzine article "Pdfizer, a dumb HTML to PDF converter, in C#" written by Jonathan de Halleux.
Full access to Enterprise features. No credit card required.
What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.
Zarbi is a tool to create slideshows. Using a simple XML based input format, it can generates several outputs: HTML (single and splitted), PDF, LaTeX, TXT. Themes are available.
Research Description Language (RDL) is an XML application for describing and publishing scientific research efforts. Research Editor (REd) is a tool for editing RDL documents, and exporting them to LaTeX, PDF, etc.
Piccolo is the fastest SAX parser for Java, supporting SAX1, SAX2, and JAXP (SAX only). Piccolo is different from other parsers in that it was developed using parser generators. It weighs 160K including XML APIs. See http://piccolo.sf.net for more info.
Migma intends to help teachers to get several different multiple-choice exams based on the users original. This command line program quickly generates a number of different new tests (and their respective answer sheets).
A preprocessor for TeX and LaTeX documents, especially to support typing documents in Indian (South Asian) languages using a natural Latin-alphabet input, but producing a correctly typeset native-alphabet result.
GenDiapo is renamed GenDoc. GenDoc is a XML Editor written in Java2. It is based on a existing project (MerlotXML now called Xerlin), and can use two kinds of plugins (DTD and/or action) for editing and publishing
The program is a text editor(in the future a source editor) using C++ and ncurses library, it is made at the first moment for linux, but in the future for other systems too. It is suposed to work like EDIT(DOS EDITOR).
The aim of this project is to provide exportfilters and extensions that enable the popular Star/OpenOffice Suite to be used as a single source publishing application. Initial target formats are PDF and HTML customized for Print, Web and Presentations.