Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.
Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
Try for free
Run applications fast and securely in a fully managed environment
Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.
Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
SAADA transforms a set of heterogeneous astronomical data FITS files in a powerfull database deployed on the Web and including VO features. The database is located on your host and keeps under your full control. This job doesn't require to write cod
This application uses XML files as input to apply a XSLT transformation and export user-readable documents, such as HTML or LaTeX.
Il was originally intended to create Yearbooks based on the already existing files with details about an organisation.
Paradice is a Theme for the popular web-based database administration tool phpMyAdmin. You will find the development version here. All official releases will be made by the phpMyAdmin project.
OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.
Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
Website Content Management System which uses Perl & MySQL to create an XHTML 1.0-compliant CSS-layout website. News, comments, discussion forum, content pages, side blocks, admin panel, anything can be added as a module or side block.
PHP web API designed to simplify object handling(loading, saving, querying, displaying, and editing), abstract the data from its display structure, and layout and allow the target data to be delivered to any supported format without special logic.
SYMPLiK RANGEHOOD is a Javadoc-like tool for Oracle database. This pure-Java program "sucks up" data dictionary and object source code from database and generate document for Tables, Views, Triggers, Packages, Procedures, Functions, and others.
This is a (very) simple chat-room created using php. Instead of using MySQL or any other database software this chat-room uses a flat-file database (created using XML). The main aim of this chat-room script is to be minimal (small codebase).
A lightweight, easy to use PHP content management system (CMS). Written to be fast and to use as little memory as possible. Placeto CMS offers browser and server caching, provides gzip compression and to cut down on bandwidth and CPU time.
Safetica Data Loss Prevention Software for Businesses
For companies that want to protect their sensitive data against leakages, insider threats and want to be compliant with regulations.
Safetica is a cost-effective, easy-to-use Data Loss Prevention (DLP) solution. It performs security audits, prevents sensitive data from leaving your company, and sheds light on what is going on in your organization.
Program converts HTML pages into LaTeX format. Own mappings between HTML tags and character entities can be defined. CSS formatting properties are also supported (including colours). Implemented in Java.
Uses XSLT to generate HTML content and XSL-FO formatting for XEP from XHTML sources. Follows the spirit of W3C specifications. Features content generation such as TOC, footnotes, custom bookmarks, hyperlink abstracts and complex widgets.
With the EpisodeGrabber you can keep track of your favorite TV-Show. It is parsing and collecting informations from all TV-Shows on epguides.com even with Cast Pictures. Make a Watchlist and download directly from isohunt.com
Storm MVC is a php framework based on the model-view-controller design pattern featuring pretty URLs, site themes via inherited master pages, and easy forms processing. It is a mix of the best ideas from Rails, Django and ASP.NET MVC.
ServingXML is an open source, Apache 2.0 licensed, framework for flat/XML data transformations. It defines an extensible markup vocabulary for expressing flat-XML, XML-flat, flat-flat, and XML-XML processing in pipelines.
HTMLtools includes several Java HTML tools for preparing Web pages. The HTMLtools program automates batch conversion of tab-delimited spreadsheet text files to HTML Web-page files, file & table editing, keyword mapping, templates, and more.
An all-in-one authentication with mysql as backend. Features: - Howto/Document - user info - libnss-mysql - pam-mysql - usersql - pdbsql (samba) - radius-mysql - mail
POESIA= Public Opensource Environment for a Safer Internet Access
an opensource Internet content filter (multimodal, mulitlingual) aimed for protection of youth (in schools...); partly funded by the European Commission
A web application built in java to collect snippet of code or items on the web in categories. The application is provided with user registration, snippet moderation, advanced code formatting and ActionScript API for Flash applications integration.
Content adaptation platform that adapts internet content to any mobile device in real time. Includes features such as device detection, image transcoding, DOM manipulation, XML/RSS feed support and much more.
A servlet (or a bunch of servlets) to access a database, perform ddlutils tasks (retrieve, create and update from or to a description). Uses the apache.org ddlutils (and some other libraries).