XSDB XML is to DATA as HTML is to DOCUMENT. Publish and combine data as easily as HTML format and web browsers publish and view documents. Implementations in Python, JavaScript, Java, and C#/.NET.
Monolith (lith) is a tiny, Java-based, embedded file-system-like implementation. It allows you to store and retrieve multiple sets of named data in a single file and access it without the speed costs typically associated with file archives.
E-Nature CD/DVD Archiver lets you easily read all files on any data CD/DVD to a local database. With the integrated search function you can search for files stored on any previously added DVD or CD in your collection.
RSYNC Backup front-end with Web Interface, email reports with attached summary. Simple to use; back up servers with a single line, or your entire server farm from a single machine. Custom email tags sort responses in your MUA. Supports authentication.
ArcAngel is a simple backup utility for programmers comfortable with XML and RegExp. It creates Zip format archives of specified files, with flexible file-selection rules, and pre- and post-backup task execution.
JudoScript is a general-purpose, Java scripting, multi-domain scripting tool/language. It combines the powers of declarative scripting for many modern tasks and general object/procedural programming. It is simple, intuitive, practical and powerful.
JSPackaging is a JavaScript framework that provides simple directives for defining, loading & importing uniquely named packages of JavaScript modules. UPDATE: This project now exists as Ajile at http://sf.net/projects/ajile & http://ajile.iskitz.com.
The DocWay Published Folder portlet allows you to easily embed Documentum content into your JSR-168 compatible portal. The portlet allows countless users to access files with only a single account. It can be extended to support additional data sources.
Quickly and easily set up folder synchronization with a colleague anywhere in the world. Allows you to set up a direct connection through the Internet so that friends, family, or colleagues may access your folders and you theirs.
The JCA 1.5 compliant resource adapter provided by this project grants J2EE 1.4 components access to files hosted on file systems. J2EE components can use it to load and store files on local and remote file systems.
JPackIt is a Java application for packaging a Java project into a single executable package (such as jar, class or exe) containing all Java application resources (code, images, properties, etc.) and referenced libraries.
OS561 is an operating system based around FORTH/Java. The OS is intended to run on a VHDL chip OpenHardware design called the Minon, but could become available for other platforms. The unique point of the design is a revolutionary data compression technology.
Jxtray is a database explorer written in Java. Jxtray can be used to connect to any type of database with suitable JDBC drivers. It can provide you with information regarding your databases and extract data in the following formats: CSV, SQL, HTML, XML.
A platform-independent encryption tool with a simple user interface (Swing/Java). It uses 'Password Based Encryption' (PBE) to encrypt and/or decrypt files on the filesystem. Encrypted files will also be compressed using the zip format.
Develop a Java API (JAR library, with an example web GUI) for content management. Simple but powerful, based on the Apache Lucene project, it would be embedded in projects requiring content management.
A CVS with simple architecture and easy interface. Use it to automatically synchronize a directory or project, or for backup. Works in any OS with Perl. FEATURES: web access; client/server; protocol compression; own chksum; Web Admin.
EBML, or Extensible Binary Meta-Language, is a simple XML-like binary language for describing data in a structured style. EBML was originally designed for use in the Matroska project, but the developers saw that EBML was very flexible and extensible.
idyuts is "I Dare You to Use This Shell"; a pre-hibernate approach to replacing an ORM written with jython functors into a pure-Java language command pattern. The "pipeline codegen artifacts" are simple IoC templates, and trivial to adapt
Data Evolution for Java. Finch is a Data Transformation System based on JDBC and other Java blessed specifications. Finch is based on a component task architecture allowing users to create complex data manipulation jobs.
fuglue (fugu Glue) is a modern Project Management System based on HTTP. In addition to access via a web browser, it also supports WebDAV. The system is installed and operated as a web application on a web server (with Java Servlet support).
Bookmark.inc is a PHP class that provides the essential functions to manage a bookmark. It provides a (multilingual) administrator interface, and allows the personalization of the bookmark visualization through a simple template system.